Best
17.5K views | +0 today
Best
Best or Top of a Category
Your new post is loading...
Your new post is loading...
Scooped by Dr. Stefan Gruenwald
Scoop.it!

28 Jupyter Notebook tips, tricks and shortcuts

28 Jupyter Notebook tips, tricks and shortcuts | Best | Scoop.it

Jupyter Notebook is a powerful tool for data analysis. Here is a collection of 28 tips, tricks, and shortcuts to turn you into a Jupyter power user!

No comment yet.
Scooped by Dr. Stefan Gruenwald
Scoop.it!

Python Virtual Environments: A Primer

Python Virtual Environments: A Primer | Best | Scoop.it
This article details how to use a Python virtual environment to manage your Python projects.
No comment yet.
Scooped by Dr. Stefan Gruenwald
Scoop.it!

Python Tutorial on List Comprehension With Examples

Python Tutorial on List Comprehension With Examples | Best | Scoop.it
No comment yet.
Scooped by Dr. Stefan Gruenwald
Scoop.it!

A Guide to Scientific Python Plotting With Matplotlib

A Guide to Scientific Python Plotting With Matplotlib | Best | Scoop.it

This article is a beginner-to-intermediate-level walkthrough on Python and matplotlib that mixes theory with examples.

 

A picture says a thousand words, and with Python’s matplotlib library, it fortunately takes far less than a thousand words of code to create a production-quality graphic.

 

However, matplotlib is also a massive library, and getting a plot to look “just right” is often practiced on a trial-and-error basis. Using one-liners to generate basic plots in matplotlib is fairly simple, but skillfully commanding the remaining 98% of the library can be daunting.

 

This guide is a beginner-to-intermediate-level walkthrough on matplotlib that mixes theory with example. While learning by example can be tremendously insightful, it helps to have even a surface-level understanding of the library’s inner workings and layout as well.

No comment yet.
Scooped by Dr. Stefan Gruenwald
Scoop.it!

Jupyter is where humans and data science intersect

Jupyter is where humans and data science intersect | Best | Scoop.it

Recent beta release of JupyterLab embodies the meta-theme of extensible software architecture for interactive computing with data. While many people think of Jupyter as a “notebook,” that’s merely one building block needed for interactive computing with data. Other building blocks include terminals, file browsers, LaTeX, markdown, rich outputs, text editors, and renderers/viewers for different data formats. JupyterLab is the next-generation user interface for Project Jupyter, and provides these different building blocks in a flexible, configurable, customizable environment. This opens the door for Jupyter users to build custom workflows, and also for organizations to extend JupyterLab with their own custom functionality.

 

Thousands of organizations require data infrastructure for reporting, sharing data insights, reproducing results of analytics, etc. Recent business studies estimate that more than half of all companies globally are precluded from adopting AI technologies due to a lack of digital infrastructure — often because their efforts toward data and reporting infrastructure are buried in technical debt. So much of that infrastructure was built from scratch, even when organizations needed essentially the same building blocks. JupyterLab’s primary goal is to make it routine to build highly customized, interactive computing platforms, while supporting more than 90 different popular programming environments.

 

A major theme builds on top of the other two: computational communication. For data and code to be useful for humans, who need to make decisions, it has to be embedded into a narrative — a story — that that can be communicated to others. Examples of this pattern include: data journalism, reproducible research and open science, computational narratives, open data in society and government, citizen science, and really any area of scientific research (physics, zoology, chemistry, astronomy, etc.), plus the range of economics, finance, and econometric forecasting.

Another growing segment of use cases involves Jupyter as a “last-mile” layer for leveraging AI resources in the cloud. This becomes especially important in light of new hardware emerging for AI needs, vying with competing demand from online gaming, virtual reality, cryptocurrency mining, etc.

 

Take the following as personal opinion, observations, perspectives: We’ve reached a point where hardware appears to be evolving more rapidly than software, while software appears to be evolving more rapidly than effective process. O’Reilly Media work to map the emerging themes in industry, in a process nicknamed “radar”. This perspective about hardware is a theme I’ve been mapping, and meanwhile comparing notes with industry experts. A few data points to consider: Jeff Dean’s talk at NIPS 2017, “Machine Learning for Systems and Systems for Machine Learning” about comparisons of CPUs/GPUs/TPUs, and how AI is transforming the design of computer hardware; The Case for Learned Index Structures, also from Google, about the impact of “branch vs. multiple” costs on decades of database theory; this podcast interview “Scaling machine learning” with Reza Zadeh about the critical importance of hardware/software interfaces in AI apps; the video interview that Wes McKinney and I recorded at JupyterCon 2017 about how Apache Arrow presents a much different take on how to leverage hardware and distributed resources.

No comment yet.
Scooped by Dr. Stefan Gruenwald
Scoop.it!

Top 10 Videos on Deep Learning in Python

Top 10 Videos on Deep Learning in Python | Best | Scoop.it

Video Tutorials for you!

No comment yet.
Scooped by Dr. Stefan Gruenwald
Scoop.it!

Practical Machine Learning with R and Python (6 Parts)

Practical Machine Learning with R and Python (6 Parts) | Best | Scoop.it

This is the final and concluding part of my series on ‘Practical Machine Learning with R and Python’. Included are Machine Learning algorithms in R and Python. The algorithms implemented are:

  1. Practical Machine Learning with R and Python – Part 1 The student will learn regression of a continuous target variable. Specifically Univariate, Multivariate, Polynomial regression and KNN regression in both R and Python.
  2. Practical Machine Learning with R and Python – Part 2  The Focus is on Logistic Regression, KNN classification and Cross Validation error for both LOOCV and K-Fold in both R and Python.
  3. Practical Machine Learning with R and Python – Part 3 This 3rd part includes feature selection in Machine Learning. Specifically, best fit, forward fit, backward fit, ridge(L2 regularization) & lasso (L1 regularization). It contains equivalent code in R and Python.
  4. Practical Machine Learning with R and Python – Part 4 In this part, SVMs, Decision Trees, Validation, Precision-Recall, AUC and ROC curves are being discussed.
  5. Practical Machine Learning with R and Python – Part 5  This part touches upon B-splines, natural splines, smoothing splines, Generalized Additive Models (GAMs), Decision Trees, Random Forests and Gradient Boosted Trees.
  6. Practical Machine Learning with R and Python - Part6 This last part covers Unsupervised Machine Learning, specifically the implementations of Principal Component Analysis (PCA), K-Means and Heirarchical Clustering. The R Markdown file can be downloaded from Github.
No comment yet.
Scooped by Dr. Stefan Gruenwald
Scoop.it!

A Complete Beginner's Guide to Django

A Complete Beginner's Guide to Django | Best | Scoop.it
A Django tutorial series for complete beginners. A comprehensive guide covering all the basic aspects of Django models, views, templates, testing, admin.

 

Learn Django

No comment yet.
Scooped by Dr. Stefan Gruenwald
Scoop.it!

Top iPython Tutorials for Data Science and Machine Learning

Top iPython Tutorials for Data Science and Machine Learning | Best | Scoop.it
The 11 IPythonTutorials
  • Example Machine Learning - Notebook by Randal S. Olson, supported by Jason H. Moore. University of Pennsylvania Institute for Bioinformatics
  • Python Machine Learning Book - 400 pages rich in useful material just about everything you need to know to get started with machine learning ... from theory to the actual code that you can directly put into action!
  • Learn Data Science - The initial beta release consists of four major topics: Linear Regression, Logistic Regression, Random Forests, K-Means Clustering
  • Machine Learning - This repo contains a collection of IPython notebooks detailing various machine learning algorithms. In general, the mathematics follows that presented by Dr. Andrew Ng's Machine Learning course taught at Stanford University (materials available from ITunes U, Stanford Machine Learning), Dr. Tom Mitchell's course at Carnegie Mellon, and Christopher M. Bishop's "Pattern Recognition And Machine Learning".
  • Research Computing Meetup - Linux and Python for data analysis (tutorials). University of Colorado, Computational Science and Engineering.
  • Theano Tutorial - A brief IPython notebook-based tutorial on basic Theano concepts, including a toy multi-layer perceptron example..
  • IPython Theano Tutorials - A collection of tutorials in ipynb format that illustrate how to do various things in Theano.
  • IPython Notebooks - Demonstrations and use cases for many of the most widely used "data science" Python libraries. Implementations of the exercises presented in Andrew Ng's "Machine Learning" class on Coursera. Implementations of the assignments from Google's Udacity course on deep learning.

 

For more articles about IPython for Data Science and Machine Learning, click here

 

DataScience and Machine Learning Resources

 

Additional Reading

No comment yet.
Rescooped by Dr. Stefan Gruenwald from Everything is related to everything else
Scoop.it!

Try Python in Your Browser With These Free Online Interactive Shells

Try Python in Your Browser With These Free Online Interactive Shells | Best | Scoop.it
Whether you're going through these Python examples or reviewing the basics of arrays and lists, you can test the code right in your browser. Here are the best online Python interpreters we've found.

Via Fernando Gil
No comment yet.
Scooped by Dr. Stefan Gruenwald
Scoop.it!

14 Best Python Pandas Features

14 Best Python Pandas Features | Best | Scoop.it
Bridging the gap between technology and business
No comment yet.
Rescooped by Dr. Stefan Gruenwald from Nostri Orbis
Scoop.it!

PyVideo.org · Swiss Python Summit 2016

PyVideo.org · Swiss Python Summit 2016 | Best | Scoop.it

Newest developments from the python camp.


Via Fernando Gil
No comment yet.
Scooped by Dr. Stefan Gruenwald
Scoop.it!

Machine Learning iPython Notebooks

Machine Learning iPython Notebooks | Best | Scoop.it
This is the personal website of a data scientist and machine learning enthusiast with a big passion for Python and open source. Born and raised in Germany, now living in East Lansing, Michigan.
No comment yet.
Scooped by Dr. Stefan Gruenwald
Scoop.it!

Brian Granger, Chris Colbert & Ian Rose - JupyterLab+Real Time Collaboration

At their PyData Seattle talk on Jupyter Lab, the authors demonstrate opening a 1 trillion row by 1 trillion column csv (and effortlessly scrolling left and right across the columns), as well as realtime collaboration using the Jupyter Lab Google Drive extension, OOTB Vega and GeoJSON compatibility, and plenty of other incredible features.

 
No comment yet.
Scooped by Dr. Stefan Gruenwald
Scoop.it!

UCSD Analyzed 1 Million Jupyter Notebooks — Now You Can Too

UCSD Analyzed 1 Million Jupyter Notebooks — Now You Can Too | Best | Scoop.it

In July 2017, the Design Lab at UC San Diego scraped and analyzed over 1 million Jupyter Notebooks from GitHub. They are making these data publicly available for everyone to explore! While only a snapshot of one corner of the Jupyter universe, these data provide unique perspective into how people use and share Jupyter Notebooks.

 

The collection includes over 1 million notebooks as well as metadata about the nearly 200,000 repositories where they lived. The full dataset is nearly 600GB so we have created a smaller 5GB sampler dataset for you to get started. This includes roughly 6,000 notebooks from 1000 repositories.

 

They originally collected these data to explore how people use narrative text in Jupyter Notebooks. The UCSD team found many notebooks, even those accompanying academic publications, had little in the way of descriptive text. This is likely because many analysts view their notebooks as personal and messy works-in-progress. On the other hand, many of the notebooks they collected were masterpieces of computational narrative, elegantly explaining complex analyses (one notebook even had more text than The Great Gatsby). The UCSD team members think this spread reflects a tension between data exploration, which tends to produce messy notebooks, and process explanation, in which analysts clean and organize their notebooks for a particular audience.

No comment yet.
Scooped by Dr. Stefan Gruenwald
Scoop.it!

Basic data analysis on Twitter with Python

Basic data analysis on Twitter with Python | Best | Scoop.it

After creating the Free Wtr bot using Tweepy and Python and this code, the author wanted a way to see how Twitter users were perceiving the bot and what their sentiment was. So he created a simple data analysis program that takes a given number of tweets, analyzes them, and displays the data in a scatter plot.

 

In order to create this, you have to install a few packages, including  Tweepy , Tkinter , Textblob and  matplotlib . These packages can be installed using the pip package manager.

No comment yet.
Scooped by Dr. Stefan Gruenwald
Scoop.it!

PyTorch, Dynamic Computational Graphs and Modular Deep Learning

PyTorch, Dynamic Computational Graphs and Modular Deep Learning | Best | Scoop.it

Deep Learning frameworks such as Theano, Caffe, TensorFlow, Torch, MXNet and CNTK are the work horses of Deep Learning work. These frameworks as well as the GPU (predominantly Nvidia) are the what enables the rapid growth of Deep Learning. It was refreshing to hear Nando de Freitas acknowledge their work in the recently concluded NIPS 2016 conference. Infrastructure does not get enough of the recognition it deserves in the academic community. Yet, programmers toil on to continually tweak and improve their frameworks.

 

Recently, a new framework was revealed by Facebook and a bunch of other partners (Twitter * NVIDIA * SalesForce * ParisTech * CMU * Digital Reasoning * INRIA * ENS). PyTorch came out of stealth development. PyTorch is an improvement over the popular Torch framework (Torch was a favorite at DeepMind until TensorFlow came along). The obvious change is the support of Python over the less often used Lua language. Almost all of the more popular frameworks use Python, so it is a relief that Torch has finally joined the club.

No comment yet.
Scooped by Dr. Stefan Gruenwald
Scoop.it!

66 machine learning lectures [2017-2018], including deep learning

66 machine learning lectures [2017-2018], including deep learning | Best | Scoop.it
No comment yet.
Rescooped by Dr. Stefan Gruenwald from Deep_In_Depth: Deep Learning, ML & DS
Scoop.it!

Implementing a Neural Network from Scratch in Python – An Introduction

Implementing a Neural Network from Scratch in Python – An Introduction | Best | Scoop.it

In this post we will implement a simple 3-layer neural network from scratch. We won’t derive all the math that’s required, but we will try to give an intuitive explanation of what we are doing.

 

Learners should be familiar with basic Calculus and Machine Learning concepts, e.g. know what a classification and a regularization is. Ideally students should also know a bit about how optimization techniques like gradient descent work.


Via Eric Feuilleaubois
No comment yet.
Rescooped by Dr. Stefan Gruenwald from Deep_In_Depth: Deep Learning, ML & DS
Scoop.it!

5 Best Deep Learning with Python videos for a Beginner

5 Best Deep Learning with Python videos for a Beginner | Best | Scoop.it
Deep learning is the most interesting and powerful machine learning technique. It is the root of the most enthralling and amazing features that we access today which covers a wide range of areas like robots, image recognition, NLP and artificial intelligence, text classification, text-to-speech and many more. It is also the technology behind widely used features provided by Facebook i.e. tagging each other in pictures or be Google's self-driving cars or speech recognition.

Python is considered to be the most popular and fast-growing language for deep learning. It is also fully featured general purpose programming language with famous deep learning libraries like Theano and TensorFlow.

Via Eric Feuilleaubois
No comment yet.
Scooped by Dr. Stefan Gruenwald
Scoop.it!

A Dramatic Tour through Python’s Data Visualization Landscape (including ggplot and Altair)

A Dramatic Tour through Python’s Data Visualization Landscape (including ggplot and Altair) | Best | Scoop.it

This somewhat witty and detailed walkthrough will help you explore the difference between the major data visualization tools in the Python ecosystem — including some options that were ported from R!

No comment yet.
Scooped by Dr. Stefan Gruenwald
Scoop.it!

10+ Best Python IDES for Software Development

10+ Best Python IDES for Software Development | Best | Scoop.it

Python is an interpreted, object-oriented, high-level programming language with dynamic semantics. Using third-party tools, Python code can be packaged into standalone executable programs (such as Py2exe, or Pyinstaller). Python interpreters are available for many operating systems. Programmers often fall in love with Python because of the increased productivity it provides. Since there is no compilation step, the edit-test-debug cycle is incredibly fast.

 

If you are going to develop software using Python, please choose an Best Python IDE(Integrated Development Environment). In this page we have collected some really good Integrated Development Environments for Python, which provides you a convenient environment to code, edit, test, and debug applications written in Python. Let’s have a look on all of them, one by one.

No comment yet.
Scooped by Dr. Stefan Gruenwald
Scoop.it!

Best Jupyter Notebooks

Best Jupyter Notebooks | Best | Scoop.it

A Jupyter Notebook is a web application that allows you to create and share documents that contain live code, equations, visualizations and explanatory text. Uses include: data cleaning and transformation, numerical simulation, statistical modeling, machine learning and much more.

Click here to edit the content

No comment yet.
Scooped by Dr. Stefan Gruenwald
Scoop.it!

PyVideo.org · SciPy - 5 Years of Python Lectures and Tutorials [Videos]

PyVideo.org · SciPy - 5 Years of Python Lectures and Tutorials [Videos] | Best | Scoop.it

SciPy 2015, the fourteenth annual Scientific Computing with Python conference, will be held this July 6th-12th in Austin, Texas. SciPy is a community dedicated to the advancement of scientific computing through open source Python software for mathematics, science, and engineering. The annual SciPy Conference allows participants from all types of organizations to showcase their latest projects, learn from skilled users and developers, and collaborate on code development.

 

The full program will consist of two days of tutorials by followed by three days of presentations, and concludes with two days of developer sprints on projects of interest to attendees.

No comment yet.
Scooped by Dr. Stefan Gruenwald
Scoop.it!

A gallery of 1,900+ scientific iPython notebooks

A gallery of 1,900+ scientific iPython notebooks | Best | Scoop.it
  1. Entire books or other large collections of notebooks on a topic
  2. Scientific computing and data analysis with the SciPy Stack
  3. General Python Programming
  4. Notebooks in languages other than Python
  5. Miscellaneous topics about doing various things with the Notebook itself
  6. Reproducible academic publications
  7. Other publications using the Notebook
  8. Data-driven journalism
  9. Whimsical notebooks
  10. Videos of IPython being used in the wild
No comment yet.