Introduction to Interactive Predictive Analytics in Python with scikit-learn

A session at PyCon US 2012

Thursday 8th March, 2012

1:20pm to 4:40pm (PST)

The goal of this tutorial is to give attendees a first experience of machine learning tools applied to practical software engineering tasks, such as language detection of tweets, topic classification of web pages, sentiment analysis of customer product reviews, and facial recognition in pictures from the web or from your own webcam.

The demand for software engineers with data analytics and machine learning skills is growing rapidly. Python with NumPy is one of the best environments for quickly prototyping scalable data-centric applications or for interactively exploring your data, especially thanks to tools such as IPython and Matplotlib.

scikit-learn is a very active open source project that implements a variety of state-of-the-art machine learning algorithms. The goal of both the project and this tutorial is to take these algorithms out of the academic papers and make them work on a selection of real-world tasks, so that you can unleash the value of your data.

We will focus on hints for choosing the right data preprocessing steps and for selecting algorithms and parameters suited to the task at hand. We will also introduce tools and methodologies for measuring the performance of trained models as objectively as possible.
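
As a rough illustration of the workflow described above (preprocessing, parameter selection, and held-out evaluation), here is a minimal sketch for the language detection task. It is not taken from the tutorial materials: the tiny corpus is made up for illustration, the choice of character n-gram TF-IDF features with logistic regression is an assumption, and the imports assume a recent scikit-learn release (the module layout differed in 2012).

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import Pipeline

# Hypothetical toy corpus: ten English and ten French sentences.
english = [
    "the cat sat on the mat", "I would like a cup of tea",
    "where is the train station", "this is a very good book",
    "the weather is nice today", "she is reading the newspaper",
    "we are going to the market", "he works at the university",
    "what time does the shop open", "they have three children",
]
french = [
    "le chat dort sur le tapis", "je voudrais une tasse de thé",
    "où se trouve la gare", "c'est un très bon livre",
    "il fait beau aujourd'hui", "elle lit le journal",
    "nous allons au marché", "il travaille à l'université",
    "à quelle heure ouvre le magasin", "ils ont trois enfants",
]
texts = english + french
labels = ["en"] * len(english) + ["fr"] * len(french)

# Hold out a test set so the final score is measured on unseen data.
X_train, X_test, y_train, y_test = train_test_split(
    texts, labels, test_size=0.3, stratify=labels, random_state=0
)

# Preprocessing and model live in one pipeline, so the vectorizer is
# fitted only on the training folds during cross-validation.
pipeline = Pipeline([
    ("vec", TfidfVectorizer(analyzer="char_wb")),
    ("clf", LogisticRegression(max_iter=1000)),
])

# Candidate parameters, explored with 3-fold cross-validation.
param_grid = {
    "vec__ngram_range": [(1, 2), (1, 3)],
    "clf__C": [0.1, 1.0, 10.0],
}
search = GridSearchCV(pipeline, param_grid, cv=3)
search.fit(X_train, y_train)

# Accuracy of the best parameter combination on the held-out test set.
test_accuracy = search.score(X_test, y_test)
print("best parameters:", search.best_params_)
print("held-out accuracy:", test_accuracy)
```

With a corpus this small the scores are not meaningful. The point is the structure: parameters are chosen by cross-validation on the training split only, and the test split is touched once at the end.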

About the speaker

Olivier Grisel

Datageek, engineer @Parietal_INRIA, contributor to scikit-learn. I like Python, NumPy, Spark & interested in Machine Learning, NLProc, {Big|Linked|Open} Data. (Bio from Twitter.)
