Sunday 25th October, 2015
5:00pm to 5:30pm
How often have you looked at a site and thought "I could make something so
much better, if only I had the data"? Well, now you can! Learn how to
combine Portia and Pandas to extract and process data from around the web.
Portia is an easy-to-use web UI that allows you to generate spiders by
selecting the data you want and defining simple rules for which links to
follow.
Pandas is a library that provides powerful tools for analysing your
extracted data.
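
As a rough illustration of the Pandas side, the sketch below loads a handful of scraped items and summarises them by category. It assumes the spider's output has been exported as JSON lines (one item per line); the field names `title`, `category` and `price`, the sample data and the helper `summarise_prices` are all hypothetical and not taken from the session itself.

```python
import io

import pandas as pd

# Hypothetical sample of scraped items, one JSON object per line.
# In practice this would be a file exported by your spider.
SAMPLE_ITEMS = """\
{"title": "Widget A", "category": "tools", "price": "19.99"}
{"title": "Widget B", "category": "tools", "price": "5.00"}
{"title": "Gadget C", "category": "toys", "price": "12.50"}
"""


def summarise_prices(source):
    """Return item count and mean price per category (illustrative helper)."""
    # lines=True tells pandas to parse one JSON object per line.
    items = pd.read_json(source, lines=True)
    # Scraped values often arrive as strings; coerce them to numbers,
    # turning anything unparseable into NaN instead of raising.
    items["price"] = pd.to_numeric(items["price"], errors="coerce")
    return items.groupby("category")["price"].agg(["count", "mean"])


summary = summarise_prices(io.StringIO(SAMPLE_ITEMS))
print(summary)
```

To run this against real output, you could pass a file path instead of the in-memory sample, provided the export uses the same one-item-per-line layout.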
This session should give you enough information to get started with
extracting and analysing data quickly and easily.
Software developer at Scrapinghub writing open source tools to help make extracting data from the web quicker and easier.