Scraping the web is a favorite pastime of many sourcers. In fact, a recent SourceCon Denver session about web scraping was one of the most popular sessions of the conference. If you’re new to web scraping, here are a few resources from the SourceCon archives to help you get started.
1) Aaron Lintz shows the SourceCon Live audience how to use Import.io
2) Glenn Gutmacher shows the SourceCon Live audience how to use Outwit Hub
3) Todd Davis’ Presentation in the 2013 SourceCon Labs shows how to extract data from Facebook with Memonic
Tools needed to do what Todd teaches in the video above:
Firefox – Chrome, Internet Explorer, and other browsers won’t allow you to select multiple lines of text from a web page. This can only be done with Firefox.