Getting scraped parlamento.pt data into a Google spreadsheet

In this open data project we’re trying to make it easy for non-techie folks to collaborate and contribute. Since spreadsheets are still the best way for them to see what data we already have and to experiment with it (charts and other visualizations), I decided I should try to connect […]

Tools and Libs for Scraping parlamento.pt

So what am I using to scrape the parlamento.pt site? Well, first let me tell you that I’m mostly a Microsoft developer. I develop web applications with Visual Basic and (less often) C# in Visual Studio, along with MS SQL Server, but I’ve also played with PHP and MySQL and done a few projects with them. Since the Hacklaviva […]

Scraping parlamento.pt

Because scraping is the boring part of exploring data, I tried to find the easiest, least time-consuming, yet easy-to-maintain set of tools to do it. Since the URLs are (somewhat) well known, I was able to skip the crawler and URL-discovery part of this project. I just curl(ed) the page that I wanted […]