Tools, libraries, services, books, blog posts, etc. all count as 'resources' for the purposes of this question.
Feel free to note whether each recommendation is aimed at an experienced dev or a n00b.
UPDATE: If any patterns emerge around pain points or missing docs, examples, or learning resources, I might create a page or an ebook project on GitHub that covers that area.
If you are interested in web crawling, which is often necessary when you want to extract data from very large sites (or many sites), I just wrote a blog post comparing open source web crawling systems (including Scrapy): http://blog.blikk.co/comparison-of-open-source-web-crawlers