insophia is sharing code with you

Bitbucket is a code hosting site. Unlimited public and private repositories. Free for small teams.

Don't show this again

insophia / scrapy http://scrapy.org/

Scrapy has moved to Github: https://github.com/scrapy/scrapy. This legacy Mercurial repo was kept for reference purposes and is no longer being updated.

Clone this repository (size: 8.3 MB): HTTPS / SSH
hg clone https://bitbucket.org/insophia/scrapy
hg clone ssh://hg@bitbucket.org/insophia/scrapy

scrapy overview

Recent commits See more »

Author Revision Comments Message Labels Date
Daniel Graña dc6a85919ac6 Do not filter requests with dont_filter attribute set in OffsiteMiddleware
Pablo Hoffman 6a265601e6c9 scrapyd: updated schedule.json response format
Pablo Hoffman 708c9e73cfd6 added unittest for SpiderState extension
Pablo Hoffman 4a6c0d4d7e8d restored support for spider.DOWNLOAD_DELAY attribute, with deprecation warning
Pablo Hoffman 3a133543f077 replaced use of deprecated w3lib.url.urljoin_rfc by stdlib urlparse.urljoin

This is Scrapy, an opensource screen scraping framework written in Python.

For more info visit the project home page at http://scrapy.org