Open source web scraping system for automated data collection tasks
Collection of Python web scraping scripts for data extraction tasks
Python crawler to download photos and videos from Tumblr blogs
Advanced toolkit for detecting and exploiting CSRF vulnerabilities
The next generation web scraping framework
NBA Stats API via Basketball Reference
Declarative web scraping
Scrape job websites into a single spreadsheet with no duplicates.
Asynchronous tool for finding and checking public proxy servers
A tool to scrape images from SimpCity
DataHen Till is a companion tool to your existing web scraper
Automated mobile app crawler and testing tool built on Appium
Multiprocess Selenium crawler for downloading images by keywords
Open source file indexing & storage analytics powered by Elasticsearch
Creating Scrapy scrapers via the Django admin interface
Asyncio-based Python framework for building fast web crawling spiders
Polite concurrent web crawler library for Go with flexible hooks
Python tool that automates JD.com login and product purchase tasks
ML-based HTML scraper that learns extraction rules from examples
Simple Python framework for building multithreaded web crawlers
Collection of reverse engineering articles curated for learning
Lightweight Ruby DSL for scraping structured data from web pages
Linux for content creation, web scraping, coding, and data analysis.