Skip to main content

License single eBooks

Your results

Search and filter

Showing 1 - 1 of 1
0 eBooks €0

  • Running Web Crawlers/Scrapers on a Big Data Production Scale
    Jay M. Patel
    978-1-4842-6576-5
    2020
    Edition 1
    • Shows you how to process web crawls from Common Crawl, one of the largest publicly available web crawl datasets (petabyte scale) indexing over 25 billion web pages ever month
    • Takes you from developing a simple Python-based web scraper on your personal computer to a distributed crawler with multiple nodes running on the cloud
    • Teaches you to process raw data using NLP techniques and boilerplate removal to extract useful insights that can power businesses with vertical/meta search engines, lead generation and Internet marketing, monitoring of competitors, brands, and prices, and more

    €118

Is this helpful?

Survey

Survey to collect feedback on the helpfulness of this page.

Survey

Survey to collect feedback on the helpfulness of this page.