If programming is magic, then web scraping is surely a form of wizardry. By writing a simple automated program, you can query web servers, request data, and parse it to extract the information you need. This thoroughly updated third edition not only introduces you to web scraping, but also serves as a comprehensive guide to scraping almost every type of data from the modern web.
Part I covers the mechanics of web scraping: using Python to request information from a web server, performing basic processing of the server's response, and interacting with sites in an automated fashion. Part II covers a variety of more specific tools and applications to fit any web scraping scenario you are likely to encounter.
Parse complex HTML pages
Develop crawlers using the Scrapy framework
Learn how to store the data you scrape
Read and extract data from documents
Clean and normalize poorly formatted data
Read and write natural languages
Crawl through forms and logins
Scrape JavaScript and crawl through APIs
Use and write image-to-text software
Avoid scraping traps and bot blockers
Use scrapers to test your website
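For anyone wondering what the "request data and parse it" step looks like in practice, here is a minimal sketch using only Python's standard library. It is not taken from the book: the `LinkCollector` class, the `extract_links` helper, and the `SAMPLE_HTML` string are hypothetical names made up for illustration, and the HTML is an inline string so the example runs without a network connection.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collects href values from <a> tags (hypothetical helper, not from the book)."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for the opening tag
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def extract_links(html):
    """Return every href found in the given HTML string, in document order."""
    parser = LinkCollector()
    parser.feed(html)
    parser.close()
    return parser.links


# Stand-in for a server response; a real scraper would download this text.
SAMPLE_HTML = """
<html><body>
<a href="/page1">One</a>
<a href="https://example.com/page2">Two</a>
<a name="anchor-without-href">Three</a>
</body></html>
"""

if __name__ == "__main__":
    print(extract_links(SAMPLE_HTML))
```

To run it against a live page, the same function could be fed the decoded body returned by `urllib.request.urlopen(url).read()`, assuming the site permits automated access.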