
Running More Than One Spider One By One

I am using the Scrapy framework to crawl some webpages with spiders. Basically, what I want is to scrape the pages and save them to a database. I have one spider per webpage, but I want the spiders to run one by one (sequentially) rather than all at the same time.

Solution 1:

scrapyd is indeed a good way to go. Its max_proc or max_proc_per_cpu configuration options can be used to restrict the number of spiders running in parallel; you then schedule the spiders through scrapyd's REST API, for example:

$ curl http://localhost:6800/schedule.json -d project=myproject -d spider=somespider
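To make the scheduled spiders actually run one at a time, the relevant part of scrapyd.conf could look like the sketch below (the surrounding settings and file location are left at their defaults here; max_proc = 1 caps scrapyd at a single crawl process, so queued jobs are worked through sequentially):

[scrapyd]
max_proc         = 1
max_proc_per_cpu = 1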

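Scheduling every spider can also be scripted instead of calling curl by hand. A minimal Python sketch, assuming scrapyd runs locally on the default port 6800, the project is deployed as myproject, and the spider names below are placeholders:

import requests

SCRAPYD_URL = "http://localhost:6800/schedule.json"
SPIDERS = ["somespider", "anotherspider"]  # placeholder spider names

for spider in SPIDERS:
    # Each POST only queues a job; with max_proc = 1 scrapyd works
    # through the queue one spider at a time.
    resp = requests.post(SCRAPYD_URL, data={"project": "myproject", "spider": spider})
    resp.raise_for_status()
    print(spider, resp.json().get("jobid"))

Each call returns a jobid, which can be checked against scrapyd's listjobs.json endpoint if you need to wait for one spider to finish before scheduling the next.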