Programmer's World (程式師世界) is a platform where programming enthusiasts help one another, share, and learn. It's better with you here!

Creating a Scrapy crawler project in Python

  1. Install Scrapy: `pip install scrapy -i https://mirrors.aliyun.com/pypi/simple/` (the trailing `-i https://mirrors.aliyun.com/pypi/simple/` points pip at a domestic mirror, which speeds up the download).
  2. Open Cmd or PyCharm's Terminal.
  3. Change to the path where the crawler project should be created and run `scrapy startproject <project name>` to create the project.
  4. Enter the project directory and run `scrapy genspider <spider name> "<host address>"` to create the spider file.
  5. Configure settings.py (in PyCharm):
(1) Set `ROBOTSTXT_OBEY = False`. For an explanation of the robots protocol, see https://blog.csdn.net/wz947324/article/details/80633668 (some sites disallow crawlers; if the robots agreement is obeyed, those sites cannot be crawled).
(2) Enable `DOWNLOAD_DELAY = 3`: wait 3 seconds between requests to the server, to simulate a human user's access pattern.
(3) Enable `DEFAULT_REQUEST_HEADERS = {'Accept': 'text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8', 'Accept-Language': 'en',}`. Default request headers go here; delete the placeholder content and set `User-Agent` and `Cookie` as needed.
(4) Enable `DOWNLOADER_MIDDLEWARES = {'zhaobiao.middlewares.ZhaobiaoDownloaderMiddleware': 543,}` (zhaobiao is the project name). Downloader middleware is where a proxy IP can be configured.
(5) Enable `ITEM_PIPELINES = {'zhaobiao.pipelines.ZhaobiaoPipeline': 300,}`. The pipeline entry points at the pipelines.py file.
(6) Running the Scrapy project. Method 1: create a start file containing `from scrapy import cmdline` and `cmdline.execute('scrapy crawl bilian'.split())` (bilian is the spider file name). Method 2: in the Terminal, run `scrapy crawl bilian`.
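Collected in one place, the five settings above can be sketched as a settings.py excerpt. This is a sketch using the article's example project name `zhaobiao`; the middleware and pipeline class names are the ones Scrapy generates from that project name, and the header values mirror Scrapy's defaults.

```python
# settings.py (excerpt) -- the five changes from step 5
BOT_NAME = "zhaobiao"  # project name from the article's example

# (1) Ignore robots.txt so pages disallowed by it can still be fetched
ROBOTSTXT_OBEY = False

# (2) Wait 3 seconds between requests to mimic a human visitor
DOWNLOAD_DELAY = 3

# (3) Default request headers; add User-Agent / Cookie here as needed
DEFAULT_REQUEST_HEADERS = {
    "Accept": "text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8",
    "Accept-Language": "en",
}

# (4) Downloader middleware, e.g. for configuring proxy IPs
DOWNLOADER_MIDDLEWARES = {
    "zhaobiao.middlewares.ZhaobiaoDownloaderMiddleware": 543,
}

# (5) Item pipeline, pointing at the project's pipelines.py
ITEM_PIPELINES = {
    "zhaobiao.pipelines.ZhaobiaoPipeline": 300,
}
```

The numeric values (543, 300) are priorities: lower numbers run closer to the engine for pipelines, and middleware order is relative to Scrapy's built-ins.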
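For step 6, method 1, the start file hands `cmdline.execute` an argv list rather than a single string; a quick look at what `.split()` produces (pure Python, no Scrapy needed) also shows why method 2's terminal command is simply `scrapy crawl bilian`:

```python
# The start file runs: cmdline.execute("scrapy crawl bilian".split())
# str.split() breaks the command string on whitespace into the argv list
# that Scrapy's command-line parser expects.
argv = "scrapy crawl bilian".split()
print(argv)  # ['scrapy', 'crawl', 'bilian']
```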
Copyright © 程式師世界 All Rights Reserved