Scrapy 简明教程
Scrapy - Web Services
Description
一个正在运行的 Scrapy 网络爬虫可以通过 JSON-RPC 控制。它通过 JSONRPC_ENABLED 设置启用。此服务通过 JSON-RPC 2.0 协议提供对主爬虫对象的访问。用于访问爬虫程序对象的端点为 −
A running Scrapy web crawler can be controlled via JSON-RPC. It is enabled by JSONRPC_ENABLED setting. This service provides access to the main crawler object via JSON-RPC 2.0 protocol. The endpoint for accessing the crawler object is −
http://localhost:6080/crawler
下表包含显示 Web 服务行为的一些设置 −
The following table contains some of the settings which show the behavior of web service −
Sr.No |
Setting & Description |
Default Value |
1 |
JSONRPC_ENABLED This refers to the boolean, which decides the web service along with its extension will be enabled or not. |
True |
2 |
JSONRPC_LOGFILE This refers to the file used for logging HTTP requests made to the web service. If it is not set the standard Scrapy log will be used. |
None |
3 |
JSONRPC_PORT This refers to the port range for the web service. If it is set to none, then the port will be dynamically assigned. |
[6080, 7030] |
4 |
JSONRPC_HOST This refers to the interface the web service should listen on. |
'127.0.0.1' |