Scrapy Tutorial

Scrapy is a fast, open-source web crawling framework written in Python. It extracts data from web pages using XPath-based selectors.

Audience

This tutorial is designed for software programmers who need to learn the Scrapy web crawler from scratch.

Prerequisites

You should have a basic understanding of computer programming terminology and Python. Basic familiarity with XPath is a plus.