Beautiful Soup 简明教程

Discuss Beautiful Soup

在本教程中,我们将向您展示如何使用 Beautiful Soup 4 在 Python 中执行网络爬取,以从 HTML、XML 和其他标记语言中获取数据。在这里,我们将尝试从各种不同网站(包括 IMDB)中爬取网页。我们将介绍 beautiful soup 4、python 基本工具,用于有效且清晰地导航、搜索和解析 HTML 网页。

In this tutorial, we will show you, how to perform web scraping in Python using Beautiful Soup 4 for getting data out of HTML, XML and other markup languages. In this we will try to scrap webpage from various different websites (including IMDB). We will cover beautiful soup 4, python basic tools for efficiently and clearly navigating, searching and parsing HTML web page.

在本教程中,我们已尝试介绍 Beautiful Soup 4 的几乎所有功能。你可以将本教程中介绍的多个功能整合到一个更大的程序中,从网站中捕获多个有意义的数据,作为输入放入其他子程序。

We have tried to cover almost all the functionalities of Beautiful Soup 4 in this tutorial. You can combine multiple functionalities introduced in this tutorial into one bigger program to capture multiple meaningful data from the website into some other sub-program as input.