Teradata 简明教程

Teradata - Architecture

Teradata 架构基于大规模并行处理 (MPP) 架构。Teradata 的主要组件有解析引擎、BYNET 和访问模块处理器 (AMP)。下图显示了 Teradata 节点的高层次架构。

Teradata architecture is based on Massively Parallel Processing (MPP) architecture. The major components of Teradata are Parsing Engine, BYNET and Access Module Processors (AMPs). The following diagram shows the high level architecture of a Teradata Node.

teradata node architecture

Components of Teradata

Teradata 的关键组件如下所示 −

The key components of Teradata are as follows −

  1. Node − It is the basic unit in Teradata System. Each individual server in a Teradata system is referred as a Node. A node consists of its own operating system, CPU, memory, own copy of Teradata RDBMS software and disk space. A cabinet consists of one or more Nodes.

  2. Parsing Engine − Parsing Engine is responsible for receiving queries from the client and preparing an efficient execution plan. The responsibilities of parsing engine are − Receive the SQL query from the client Parse the SQL query check for syntax errors Check if the user has required privilege against the objects used in the SQL query Check if the objects used in the SQL actually exists Prepare the execution plan to execute the SQL query and pass it to BYNET Receives the results from the AMPs and send to the client

  3. Message Passing Layer − Message Passing Layer called as BYNET, is the networking layer in Teradata system. It allows the communication between PE and AMP and also between the nodes. It receives the execution plan from Parsing Engine and sends to AMP. Similarly, it receives the results from the AMPs and sends to Parsing Engine.

  4. Access Module Processor (AMP) − AMPs, called as Virtual Processors (vprocs) are the one that actually stores and retrieves the data. AMPs receive the data and execution plan from Parsing Engine, performs any data type conversion, aggregation, filter, sorting and stores the data in the disks associated with them. Records from the tables are evenly distributed among the AMPs in the system. Each AMP is associated with a set of disks on which data is stored. Only that AMP can read/write data from the disks.

Storage Architecture

当客户端运行查询以插入记录时,解析引擎会将这些记录发送给 BYNET。BYNET 检索记录并将行发送给目标 AMP。AMP 将这些记录存储在其磁盘上。下图显示了 Teradata 的存储架构。

When the client runs queries to insert records, Parsing engine sends the records to BYNET. BYNET retrieves the records and sends the row to the target AMP. AMP stores these records on its disks. Following diagram shows the storage architecture of Teradata.

storage architecture

Retrieval Architecture

当客户端运行查询以检索记录时,解析引擎会向 BYNET 发送一个请求。BYNET 将检索请求发送给适当的 AMP。然后,AMP 会并行搜索其磁盘并识别所需记录,然后发送给 BYNET。然后 BYNET 会将记录发送给解析引擎,解析引擎会将其发送给客户端。以下是 Teradata 的检索架构。

When the client runs queries to retrieve records, the Parsing engine sends a request to BYNET. BYNET sends the retrieval request to appropriate AMPs. Then AMPs search their disks in parallel and identify the required records and sends to BYNET. BYNET then sends the records to Parsing Engine which in turn will send to the client. Following is the retrieval architecture of Teradata.

retrieval architecture