Data Warehousing
What is Data Warehouse? Types, Definition & Example
What is Data Warehousing? A Data Warehousing (DW) is process for collecting and managing data from...
Teradata is an open-source Database Management System for developing large-scale data warehousing applications. This tool provides support for multiple data warehouse operations simultaneously using the concept of parallelism. Teradata is a massively open processing system that supports Unix/Linux/Windows server platforms.
Teradata software is developed by Teradata Corporation, which is an American IT firm. It is a vendor of analytics data platforms, applications, and other related services. The firm develops a product to consolidate data from various sources and make the data available for analysis.
In this Teradata tutorial for beginners, you will learn Teradata basics like:
Teradata was a division of NCR Corporation. It incorporated in 1979 but parted away from NCR in October 2007. Michael Koehler became the first CEO of Teradata.
Milestones of Teradata Corporation:
Next in this Teradata tutorial, we will learn about features of Teradata.
Teradata offers following powerful features:
Next in this Teradata SQL tutorial, we will learn about Teradata Architecture.
Teradata architecture is a Massively Parallel Processing Architecture.
Three important components of Teradata are:
Teradata Architecture Diagram:
Teradata Architecture Diagram
Parsing Engine:
The Parsing Engine parses the queries and prepares the execution plan. It manages sessions for users. It optimizes & sends a request to the users.
So, when the client executes queries for inserting records, Parsing Engine sends the records to Message Passing layer. Message passing layer or BYNET is a software and hardware component. It offers networking capability. It also retrieves the records and sends the row to the target AMP.
AMP:
AMP stands for Access Module Processor. It stores records on these disks. AMP conduct following activities:
When the client runs queries to retrieve records, the Parsing engine sends a request to BYNET. Then BYNET sends the retrieval request to appropriate AMPs.
AMPs search their disks in parallel and recognize the required records and send to BYNET. BYNET sends the records to Parsing Engine which in turn will send to the client.
Next in this Teradata Database tutorial, we will learn about TeraData SQL commands.
Teradata Database supports following basic SQL commands:
| COMMAND | Description |
|---|---|
| CREATE | Creates a new database, table, user, etc. |
| DROP | Removes a new database, table, user, etc. |
| ALTER | Changes a Table, column, trigger, etc. |
| MODIFY | Changes a Database or user definition |
| RENAME | Changes name of tables,views,macros, etc. |
| COMMAND | Description |
|---|---|
| GRANT/REVOKE | Used to control privileges of a user on an object |
| GRANT LOGON/REVOKE LOGON | Used to control logon privileges to a host or host group |
| GIVE | Used to give a database object to another database object |
| COMMAND | Description |
|---|---|
| DELETE | Removes a row from table |
| ECHO | Used to echo a string or command to the client |
| CHECKPOINT | Defines a recovery point in the journal that can be used later to restore the table content |
| SELECT | Used to return a specific row data in a table form |
| UPDATE | Modifies data in one or more rows of a table |
Following are the popular Teradata Applications:
Next in this Teradata for beginners tutorial, we will learn about the difference between Teradata and other RDBMS.
| Parameter | TERA DATA | RDBMS |
|---|---|---|
| Architectures | Follows Shared Nothing Architecture. | Shared Everything and allows resource contention. |
| Processes | MIPS [Millions of Instructions/sec | KIPS [Thousands of institutions/sec] |
| Indexes | Better Distribution and Retrieval | Only offers FASI Retrieval |
| Parallelism | Supports Un-conditional parallelism. | Parallelism is conditional and unpredictable |
| Bulk Load | Teradata allows bulk load. | Allows only limited bulk load. |
| Scalability | Linear scalability with a slope of one | Scalability with diminishing returns |
| Database buffer | A single database buffer used by all UoP's. (A unite of parallelism). A single data store accessed by all UoP's | Query Controller ships functions to UoP's that own the data |
| Stores | It stores TERA BYTES[Billions of rows] | GIGA BYTES[Millions of rows] |
| MPP | SMP |
|---|---|
| MPP - Massively Parallel Processing. It is Computer system which is attached to many independent arithmetic units or entire microprocessors, that run in parallel. | Symmetric Multi-Processing. In an SMP processing system, the CPU's share the same memory, and as a result code running in one system may affect the memory used by another. |
| Databases can expand by adding new CPUs. | SMP databases generally use one CPU to perform database searches. |
| In an MPP environment, performance is improved because no resources must be shared among physical computers. | The workload for a parallel job is distributed across the processors in the system. |
| Performance of a Massive parallel processing system is linear. However, it will increase in proportion to the number of nodes. | SMP databases can run on multiple servers. However, will share another resource. |
What is Data Warehousing? A Data Warehousing (DW) is process for collecting and managing data from...
What is ETL? ETL is an abbreviation of Extract, Transform and Load. In this process, an ETL tool...
Download PDF 1) How do you define Teradata? Give some of the primary characteristics of the same....
{loadposition top-ads-automation-testing-tools} What is Business Intelligence Tool? BUSINESS...
What is MOLAP? Multidimensional OLAP (MOLAP) is a classical OLAP that facilitates data analysis by...
What is DataStage? Datastage is an ETL tool which extracts data, transform and load data from...