1.

Explain partitioning in ETL and write its type.

Answer»

Essentially, partitioning is the process of dividing up a data storage area for IMPROVED performance. It can be used to organize your work. Having all your data in one place without organization makes it more difficult for DIGITAL tools to find and analyze the data. It is easier and faster to locate and analyze data when your data warehouse is partitioned. The following reasons make partitioning important: 

  • Facilitate easy data MANAGEMENT and enhance performance.
  • Ensures that all of the SYSTEM's requirements are balanced.
  • Backups/recoveries made easier.
  • Simplifies management and optimizes hardware performance.

Types of Partitioning -

  • Round-robin Partitioning: This is a method in which data is evenly spread among all partitions. Therefore, each partition has approximately the same number of ROWS. Unlike hash partitioning, the partitioning columns do not need to be specified. New rows are assigned to partitions in round-robin style.
  • Hash Partitioning: With hash partitioning, rows are evenly distributed across partitions based on a partition key. Using a hash function, the server creates partition keys to group data.


Discussion

No Comment Found