1.

What Is Apache Tajo?

Answer»

Apache Tajo is a relational and distributed data processing framework. It is designed for low-latency, scalable ad-hoc query analysis.

  • Tajo supports standard SQL and various data formats, so most standard SQL queries can be executed without modification.
  • Tajo provides fault tolerance through a restart mechanism for failed tasks, and it includes an extensible query rewrite engine.
  • Tajo performs the ETL (extract, transform, and load) operations needed to summarize large datasets stored on HDFS. It is an alternative to Hive and Pig.
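
To make the "standard SQL" and "summarize large datasets" points concrete, here is a minimal sketch of the kind of ad-hoc aggregate query involved. It does not use Tajo itself: Python's built-in sqlite3 module stands in as the SQL engine so the example runs anywhere, and the table name page_views, its columns, and the sample rows are all hypothetical. Against a real Tajo cluster, a statement of this shape would instead be submitted through Tajo's shell or JDBC driver over tables stored on HDFS.

```python
import sqlite3

# Hypothetical sample data; the table and column names are illustrative only.
ROWS = [
    ("2024-01-01", "/home", 120),
    ("2024-01-01", "/docs", 45),
    ("2024-01-02", "/home", 98),
    ("2024-01-02", "/docs", 61),
]

# Plain standard SQL: total hits per page, largest total first.
SUMMARY_SQL = """
    SELECT page, SUM(hits) AS total_hits
    FROM page_views
    GROUP BY page
    ORDER BY total_hits DESC
"""


def summarize(rows):
    """Load rows into an in-memory table and return the per-page totals."""
    conn = sqlite3.connect(":memory:")  # stand-in engine, not Tajo
    try:
        conn.execute(
            "CREATE TABLE page_views (view_date TEXT, page TEXT, hits INTEGER)"
        )
        conn.executemany("INSERT INTO page_views VALUES (?, ?, ?)", rows)
        return conn.execute(SUMMARY_SQL).fetchall()
    finally:
        conn.close()


if __name__ == "__main__":
    for page, total in summarize(ROWS):
        print(page, total)
```

The query uses only SELECT, SUM, GROUP BY, and ORDER BY, which is why the same statement is portable across SQL engines; only the connection and storage layers differ.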



