1.

Does Impala Performance Improve As It Is Deployed To More Hosts In A Cluster In Much The Same Way That Hadoop Performance Does?

Answer»

Yes. IMPALA scales with the number of HOSTS. It is important to install Impala on all the DataNodes in the CLUSTER, because otherwise some of the nodes must do remote reads to retrieve data not available for local reads. Data locality is an important architectural ASPECT for Impala performance.

Yes. Impala scales with the number of hosts. It is important to install Impala on all the DataNodes in the cluster, because otherwise some of the nodes must do remote reads to retrieve data not available for local reads. Data locality is an important architectural aspect for Impala performance.



Discussion

No Comment Found