| 1. |
Where Can I Get Sample Data To Try? |
|
Answer» You can get scripts that produce data files and set up an environment for TPC-DS style benchmark tests from this GitHub repository. In addition to being useful for experimenting with performance, the tables are suited to experimenting with many aspects of SQL on Impala: they contain a good mixture of data types, data distributions, partitioning, and relational data suitable for join queries. |
|