Impala Best Practices 1
Below are Impala performance tuning options.

Pre-execution checklist: data types, partitioning, file format.

Data Type Choices
Define integer columns as INT/BIGINT. Operations on INT/BIGINT are more efficient than operations on STRING.
Convert "external" data to good "internal" types on load, e.g. CAST date strings to TIMESTAMP. This avoids expensive CASTs in queries later.

Partitioning
The fastest I/O is the one […]
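The data type advice above (cast once on load rather than in every query) can be sketched as a small Python helper that emits an Impala CREATE TABLE ... AS SELECT statement. This is only an illustration under stated assumptions: the helper `build_typed_load_sql`, the table names `raw_events` and `events`, and the column names are all hypothetical, and the sketch assumes every column arrives as STRING in the external table.

```python
def build_typed_load_sql(source_table, target_table, column_types):
    """Build a CREATE TABLE ... AS SELECT that casts each column once, at load time.

    column_types maps a column name to the Impala type it should have
    in the internal table (for example "BIGINT" or "TIMESTAMP").
    """
    select_list = ",\n  ".join(
        f"CAST({name} AS {sql_type}) AS {name}"
        for name, sql_type in column_types.items()
    )
    return (
        f"CREATE TABLE {target_table} AS\n"
        f"SELECT\n  {select_list}\n"
        f"FROM {source_table}"
    )


# Hypothetical schema: all three columns are STRING in the external table.
sql = build_typed_load_sql(
    source_table="raw_events",
    target_table="events",
    column_types={
        "user_id": "BIGINT",
        "clicks": "INT",
        "event_time": "TIMESTAMP",
    },
)
print(sql)
```

Queries against the resulting `events` table can then compare and aggregate `user_id`, `clicks`, and `event_time` directly, without repeating the CASTs each time.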