apache hive - Hive Performance - Hive Optimizations- hive tutorial - hadoop hive - hadoop hive - hiveql
apache hive related article tags - hive tutorial - hadoop hive - hadoop hive - hiveql - hive hadoop - learnhive - hive sql
Hive Performance - Hive Optimizations
Architecting Hive Data :
Column Pruning :
- Suppose table T contains 5 columns (a, b, c, d, e) and a query uses only a, b and e.
- Columns c and d are discarded.
- Only the relevant columns are read.
- Enabled by default.
- Controlled by hive.optimize.cp = true.
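As a rough illustration of the points above, the following HiveQL sketch defines a table with the five columns and runs a query that touches only three of them. The column types, the ORC storage clause and the example query are assumptions made for this sketch; only the table name T and the column names come from the text.

```sql
-- Hypothetical definition of T: the column types and ORC storage are assumptions.
CREATE TABLE T (a INT, b STRING, c STRING, d DOUBLE, e INT)
STORED AS ORC;

-- Only a, b and e are referenced, so c and d do not need to be read.
SELECT a, b FROM T WHERE e < 10;

-- EXPLAIN prints the query plan, which can be inspected to see
-- which columns the scan and select operators actually carry.
EXPLAIN SELECT a, b FROM T WHERE e < 10;
```

The benefit is largest with a columnar format such as ORC, where unused columns can be skipped on disk rather than parsed and thrown away.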
Predicate Pushdown :
SELECT * FROM T1 JOIN T2 ON (T1.c1=T2.c2) WHERE T1.c1 < 10;
- Left outer join: predicates on the left-side aliases are pushed down.
- Right outer join: predicates on the right-side aliases are pushed down.
- Full outer join: none of the predicates are pushed down.
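The outer join rules above can be sketched with three variants of the same query. T1, T2, c1 and c2 are taken from the query above; the choice of predicates is an assumption for illustration, and hive.optimize.ppd is, to my knowledge, the Hive setting that toggles predicate pushdown (on by default).

```sql
-- Predicate pushdown is normally on already; shown here only for clarity.
SET hive.optimize.ppd=true;

-- Left outer join: T1 is the preserved side, so the filter on T1
-- can be applied at the T1 scan, before the join.
SELECT * FROM T1 LEFT OUTER JOIN T2 ON (T1.c1 = T2.c2)
WHERE T1.c1 < 10;

-- Right outer join: T2 is the preserved side, so the filter on T2
-- can be applied at the T2 scan.
SELECT * FROM T1 RIGHT OUTER JOIN T2 ON (T1.c1 = T2.c2)
WHERE T2.c2 < 10;

-- Full outer join: both sides can produce NULL-padded rows,
-- so the filter is evaluated after the join instead.
SELECT * FROM T1 FULL OUTER JOIN T2 ON (T1.c1 = T2.c2)
WHERE T1.c1 < 10;
```

Prefixing any of these statements with EXPLAIN shows where the filter operator ends up in the plan.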
Partition Pruning :
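Partition pruning limits the scan to the partitions that match a predicate on the partition column (assumed here to be prtn), including when the predicate is applied outside a subquery. The subquery has to expose prtn for the outer filter to be valid:
SELECT * FROM
(SELECT c1, prtn, COUNT(1) AS cnt FROM T GROUP BY c1, prtn) subq
WHERE subq.prtn = 100;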
SELECT * FROM T1 JOIN
(SELECT * FROM T2) subq ON (T1.c1=subq.c2)
WHERE subq.prtn = 100;
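A minimal sketch of the simplest case, assuming prtn is the partition column: the table name T_partitioned and its columns are hypothetical, chosen only to mirror the queries above.

```sql
-- Hypothetical partitioned table; prtn is the partition column.
CREATE TABLE T_partitioned (c1 INT, c2 STRING)
PARTITIONED BY (prtn INT);

-- The filter on prtn lets Hive read only the prtn=100 partition
-- instead of scanning the whole table.
SELECT c1, COUNT(1) AS cnt
FROM T_partitioned
WHERE prtn = 100
GROUP BY c1;

-- EXPLAIN DEPENDENCY lists the tables and partitions a query reads,
-- which is one way to check that pruning took place.
EXPLAIN DEPENDENCY
SELECT c1, COUNT(1) AS cnt
FROM T_partitioned
WHERE prtn = 100
GROUP BY c1;
```

On an empty table the dependency output lists no partitions, so the check is only meaningful once some partitions have been loaded.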
Tips to Make Hive Fast on Hadoop - Data Layout Considerations :
- Divide data among different files that can be pruned out, using partitions, buckets and skew.
- Skip records during scans using small embedded indexes; this is automatic when you use the ORCFile format.
- Sort data ahead of time; this simplifies joins and makes skipping more effective.
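The layout tips above could be combined in a table definition along the following lines. The table names, column names, bucket count and skewed values are all hypothetical, and hive.optimize.index.filter is, to my knowledge, the setting that lets ORC readers use their built-in indexes for filtering; treat this as a sketch rather than a recommended configuration.

```sql
-- Hypothetical table: partitioned by date, bucketed and sorted by user_id, stored as ORC.
CREATE TABLE page_views (
  user_id   BIGINT,
  url       STRING,
  view_time TIMESTAMP
)
PARTITIONED BY (view_date STRING)
CLUSTERED BY (user_id) SORTED BY (user_id ASC) INTO 32 BUCKETS
STORED AS ORC;

-- Hypothetical skewed table: heavily repeated key values are declared up front.
CREATE TABLE clicks_skewed (key STRING, value STRING)
SKEWED BY (key) ON ('1', '5', '6');

-- Allow filters to be pushed into the ORC reader so that row groups
-- whose min/max statistics cannot match are skipped.
SET hive.optimize.index.filter=true;

-- The partition filter prunes files; the user_id filter benefits from
-- sorted data because matching rows sit in few row groups.
SELECT url
FROM page_views
WHERE view_date = '2015-01-01' AND user_id = 42;
```

Sorting on the column most often used in filters or joins is what makes the embedded indexes selective; on unsorted data most row groups span the full value range and little can be skipped.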
[Figure: Hive performance]
[Figure: Hive with no compression]
[Figure: Hive column sorting to facilitate skipping]