HIVE
1. In a multi-table join, every table except the last (rightmost) one is buffered in the reducers. To keep the memory footprint on the reducers small, put the largest table last, because it is streamed to the reducers rather than buffered. Impala behaves differently.
2. You can filter unwanted records within the join itself by putting the conditions in the ON clause:
   SELECT a.*, b.* FROM a LEFT OUTER JOIN b ON (a.key = b.key AND b.name = 'Tim' AND a.name = 'Kim')
3. Joins are left associative, so execution proceeds from left to right and the result of each join is fed into the next join operation.
4. To find the value of a single variable, use set followed by the variable name:
   set hive.execution.engine;
5. To find the values of all variables:
   set;
6. To pass different Hive configuration settings when starting the Hive shell, use --hiveconf. For example:
   hive --hiveconf hive.optimize.sort.dynamic.partition=true --hiveconf hive.exec.dynamic.partition.mode=nonstrict
7. In case you accidentally drop the p