Hive Meetups

February 2015 Hive User Meetup Presentation

Hive Join Optimizations: MR and Spark (Szehon Ho)
Cascading and Hive (Ryan Desmond)

November 2013 Hive Contributors Meetup Presentations

Using Dynamic Compilation with Hive (Edward Capriolo)
Let There Be Tez: Current Status and Demo (Gunther Hagleitner)
Hive Authorization Improvement Proposal - HIVE-5837 (Thejas Nair)
Insert, update, and delete in Hive with full ACID support - HIVE-5317 (Owen O'Malley)
Apache Sentry (Brock Noland)
Cost Based Optimizer Design - HIVE-5775 (John Pullokkaran)
Join syntax improvement - HIVE-5555 (Ashutosh Chauhan)

November 2011 NYC Hive Meetup Presentations

Breaking First Normal Form (Edward Capriolo, Media 6 Degrees)
HAWK: Performance Monitoring for Hive (JunHo Cho, NexR)
RHive: Integrating R and Hive (JunHo Cho, NexR)
Replacing the Hive Execution Layer with Pervasive Turbo Rush (Jim Falgout, Pervasive Software)
Using JDBC with Hive (Bennie Schut, eBuddy)

June 2012 Hadoop Summit Hive Meetup Presentations

What's News in Apache Hive 0.9.0 (Ashutosh Chauhan, Hortonworks)
HiveServer2 Project (Carl Steinbach, Cloudera)
Column Statistics in Hive (Shreepadma Venugopalan, Cloudera)
HCatalog: Extending Hive's Metadata to Pig and MR (Alan Gates, Hortonworks)
Continuous Aggregations with Hive (Viral Bajaria, Hulu)

February 2013 Hive User Group Meetup

Hive at KIXEYE and in the Gaming Industry (Aaron Sun)
Brickhouse: Klout's Open Source UDF Library for Hive (Jerome Banks)
Hive Client/Server Deployment Options (Prasad Mujumdar)
Case Study: Utilizing Windowing and Partitioned Table Functions with Hive (Murtaza Doctor)
New Features in the Next Version of Hive (Ashutosh Chauhan)

June 2013 Hadoop Summit Hive Meetup Presentations

Hive Correlation Optimizer (Yin Huai)

Older Hive Presentations

Hive ApacheCon 2008, New Oreleans, LA (Ashish Thusoo, Facebook)
Facebook and Open Source, UIUC, (Zheng Shao, Facebook)
Hive: Data Warehousing with Hadoop, NYC Hadoop User Meetup (Jeff Hammerbacher, Cloudera)
Hive: Data Warehousing Analytics on Hadoop, UC Berkeley, (Joydeep Sarma, Namit Jain, Zheng Shao, Facebook)
An Introduction to Hive, Jeff Hammerbacher, Facebook
Large Scale Data Processing using commodity SW/HW, IIT-Delhi CS Dept., (Joydeep Sen Sarma, Facebook)
Data Warehousing & Analytics on Hadoop, Percon Conference, Santa Clara, CA, USA (Ashish Thusoo, Prasad Chakka, Facebook)
Hive: Hadoop Summit 2009, Santa Clara, CA, USA (Namit Jain, Zheng Shao, Facebook)
Hive: VLDB 2009, Lyon, France (Facebook)
Hive Object Model, (Zheng Shao, Facebook)
Hive User Group Meeting August 2009, (Facebook)
Rethinking the Data Warehouse with Hadoop and Hive (Ashish Thusoo, Facebook at Hadoop World NYC 2009)
Hive Anatomy – System & Pseudo-code level Architecture Review, (Ning Zhang, internal presentations, Facebook)
User Defined Table Generating Functions, (Paul Yang, Facebook)
Hive Training – Motivations and Real World Use Cases, (Ning Zhang, Facebook, at Cloudera's training session)
Hive Presentation at QCon Nov 2009 (Ashish Thusoo and Namit Jain, Facebook)
Hive Paper and Presentation at ICDE 2010, Long Beach, California (Raghotham Murthy and Namit Jain)
Hive User Group Presentation from Netflix (Eva Tse and Jerome Boulon, Netflix)
Hive User Group Presentation - New Features and APIs from Facebook (Ning Zhang, Yongqiang He, Namit Jain, John Sichi, Paul Yang, Zheng Shao, Facebook)
Hive User Group Presentation - Hive Quick Start Tutorial (Carl Steinbach, Cloudera)
HBase Meetup HUG10 - Hive/HBase Integration (John Sichi, Facebook)
Using Hadoop and Hive to Optimize Travel Search (Jonathan Seidman and Ramesh Venkataramaiah, Orbitz)
Hive Presentation at ApacheCon NA 2010 (John Sichi, Facebook)
Join Optimization in Hive (Liyin Tang, Facebook)
RCFile Paper at ICDE 2011, Hannover, Germany (Yongqiang He (Facebook), Rubao Lee (OSU), Yin Huai (OSU), Zheng Shao (Facebook), Namit Jain (Facebook), Xiaodong Zhang (OSU), Zhiwei Xu (ICT, CAS))
Replacing an Oracle DB/DW with Hadoop/Hive from the 2011 Hadoop Summit Hive Contributor Meeting (JunHo Cho, NexR)
Join Strategies in Hive from the 2011 Hadoop Summit (Liyin Tang, Namit Jain)
High Volume Updates in Hive from 2012 Hadoop Summit (Owen O'Malley)

Related Work

Processing Theta-Joins using MapReduce (A. Okcan, M. Riedewald)
Optimizing Joins in a Map-Reduce Environment (F. Afrati, J. Ullman)
Efficient Parallel Set-Similarity Joins Using MapReduce (R. Vernica, M. Carey, C. Li)
A Comparison of Join Algorithms for Log Processing in MapReduce (S. Blanas, J. Patel, V. Ercegovac, J. Rao, E. Shekita, Y. Tian)
HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads (A. Abouzeid, K. Bajda-Pawlikowski, D. Abadi, A. Silberschatz, A. Rasin)
Tenzing: A SQL Implementation On The MapReduce Framework
Building a High-Level Dataflow System on top of Map-Reduce: The Pig Experience
YSmart: Yet Another SQL-to-MapReduce Translator (R. Lee, T. Luo, Y. Huai, F. Wang, Y. He, X. Zhang)
Query Optimization Using Column Statistics in Hive (A. Gruenheid, E. Omiecinski, L. Mark)

Space shortcuts

Child pages

Hive Meetups

February 2015 Hive User Meetup Presentation

November 2013 Hive Contributors Meetup Presentations

November 2011 NYC Hive Meetup Presentations

June 2012 Hadoop Summit Hive Meetup Presentations

February 2013 Hive User Group Meetup

June 2013 Hadoop Summit Hive Meetup Presentations

Older Hive Presentations

Related Work