April 02 to 04 2018, Santa Clara, USA.


Speaker "Ashutosh Chuahan" Details

Name :
ashutosh chuahan
Company :
Title :
Topic :

Recents Advancements in Apache Hive

Abstract :

Apache Hive is a rapidly evolving project which continues to enjoy great adoption in big data ecosystem. Although, Hive started primarily as batch ingestion and reporting tool, community is hard at work in improving it along many different dimensions and use cases. This talk will provide an overview of latest and greatest features and optimizations which have landed in project over last year. Materialized view, micro managed tables and workload management are some noteworthy features. I will deep dive into some optimizations which promise to provide major performance gains. Support for ACID tables has also improved considerably. Although some of these features and enhancements are not novel but have existed for years in other DB systems, implementing them on Hive poses some unique challenges and results in lessons which are generally applicable in many other contexts. I will also provide a glimpse of what is expected to come in near future.

Profile :

Ashutosh has been in the area of Big Data for 6 years. He is the VP of Hive PMC in Apache Software Foundation.


Get latest updates of Global Data Science Conference
sent to your inbox.

Weekly insight from industry insiders.
Plus exclusive content and offers.