Back

Speaker "Nita Dembla" Details Back

 

Topic

LLAP: Sub-Second Analytical Queries in Hive

Abstract

We discuss LLAP (Live Long and Process) – the new arrival in Hive 2.0 that enables concurrent sub-second execution of analytical queries. LLAP is a hybrid execution model that enables performance improvement in and across querues, such as caching of columnar data with cache coherence and intelligent eviction for disaggregated storage models (like S3, Isilon, Azure), JIT-friendly operator pipelines, asynchronous I/O, data pre-fetching and multi-threaded processing. LLAP features robust machine and service failure tolerance achieved by building on top of the time-tested fault tolerant subsystems, as well as a concurrency-directed design that achieves high utilization with low latency via resource sharing, reducing overheads for multiple queries, and enabling the system to preempt tasks of lower priority without failing any query in-flight. 

Profile

Individual with 15 years of SQL on relational databases experience. Focused on improving Hive performance.