
Speaker "Milind Bhandarkar" Details

Topic

Future of Big Data Analytics is Fast

Abstract

Hadoop turns 10 in 2016. In these ten years, starting with MapReduce and HDFS, Hadoop has evolved into tens of components, mostly open-source projects. These include many SQL-on-Hadoop systems (Impala, HAWQ, Presto, Drill), DAG-execution frameworks (Spark, Tez, Flink), OLAP engines (Kylin), and streaming analytics engines (Storm, Apex, Samza). In this talk, we will discuss the underlying reasons for this proliferation of big data computation frameworks, and describe a unifying framework and architecture in which end-to-end analytical pipelines can be built. We will discuss the commodity hardware trends that make this unification possible. Ampool is building enabling technologies for such a unified architecture. We will demonstrate how Ampool enables fast analytics on big data.

Profile

Milind Bhandarkar was a founding member of the team at Yahoo! that took Apache Hadoop from a 20-node prototype to a datacenter-scale production system, and he has been contributing to and working with Hadoop since version 0.1.0. He started the Yahoo! Grid solutions team, which focused on training, consulting, and supporting hundreds of new migrants to Hadoop. Parallel programming languages and paradigms have been his area of focus for over 20 years. He has worked at the Center for Development of Advanced Computing (C-DAC), the National Center for Supercomputing Applications (NCSA), the Center for Simulation of Advanced Rockets, Siebel Systems, PathScale Inc. (acquired by QLogic), Yahoo!, and LinkedIn. Currently, he is the Chief Scientist at Greenplum, a division of EMC.