Speaker "Vladimir Bacvanski" Details
Name
Vladimir Bacvanski
Company
PayPal
Designation
Principal Architect
Topic
Unifying Big Data Batch and Real-Time Streaming with Apache Flink
Abstract
Apache Flink builds on ideas from Hadoop and adds optimization and transformation mechanisms from distributed databases and parallel collections. Flink runs on top of HDFS and YARN, but it optimizes execution much as relational databases optimize SQL queries. Its execution model is based on a memory management scheme that favors in-memory processing and degrades gracefully to disk when necessary. The same engine supports both batch and true streaming. Flink has an elegant Scala-based API that closely resembles the Scala collections library.
Join us in exploring Apache Flink's state-of-the-art data processing capabilities, end-user benefits, and new Big Data algorithms.