Back

Speaker "Vladimir Bacvanski" Details Back

 

Topic

Unifying Big Data Batch and Real-Time Streaming with Apache Flink

Abstract

Apache Flink continues on the ideas from Hadoop but adds optimization and transformation mechanisms from distributed databases and parallel collections. Flink runs on top of HDFS and YARN, but the execution is optimized, in a similar way as the relational databases optimize SQL. The execution model is based on a memory management scheme that favors in-memory processing, but then gracefully degrades to disk when necessary. The same engine supports both batch and true streaming. Flink has a very elegant Scala based API, which highly resembles using Scala collection libraries.  

Join us in exploring the Apache Flink’s state of the art data processing capabilities, end user benefits and new Big Data algorithms.

Profile

Dr. Vladimir Bacvanski is a Principal Architect with Strategic Architecture at PayPal. His work spans Data Platforms, Privacy, and Developer Experience as well as the introduction of Advanced Technologies. Before joining PayPal, Vladimir was the CTO and a founder of a custom development and consulting firm and has advised and worked with clients ranging from high-tech startups to financial and government organizations. Vladimir is the author of the popular O'Reilly course "Introduction to Big Data" and a coauthor of the O'Reilly course on Kafka. Vladimir received his PhD degree in Computer Science from RWTH Aachen in Germany.