Speaker

Sergiy Matusevych

Topic

Apache REEF: stdlib for big data

Abstract

Apache REEF is a powerful yet simple framework for developing distributed applications for the cloud. It provides a layer of abstraction that isolates application logic from the low-level resource manager API, while letting developers of big data systems retain fine-grained control over cloud resources. Apache REEF addresses common problems of fault tolerance, task scheduling and coordination, caching, interprocess communication, and bulk data transfer. We will guide developers through a simple REEF application and discuss the current state of the Apache REEF project and its place in the Hadoop ecosystem.

Profile

Sergiy is a principal machine learning research engineer on the Microsoft Cloud AI team, where he builds large-scale distributed systems for big data and machine learning. He is a committer to the Apache REEF project. Prior to Microsoft, Sergiy worked as a data research engineer at Yahoo! Research and tried his hand at building machine learning systems at several Silicon Valley startups. Sergiy is interested in machine learning, data stream processing, and high-performance distributed systems.