Event Speaker - Global Data Science Conference

Name

Maximo Gurmendez
Company

Montevideo Labs
Designation

Founder

Ask a Question to this Speaker

Topic

Building a petabyte scale machine learning engine using Apache Spark

Abstract

The central premise of DataXu is to apply data science to better marketing. At its core, is the Real Time Bidding Platform that processes 2 Petabytes of data per day and responds to ad auctions at a rate of 2.1 million requests per second across 5 different continents. Serving on top of this platform is Dataxu’s analytics engine that gives their clients insightful analytics reports addressed towards client marketing business questions. Some common requirements for both these platforms are the ability to do real-time processing, scalable machine learning, and ad-hoc analytics. This talk will showcase DataXu’s successful use-cases of using the Apache Spark framework to address all of the above challenges while maintaining its agility and rapid prototyping strengths to take a product from initial R&D phase to full production. The team will share their best practices and highlight the steps of large scale Spark ETL processing, model testing, all the way through to interactive analytics.

Profile

Maximo holds a master's degree in computer science/AI from Northeastern University, where he attended as a Fulbright Scholar. As Chief Engineer of Montevideo Labs he leads data science engineering projects for complex systems in large US companies. He is an expert in big data technologies and co-author of the popular book "Mastering Machine Learning on AWS" . Additionally, Maximo is a computer science professor at the University of Montevideo and is director of its data science for business program.

Subscribe Newsletter

Please fill below credentials to Subscribe Newsletter

Global Big Data Conference

Speaker "Maximo Gurmendez" Details Back

Name

Maximo Gurmendez

Company

Montevideo Labs

Designation

Founder

Topic

Abstract

Profile