Speaker "Pat Patterson" Details

 

Topic

Filling the Enterprise Data Lake: How Cox Automotive Ingests Data From 25 Companies

Abstract

Cox Automotive comprises more than 25 companies, including name brands such as Kelley Blue Book and Autotrader, dealing with different aspects of the car ownership lifecycle. The challenge for Cox was to create an efficient engine for the timely and trustworthy ingest of data from a wide and shifting variety of data sources and schemas. As the subsidiary companies' businesses evolve, the structure and composition of their data gradually drift, breaking traditional approaches to data ingest. Discover how the Cox Automotive big data engineering team overcame data drift to populate their data lake, giving analysts easy access to data from subsidiary companies and producing new data assets unique to the industry.

Profile

Pat Patterson has been working with Internet technologies since 1997, building software and communities at Sun Microsystems, Huawei, Salesforce and StreamSets. At Sun, Pat was the community lead for the OpenSSO open source project, while at Huawei he developed cloud storage infrastructure software. As a developer evangelist at Salesforce, Pat focused on identity, integration and the Internet of Things. Now community champion at StreamSets, Pat is responsible for the care and feeding of the StreamSets open source community.