Speaker "Edward Pollack" Details
Name
Edward Pollack
Company
Transfinder
Designation
Data Architect
Topic
Quality AI Requires Quality Data
Abstract
As AI adoption increases rapidly and organizations clamor to take advantage of OpenAI, Copilot, and other powerful tools, many mistakes and oversights are made. These mistakes manifest themselves in security breaches, invalid results, and even offensive content. The immediate results are lost time, lost money, and embarrassment. What do these problems have in common? Bad data! Quality AI solutions require clean, documented, and validated data. This session dives into data quality with a strong focus on how data is maintained as it moves from transactional to analytic workloads. Some topics include:
• How transactional data can be maintained effectively without compromising performance.
• The importance of documentation in ensuring data is used correctly.
• How to validate data as it is moved, transformed, and crunched.
• Security implications of handing off data to AI applications.
• Ensuring that a source of truth exists for each data source.
This is a fast-paced session that promises both helpful best practices and some fun along the way.
Who is this presentation for?
Developers, data professionals, technical leaders, and anyone looking to ensure security, performance, and availability for AI/ML applications.
Prerequisite knowledge:
Basic understanding of data/databases and AI algorithms/applications.
What you'll learn?
The role of data quality in AI, as well as best practices for maximizing data quality to improve ML/AI software applications.
Profile
Ed Pollack is a Microsoft Data Platform MVP with a passion for learning how Data Platforms work and sharing that knowledge with the community. His experiences in data architecture, database design, performance optimization, and data security are motivation for public speaking, writing, coding, and other community activities. Ed has spoken at SQL Saturday events, SQL Bits, PASS Summit, EightKB, and many other regional and international events. Ed is the organizer of the Capital Area SQL Server Group and SQL Saturday Albany, as well as a co-organizer of SQL Saturday New York City and Future Data Driven. He has published a number of books, including "Dynamic SQL: Applications, Performance, and Security in Microsoft SQL Server", "Expert Performance Indexing in Azure SQL and SQL Server 2022", and "Analytics Optimization with Columnstore Indexes in Microsoft SQL Server: Optimizing OLAP Workloads". Ed is also an active contributor of content to SimpleTalk. In his free time, Ed enjoys video games, traveling, cooking exceptionally spicy foods, and hanging out with his amazing wife and sons.