Data curation is the highest-leverage way to improve AI systems, from foundational models to RAG pipelines.
In AI like in Machine Learning, data is the crux of most problems. Whether you are developing GenAI apps, building AI agents, curating an evaluation set for your RAG pipeline, creating a training set to fine-tune an LLM, or trying to understand how your product is performing, you must become one with your data. In this talk, Airtrain Founder/CEO Emmanuel Turlay will discuss the importance of high-quality data for AI/ML data workflows, why and when you should use a data-first approach when choosing AI tooling, what real-life business and academic use cases benefit, and how Airtrain AI can improve and maintain data quality.