Let the Model Be the Judge: Scaling Evaluation with AI
11 Jun 2025
Data Excellence Stage
Data Excellence
In the era of Agents, where AI systems become ever more powerful and general purpose, evaluation becomes more expensive. This talk introduces a scalable, effective and cost-conscious approach: using large language models to power evaluation. While it might sound counter intuitive, the talk will explore how the latest LLM-as-a-Judge techniques allow for reliable evaluation. Whether you're deploying AI in production or benchmarking research models, this session will offer practical insights and strategies for assessing AI—without breaking your budget.
Pass Type
VIP All Access Pass,Delegate Pass,Start-up & Investor Pass,Academic Pass,Press Pass
Content Focus
Strategy
Session Type
Fireside Chat
Session Focus
Strategy
Session Keyword
AI Tools and Applications