The AI Summit London audience

The AI Summit London 2025

Loading

Let the Model Be the Judge: Scaling Evaluation with AI

11 Jun 2025
Data Excellence Stage
Data Excellence

In the era of Agents, where AI systems become ever more powerful and general purpose, evaluation becomes more expensive. This talk introduces a scalable, effective and cost-conscious approach: using large language models to power evaluation. While it might sound counter intuitive, the talk will explore how the latest LLM-as-a-Judge techniques allow for reliable evaluation. Whether you're deploying AI in production or benchmarking research models, this session will offer practical insights and strategies for assessing AI—without breaking your budget.

Speakers
Matt Farrelly, Machine Learning Engineering Manager - Trainline

Pass Type

VIP All Access Pass,Delegate Pass,Start-up & Investor Pass,Academic Pass,Press Pass

Content Focus

Strategy

Session Type

Fireside Chat

Session Focus

Strategy

Session Keyword

AI Tools and Applications
Secure Your Pass

The AI Summit London 2025 Sponsors

Headliner Partners

Loading

Diamond Sponsors

Loading

Platinum Sponsors

Loading

Gold Sponsors

Loading

Silver Sponsors

Loading

Bronze Sponsors

Associate Sponsors

Loading

Media & Community Partners