This is a short summary. ↗ Open original to view full content
Evaluating the Effectiveness of LLM-Evaluators (aka LLM-as-Judge)
from blog Eugene Yan, | ↗ original
Related
More from Eugene Yan
39 lessons from Industry ML Conferences in 2024
3 Nov 2024 |
original ↗
ML systems, production & scaling, execution & collaboration, building for users, conference etiquette.
AlignEval: Building an App to Make Evals Easy, Fun, and Automated
27 Oct 2024 |
original ↗
Look at and label your data, build and evaluate your LLM-evaluator, and optimize it against your labels.
Hackathon Judge - Weights & Biases LLM-Evaluator Hackathon
22 Sept 2024 |
original ↗
Being a human judge at the Weights & Biases LLM-as-a-Judge Hackathon
Building the Same App Using Various Web Frameworks
8 Sept 2024 |
original ↗
FastAPI, FastHTML, Next.js, SvelteKit, and thoughts on how coding assistants influence builders' choices.
How to Interview and Hire ML/AI Engineers
7 Jul 2024 |
original ↗
What to interview for, how to structure the phone screen, interview loop, and debrief, and a few tips.