RAG-Arena

Posts in tags: "RAG-Arena" (1 post)

April 26, 2026

Research

Inside the RAG Arena: When the Judges Don't Agree

A 200-item RAG arena ended in a tie on mean score, but two LLM judges agreed only at Spearman ρ = 0.55. They aren't measuring the same thing.

RAG-Arena, ScoredQA, RAG Routing, EXIT, LLM-as-Judge, Spearman, Evaluation, QLoRA
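The summary's point, that two judges can tie at the mean and still rank items differently, can be illustrated with a small sketch. The scores below are made-up toy values, not data from the post, and the helper names (`average_ranks`, `pearson`, `spearman`) are hypothetical. The sketch computes Spearman's ρ as the Pearson correlation of average ranks, using only the standard library.

```python
from statistics import mean


def average_ranks(values):
    """Rank values from 1..n; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of items tied with the item at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1  # positions are 0-based, ranks 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman(x, y):
    """Spearman's rho: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Toy per-item scores from two hypothetical judges (illustrative only).
judge_a = [1, 2, 3, 4, 5, 6]
judge_b = [2, 1, 6, 3, 5, 4]

print(mean(judge_a), mean(judge_b))            # 3.5 3.5
print(round(spearman(judge_a, judge_b), 3))    # 0.543
```

Both toy judges average 3.5, yet their item-level rankings correlate only moderately, which is the kind of gap a mean-only comparison hides.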