Detecting and Preventing Hallucinations in Large Vision Language Models
Paper
•
2308.06394
•
Published
None defined yet.
Agentic Rubrics as Contextual Verifiers for SWE Agents
ResearchRubrics: A Benchmark of Prompts and Rubrics For Evaluating Deep Research Agents