@Pasquale Antonante and I just published a deep dive into our findings on RAG evaluation.
What’s inside:
• In-Depth Metric Analysis: pros & cons of various deterministic and LLM-based retrieval metrics
• Comparative Benchmarking: GPT-4, GPT-3.5, and Claude 2.1 in retrieval assessment without ground-truth labels
• Step-by-Step Guide: using metrics for systematic quality enhancement
• Open-Source Tool: continuous-eval for plug-&-play evaluation on your dataset
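To give a flavor of what a deterministic retrieval metric looks like, here is a minimal, self-contained sketch of chunk-level precision/recall/F1 against ground-truth contexts. This is an illustration only, not continuous-eval's actual API; the function name and exact-match comparison are assumptions for the example.

```python
def precision_recall_f1(retrieved: list[str], ground_truth: list[str]) -> dict:
    """Score retrieved chunks against ground-truth chunks by exact match.

    Illustrative sketch of a deterministic retrieval metric;
    continuous-eval's real implementation differs.
    """
    relevant = set(ground_truth)
    hits = sum(1 for chunk in retrieved if chunk in relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}

scores = precision_recall_f1(
    retrieved=["chunk A", "chunk B", "chunk C"],
    ground_truth=["chunk A", "chunk D"],
)
print(scores)  # precision ≈ 0.33, recall = 0.5, f1 = 0.4
```

Metrics like this are cheap and reproducible, but they require labeled contexts; that limitation is exactly what motivates the LLM-based metrics and label-free benchmarking discussed in the post.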
Whether you are already experienced with RAG evaluation or new to setting it up for your pipeline, we'd love to hear your feedback!