evaluate-rag

Community

Evaluate RAG pipeline quality.

Author: hamelsmu
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

This Skill provides a structured approach to evaluating Retrieval-Augmented Generation (RAG) systems, assessing the retrieval and generation components separately.

Core Features & Use Cases

  • Component-wise Evaluation: Separates the assessment of retrieval quality (e.g., Recall@k, MRR) from generation quality (faithfulness, relevance).
  • Dataset Generation: Offers methods for creating evaluation datasets, including manual curation and synthetic QA pair generation.
  • Chunking Optimization: Guides users on tuning chunking strategies (size, overlap, content-awareness) to improve retrieval.
  • Use Case: Debugging a RAG system that returns irrelevant information or generates factually incorrect answers by pinpointing whether the issue lies in document retrieval or the LLM's response generation.
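The retrieval metrics named in the Component-wise Evaluation feature can be sketched as follows. This is a minimal illustration, assuming retrieved results and relevance judgments are plain lists and sets of document IDs; it is not the skill's actual interface.

```python
def recall_at_k(retrieved, relevant, k):
    """Fraction of relevant documents that appear in the top-k results."""
    if not relevant:
        return 0.0
    top_k = set(retrieved[:k])
    return len(top_k & set(relevant)) / len(relevant)


def mrr(retrieved, relevant):
    """Reciprocal rank of the first relevant document (0 if none found)."""
    for rank, doc_id in enumerate(retrieved, start=1):
        if doc_id in relevant:
            return 1.0 / rank
    return 0.0


# Example: a query whose relevant documents are d2 and d7
retrieved = ["d1", "d2", "d5", "d7", "d9"]
relevant = {"d2", "d7"}
print(recall_at_k(retrieved, relevant, k=5))  # 1.0 -- both found in the top 5
print(mrr(retrieved, relevant))               # 0.5 -- first hit at rank 2
```

Averaging these per-query scores over an evaluation set gives the aggregate figures used to compare retriever configurations.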

Quick Start

Use the evaluate-rag skill to assess the retrieval quality of the RAG pipeline using the Recall@5 metric.
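The chunk size and overlap parameters tuned by the Chunking Optimization feature can be sketched with a simple fixed-size chunker. This is an illustrative assumption about how such a splitter works; real content-aware strategies (sentence or section boundaries) are more sophisticated.

```python
def chunk_text(text, size=500, overlap=50):
    """Split text into overlapping fixed-size character chunks."""
    if overlap >= size:
        raise ValueError("overlap must be smaller than chunk size")
    chunks = []
    step = size - overlap  # how far the window advances each iteration
    for start in range(0, len(text), step):
        chunks.append(text[start:start + size])
        if start + size >= len(text):
            break
    return chunks


doc = "x" * 1200
print([len(c) for c in chunk_text(doc, size=500, overlap=50)])
# [500, 500, 300]
```

Re-running the retrieval evaluation after varying `size` and `overlap` shows which configuration maximizes metrics such as Recall@5 on your dataset.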

Dependency Matrix

Required Modules

None required

Components

references

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: evaluate-rag
Download link: https://github.com/hamelsmu/evals-skills/archive/main.zip#evaluate-rag

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
