⚒️ Evaluate AI agent response correctness with OpenAI and RAGAS methodology

1,460 views · ⚒️ Engineering

Description

This n8n template demonstrates how to calculate the evaluation metric “Correctness” which in this scenario, measures the compares and classifies the agent’s response against a set of ground truths.

The scoring approach is adapted from the open-source evaluations project RAGAS and you can see the source here https://github.com/explodinggradients/ragas/blob/main/ragas/src/ragas/metrics/_answer_correctness.py

How it works

Requirements

🔗 Nodes Used

HTTP Request, AI Agent, Basic LLM Chain, OpenAI Chat Model, Structured Output Parser, Chat Trigger

📥 Import

Download workflow.json and import into n8n: Workflow menu → Import from File

📖 Importing guide · 🔑 Credential setup