⚒️ Evaluation metric: summarization


Description

This n8n template demonstrates how to calculate the “Summarization” evaluation metric. In this scenario, the metric measures how accurate and faithful an LLM’s summary is to the incoming YouTube transcript it was generated from.

The scoring approach is adapted from https://cloud.google.com/vertex-ai/generative-ai/docs/models/metrics-templates#pointwise_summarization_quality
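As a rough illustration of pointwise scoring with an LLM judge, the sketch below builds a judge prompt from a transcript and its summary, then validates the structured reply. It is not the template's actual prompt or parser: the 1 to 5 scale, the prompt wording, the JSON field names `score` and `reasoning`, and the helper names `build_prompt`, `parse_judge_output` and `normalize_score` are all assumptions made for this example. Check the linked page for the exact rubric.

```python
import json
from dataclasses import dataclass

# Assumed score range for this sketch; see the linked template for the real rubric.
MIN_SCORE, MAX_SCORE = 1, 5

# Hypothetical judge prompt; the wording is illustrative only.
PROMPT_TEMPLATE = (
    "You are evaluating a summary of a YouTube transcript.\n"
    "Rate how accurate and faithful the summary is to the transcript "
    "on a scale from {lo} (worst) to {hi} (best).\n"
    'Reply with JSON only: {{"score": <integer>, "reasoning": "<short text>"}}\n\n'
    "Transcript:\n{transcript}\n\n"
    "Summary:\n{summary}\n"
)


def build_prompt(transcript: str, summary: str) -> str:
    """Fill the judge prompt with the source transcript and the summary to grade."""
    return PROMPT_TEMPLATE.format(
        lo=MIN_SCORE, hi=MAX_SCORE, transcript=transcript, summary=summary
    )


@dataclass
class SummarizationScore:
    score: int
    reasoning: str


def parse_judge_output(raw: str) -> SummarizationScore:
    """Parse the judge's JSON reply and reject scores outside the assumed range."""
    data = json.loads(raw)
    score = data["score"]
    # bool is a subclass of int in Python, so exclude it explicitly.
    if not isinstance(score, int) or isinstance(score, bool):
        raise ValueError("score must be an integer")
    if not MIN_SCORE <= score <= MAX_SCORE:
        raise ValueError(f"score {score} is outside {MIN_SCORE}..{MAX_SCORE}")
    return SummarizationScore(score=score, reasoning=str(data.get("reasoning", "")))


def normalize_score(score: int) -> float:
    """Map the assumed 1..5 score onto 0.0..1.0 for easier aggregation."""
    return (score - MIN_SCORE) / (MAX_SCORE - MIN_SCORE)
```

In a workflow like this one, the prompt would roughly correspond to what the Basic LLM Chain sends to the chat model, and the validation step to what the Structured Output Parser enforces.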

How it works

Requirements

🔗 Nodes Used

Webhook, Google Drive, Basic LLM Chain, OpenAI Chat Model, Structured Output Parser, Extract from File

📥 Import

Download workflow.json and import it into n8n: Workflow menu → Import from File
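Before importing, it can help to check which node types the downloaded file contains, for example to confirm that the nodes listed above are available in your n8n instance. The sketch below assumes the usual n8n export layout, where the workflow JSON has a top-level `nodes` array and each node carries a `type` field. The helper names `node_types` and `node_types_from_file` are made up for this example.

```python
import json
from collections import Counter


def node_types(workflow: dict) -> Counter:
    """Count how often each node type appears in a parsed workflow export."""
    return Counter(
        node.get("type", "<unknown>") for node in workflow.get("nodes", [])
    )


def node_types_from_file(path: str = "workflow.json") -> Counter:
    """Load a workflow export from disk and count its node types."""
    with open(path, encoding="utf-8") as f:
        return node_types(json.load(f))


if __name__ == "__main__":
    # Assumes workflow.json sits in the current directory.
    for node_type, count in sorted(node_types_from_file().items()):
        print(f"{count:3d}  {node_type}")
```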

📖 Importing guide · 🔑 Credential setup