๐Ÿ”ฌ Audio transcription & chat bot with AssemblyAI, Gemini, and Pinecone RAG

โšก 569 views ยท ๐Ÿ”ฌ Document Extraction & Analysis

Description

Whoโ€™s it for

This template is designed for podcasters, researchers, educators, product teams, and support teams who work with audio content and want to turn it into searchable knowledge. It is especially useful for users who need automated transcription, structured summaries, and conversational access to audio data.

What it does / How it works

This workflow starts with a public form where users upload an audio file.
The audio is sent to AssemblyAI for speech-to-text processing, including speaker labels and bullet-point summarization.
Once transcription is complete, the full text is converted into a document, split into chunks, and embedded using Google Gemini.
The embeddings are stored in a Pinecone vector database along with metadata, making the content retrievable for future use.

In parallel, the workflow logs uploaded file information into Google Sheets for tracking.
A separate chat trigger allows users to ask questions about the uploaded audio files.
An AI agent retrieves relevant context from Pinecone and responds using Gemini, enabling conversational search over audio transcripts.

Requirements

How to set up

  1. Connect AssemblyAI, Gemini, Pinecone, and Google Sheets credentials in n8n.
  2. Configure the Pinecone index for storing transcripts.
  3. Verify the Google Sheet has columns for file name and status.
  4. Test by uploading an audio file through the form.
  5. Enable the workflow for continuous use.

How to customize the workflow

๐Ÿ”— Nodes Used

Google Sheets, HTTP Request, AI Agent, Simple Memory, Recursive Character Text Splitter, n8n Form Trigger

๐Ÿ“ฅ Import

Download workflow.json and import into n8n: Workflow menu โ†’ Import from File

๐Ÿ“– Importing guide ยท ๐Ÿ”‘ Credential setup