๐Ÿ“– Process documents with recursive chunking using Google Drive, OpenAI & Gemini RAG

โšก 3,048 views ยท ๐Ÿ“– Internal Wiki & Knowledge Base

Description

1. Document Ingestion & Processing

Google Drive Trigger monitors for new files โ†’ Loop Over Items processes each file โ†’ File Info extracts metadata โ†’ Google Drive downloads the actual content โ†’ Switch routes to appropriate extractors (PDF or TEXT) based on file type

2. Content Transformation & Chunking

Document Data node processes extracted text โ†’ Recursive Splitter breaks content into contextual chunks โ†’ Chunk Splitting applies intelligent segmentation while preserving document context and relationships between chunks

3. Embedding & Storage

Basic LLM Chain processes chunks โ†’ OpenAI Chat Model generates contextual understanding โ†’ Summarize creates document summaries โ†’ Supabase Vector Store saves embeddings with metadata โ†’ Embeddings OpenAI creates vector representations โ†’ Default Data Loader handles storage operations

4. Query Processing & Retrieval

When Clicking Execute triggers user queries โ†’ OpenAI processes and understands the question โ†’ AI Agent orchestrates hybrid search (combining vector similarity + keyword matching) โ†’ Google Gemini Chat Model generates final responses using retrieved context โ†’ HTTP Request handles additional external data sources

๐Ÿ”— Nodes Used

Google Drive, Google Drive Trigger, AI Agent, Basic LLM Chain, Embeddings OpenAI, OpenAI Chat Model

๐Ÿ“ฅ Import

Download workflow.json and import into n8n: Workflow menu โ†’ Import from File

๐Ÿ“– Importing guide ยท ๐Ÿ”‘ Credential setup