π Build & query RAG system with Google Drive, OpenAI GPT-4o-mini, and Pinecone
β‘ 5,058 views Β· π Internal Wiki & Knowledge Base
Description
π What This Workflow Does
This RAG Pipeline in n8n automates document ingestion from Google Drive, vectorizes it using OpenAI embeddings, stores it in Pinecone, and enables chat-based retrieval using LangChain agents.
Main Functions:
π Auto-detects new files uploaded to a specific Google Drive folder. π§ Converts the file into embeddings using OpenAI. π¦ Stores them in a Pinecone vector database. π¬ Allows a user to query the knowledge base through a chat interface. π€ Uses a GPT-4o-mini model with LangChain to generate intelligent responses using retrieved context. βοΈ Setup Instructions
- Connect Accounts Ensure these services are connected in n8n:
β Google Drive (OAuth2) β OpenAI β Pinecone You can do this in n8n > Credentials > New and use the matching names from the file:
Google Drive: βGoogle Drive account 2β OpenAI: βOpenAi successβ Pinecone: βPineconeApi account 2β 2. Folder Setup Upload your documents to this folder in Google Drive:
π Power Folder
The workflow is triggered every minute when a new file is uploaded.
- Workflow Overview A. File Ingestion Path
Google Drive Trigger β detects new file. Google Drive (Download) β downloads the new file. Recursive Text Splitter β splits text into chunks. Default Data Loader β loads content as LangChain documents. OpenAI Embeddings β converts text chunks into embeddings. Pinecone Vector Store β stores them in βragfileβ index. B. Chat Retrieval Path
When chat message received β AI Agent β LangChain agent managing tools. OpenAI Chat Model (GPT-4o-mini) β generates replies. Pinecone Vector Store (retrieval) β retrieves matching content. Embeddings OpenAI1 β helps match queries to document chunks.
π Nodes Used
Google Drive, Google Drive Trigger, AI Agent, Embeddings OpenAI, OpenAI Chat Model, Recursive Character Text Splitter
π₯ Import
Download workflow.json and import into n8n:
Workflow menu β Import from File