๐Ÿ”ฌ Parse Google Drive documents to RAG-ready Markdown with Landing.ai and Supabase cache

โšก 8 views ยท ๐Ÿ”ฌ Document Extraction & Analysis

Description

Make your unstructured large documents LLM ready markdown using LandingAI Document Parsing.


Use Cases


External services:


Credentials Required

Required


How it works

Once the pdf land in google drive location it trigger and it convert pdf (even more then 200 pages to LLM ready markdown). It also check in database if the parsing is already done or not, this help to avoid any unnecessary landingAI api call.

Setup Instructions

Step 1: Google Drive

  1. Create or select a folder in Google Drive
  2. Copy the folder ID
  3. Update the Google Drive Trigger node with this folder ID

Step 2: Landing.ai

  1. Create a Landing.ai account
  2. Generate an API key
  3. Add it in n8n as an HTTP Bearer Auth credential
  4. Update the organization-id header if required

Step 3: Supabase

  1. Create a Supabase project
  2. Create a table named landing_parse_cache
  3. Add fields such as:
    • file_id
    • document_name
    • mime_type
    • file_size_bytes
    • job_id
    • job_status
    • markdown
    • uploaded_at
    • workflow_run_id
  4. Connect Supabase credentials in n8n

Expected Input


Expected Output


Error Handling & Edge Cases


Customization Ideas


Categories


Difficulty Level

Advanced


Happy Automating - from Alok

๐Ÿ”— Nodes Used

HTTP Request, Google Drive, Google Drive Trigger, Supabase

๐Ÿ“ฅ Import

Download workflow.json and import into n8n: Workflow menu โ†’ Import from File

๐Ÿ“– Importing guide ยท ๐Ÿ”‘ Credential setup