🔬 Nested PDF analysis with Mistral AI & OneDrive for document summarization

720 views · 🔬 Document Extraction & Analysis

Description

Description

Automate the discovery and analysis of PDF files across a deeply nested OneDrive folder structure. This workflow recursively searches folders, filters for new or updated PDFs, extracts text, and uses a Mistral AI agent to generate a concise Executive Summary, Key Findings, and Structured Metadata (Date, Location, etc.), storing all insights into a n8n Data Table for easy access and further automation.

Key Features & How It Works

Smart Deduplication & Filtering:

For each new PDF, the workflow extracts the text and passes it to a Mistral AI model for dual-stream analysis:

Structured Output and Archiving:

Setup Notes

OneDrive Folder: You must specify the exact name of your main folder in the ‘Search for Main Folder’ node.

Data Table: Ensure your n8n Data Table exists with the required columns: Summary, Key_Findings, Scope, Date, Location, File_Name, and Path.

Deep Folder Structure: The current configuration supports up to 8 levels of subfolders. If your files go deeper, you may need to add more “Get items in a folder” and “If” nodes.

AI Customization: Review the AI agent prompts and the structured output schemas to customize the fields you want to extract or the summary style you require.

Extend This Workflow

The final output is organized data. You can easily extend this workflow to:

🔗 Nodes Used

Microsoft OneDrive, Schedule Trigger, AI Agent, Basic LLM Chain, Structured Output Parser, Extract from File

📥 Import

Download workflow.json and import into n8n: Workflow menu → Import from File

📖 Importing guide · 🔑 Credential setup