🔬 Automate data extraction from faxes & PDFs using Google Gemini and Google Sheets

🔬 Document Extraction & Analysis

Description

This n8n template from Intuz automates the extraction of key information from faxes and other PDF documents.

It uses Google Gemini’s multimodal capabilities to read the document, identify key fields, and organize the data into a structured format, then saves the result directly to a Google Sheet.

Who’s this workflow for?

How it works

1. Upload via Web Form: The process starts when a user uploads a fax (as a PDF file) through a simple, secure web form generated by n8n.

2. AI Document Analysis: The PDF is sent directly to a Google Gemini multimodal model, which reads the entire document, including text, tables, and form fields, and extracts the relevant information based on a detailed prompt.

3. AI Data Structuring: The raw extracted text is then passed to a second AI step, which cleans the information and structures it into a predictable JSON format with fixed fields (e.g., Patient ID, Name, DOB).

4. Save to Google Sheets: The final, structured data is automatically appended as a new, clean row in your designated Google Sheet, creating an organized and usable dataset from the unstructured fax.

Key Requirements to Use This Template

1. n8n Instance & Required Nodes:

2. Google Accounts:

Customer Setup Guide:

Here is a detailed, step-by-step guide to help you configure and run this workflow.

1. Before You Begin: Prerequisites

Please ensure you have the following ready:

2. Step-by-Step Configuration

Step 1: Import the Workflow

Step 2: Set Up the Form Trigger

The workflow starts with the “On form submission” node.

Step 3: Configure the Google Drive Node

Step 4: Configure the Google Gemini AI Nodes (Very Important)

This workflow uses AI in two places, and both need to be connected.

  1. First AI Call (PDF Reading):
  2. Second AI Call (Data Structuring):
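
The node list for this template includes an HTTP Request node, so the first AI call plausibly sends the PDF to the Gemini API as base64 inline data. The Python sketch below only builds such a request body and does not send it. The helper names `endpoint_for` and `build_gemini_request` are hypothetical, and the model name is left as a parameter because the source does not state which model the workflow uses. Check the workflow's own node settings for the actual values.

```python
import base64

API_ROOT = "https://generativelanguage.googleapis.com/v1beta/models"


def endpoint_for(model: str) -> str:
    """Return the generateContent URL for a given model name."""
    return f"{API_ROOT}/{model}:generateContent"


def build_gemini_request(pdf_bytes: bytes, prompt: str) -> dict:
    """Build a request body with the PDF attached as inline base64 data."""
    encoded = base64.b64encode(pdf_bytes).decode("ascii")
    return {
        "contents": [
            {
                "parts": [
                    # The document itself, sent inline rather than as a file reference.
                    {"inline_data": {"mime_type": "application/pdf", "data": encoded}},
                    # The extraction instructions for the model.
                    {"text": prompt},
                ]
            }
        ]
    }
```

The API key would normally come from the credential connected in n8n, which is why this sketch leaves authentication out entirely.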

Step 5: (Optional) Customize What Data is Extracted

You have full control over what information the AI looks for.
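
One way to think about this customization: the Structured Output Parser node is driven by a description of the expected output, so changing the extracted fields means changing that description and the prompt together. The Python sketch below generates a JSON Schema from a plain field list. The field names and descriptions are hypothetical, and whether the template's parser is configured with a schema or with an example JSON object is not stated in this description.

```python
import json

# Hypothetical fields; replace with the ones your documents actually contain.
FIELDS = {
    "patient_id": "Identifier printed on the fax",
    "name": "Patient full name",
    "dob": "Date of birth as written in the document",
}


def build_schema(fields: dict) -> dict:
    """Return a JSON Schema object requiring every listed field as a string."""
    return {
        "type": "object",
        "properties": {
            key: {"type": "string", "description": description}
            for key, description in fields.items()
        },
        "required": list(fields),
        "additionalProperties": False,
    }


# Print the schema so it can be pasted wherever the workflow expects it.
print(json.dumps(build_schema(FIELDS), indent=2))
```

Adding a field then means adding one entry to `FIELDS` and one matching column in the sheet.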

Step 6: Configure the Final Google Sheets Node
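
When the Google Sheets node appends a row, each extracted field has to land in the matching column. That mapping is configured in the node itself, but the idea can be sketched in Python as ordering a record by the sheet's header row. The column headers, the key mapping, and the `to_row` helper are all hypothetical, and leaving missing values blank is one possible policy, not necessarily the template's.

```python
# Hypothetical header row; it should mirror the first row of your sheet.
HEADERS = ["Patient ID", "Name", "DOB"]

# Hypothetical mapping from sheet headers to keys in the structured JSON.
HEADER_TO_KEY = {"Patient ID": "patient_id", "Name": "name", "DOB": "dob"}


def to_row(record: dict) -> list:
    """Order a record's values by the sheet headers, using '' for missing ones."""
    row = []
    for header in HEADERS:
        value = record.get(HEADER_TO_KEY[header])
        row.append("" if value is None else str(value))
    return row
```

For example, `to_row({"name": "Jane", "patient_id": 7})` yields `["7", "Jane", ""]`, with the values in header order regardless of the order of keys in the record.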

4. Running the Workflow

Connect with us

For Custom Workflow Automation

Click here: Get Started

🔗 Nodes Used

Google Sheets, HTTP Request, Google Drive, Basic LLM Chain, Structured Output Parser, n8n Form Trigger

📥 Import

Download workflow.json and import into n8n: Workflow menu → Import from File

📖 Importing guide · 🔑 Credential setup