🔬 Compare local Ollama Vision models for image analysis using Google Docs



Description

Process images using locally hosted Ollama Vision Models to extract detailed descriptions, contextual insights, and structured data. Save results directly to Google Docs for efficient collaboration.

Who is this for?

This workflow is ideal for developers, data analysts, marketers, and AI enthusiasts who need to process and analyze images with locally hosted Ollama Vision Models. It is particularly useful for tasks that require detailed image descriptions, contextual analysis, and structured data extraction.

What problem is this workflow solving? / Use Case

The workflow addresses the challenge of extracting detailed, meaningful insights from images: identifying objects, analyzing spatial relationships, extracting text, and providing contextual information. This is especially helpful for applications in real estate, marketing, engineering, and research.

What this workflow does

This workflow:

  1. Downloads an image file from Google Drive.
  2. Processes the image using multiple Ollama Vision Models (e.g., Granite3.2-Vision, Gemma3, Llama3.2-Vision).
  3. Generates detailed markdown-based descriptions of the image.
  4. Saves the output to a Google Docs file for easy sharing and further analysis.
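
Steps 2 and 3 above can be sketched outside n8n as a plain HTTP call. This is a minimal sketch, assuming Ollama listens on its default address http://localhost:11434 and that the image is sent to the /api/generate endpoint. The prompt text, the exact model tags, and the helper names build_payload, describe_image, and to_markdown are illustrative assumptions, not taken from the workflow itself.

```python
import base64
import json
import urllib.request

# Assumption: default local Ollama address; adjust if yours differs.
OLLAMA_URL = "http://localhost:11434/api/generate"

# Assumption: Ollama library tags for the models named in this description.
MODELS = ["granite3.2-vision", "gemma3", "llama3.2-vision"]

# Hypothetical prompt; the workflow's actual prompt is not shown on this page.
PROMPT = "Describe this image in exhaustive detail. Answer in markdown."


def build_payload(model, image_bytes, prompt=PROMPT):
    """Build the JSON body for one non-streaming /api/generate request."""
    return {
        "model": model,
        "prompt": prompt,
        # Ollama expects images as base64-encoded strings.
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        "stream": False,
    }


def describe_image(model, image_bytes):
    """Send the image to one model and return its text response."""
    body = json.dumps(build_payload(model, image_bytes)).encode("utf-8")
    request = urllib.request.Request(
        OLLAMA_URL, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request) as reply:
        return json.loads(reply.read())["response"]


def to_markdown(results):
    """Combine per-model descriptions into one markdown document."""
    sections = [f"## {model}\n\n{text.strip()}" for model, text in results.items()]
    return "\n\n".join(sections)
```

Calling describe_image once per entry in MODELS with the same image bytes, then passing the collected results to to_markdown, would yield one section per model, which makes the outputs easy to compare side by side before they are saved to Google Docs.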

Setup

  1. Ensure you have access to a local instance of Ollama (https://ollama.com/).
  2. Pull the Ollama vision models you want to compare, using ollama pull.
  3. Configure your Google Drive and Google Docs credentials in n8n.
  4. Provide the image file ID from Google Drive in the designated node.
  5. Update the list of Ollama vision models in the workflow to match the models you pulled.
  6. Test the workflow by clicking ‘Test Workflow’ to trigger the process.
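
Before running the test, it can help to confirm that every model in the list has actually been pulled. This is a minimal sketch, assuming the local Ollama instance is reachable at its default address and exposes its installed models through the /api/tags endpoint. The helper names installed_models, missing_models, and fetch_tags are hypothetical, and the comparison deliberately ignores the part after the colon in a model name.

```python
import json
import urllib.request

# Assumption: default local Ollama address; adjust if yours differs.
TAGS_URL = "http://localhost:11434/api/tags"


def installed_models(tags_response):
    """Return model names from an /api/tags response, without the ':tag' suffix."""
    return {entry["name"].split(":")[0] for entry in tags_response.get("models", [])}


def missing_models(wanted, tags_response):
    """List the wanted models that are not present locally, in the original order.

    Size or version suffixes (the part after ':') are ignored, so two variants
    of the same model are treated as the same model here.
    """
    present = installed_models(tags_response)
    return [name for name in wanted if name.split(":")[0] not in present]


def fetch_tags(url=TAGS_URL):
    """Query the local Ollama instance for its installed models."""
    with urllib.request.urlopen(url) as reply:
        return json.loads(reply.read())
```

For example, missing_models(["granite3.2-vision", "gemma3", "llama3.2-vision"], fetch_tags()) would return the names that still need to be pulled, or an empty list if all three are available.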

How to customize this workflow to your needs

Key Features:

🔗 Nodes Used

HTTP Request, Google Drive, Google Docs, Extract from File

📥 Import

Download workflow.json and import it into n8n: Workflow menu → Import from File
