⚒️ 🔐🦙Private & local Ollama self-hosted + dynamic LLM router

15,233 views · ⚒️ Engineering

Description

Who is this for?

This workflow template is designed for AI enthusiasts, developers, and privacy-conscious users who want to leverage the power of local large language models (LLMs) without sending data to external services. It’s particularly valuable for those running Ollama locally who want intelligent routing between different specialized models.

What problem is this workflow solving?

When working with multiple local LLMs, each with different strengths and capabilities, it can be challenging to manually select the right model for each specific task. This workflow automatically analyzes user prompts and routes them to the most appropriate specialized Ollama model, ensuring optimal performance without requiring technical knowledge from the end user.

What this workflow does

This intelligent router:

Setup

  1. Ensure you have Ollama installed and running locally
  2. Pull the required models mentioned in the workflow using Ollama CLI (e.g., ollama pull phi4)
  3. Configure the Ollama API credentials in n8n (default: http://127.0.0.1:11434)
  4. Activate the workflow and start interacting through the chat interface

How to customize this workflow to your needs

This workflow demonstrates how n8n can be used to create sophisticated AI orchestration systems that respect user privacy by keeping everything local while still providing intelligent model selection capabilities.

🔗 Nodes Used

AI Agent, Ollama Chat Model, Simple Memory, Chat Trigger

📥 Import

Download workflow.json and import into n8n: Workflow menu → Import from File

📖 Importing guide · 🔑 Credential setup