🤖 Multimodal Slack AI assistant with voice, image & video processing


Description

What it’s for:

This is a base template for anyone developing a Slack bot AI Agent. It accepts multiple input types (voice, picture, video, and text) and passes them to an AI model of your choosing to get you started. From here, you can connect whatever tools you see fit to the AI Agent for your n8n workflows.

NOTE: This build is specifically for integrating a Slack bot into a single chat channel.

If you want the Slack bot to work across the whole workspace, you’ll need to adjust some of the nodes and bot parameters.

How it works:

Input: A Slack message that mentions the bot in a chat channel

n8n Processing: A Switch node determines the message type:

  1. Voice Message
  2. Picture Message
  3. Video Message
  4. Text Message

(The workflow currently uses OpenAI and Gemini to analyze voice, photo, and video content, but feel free to swap these nodes for other models.)
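The routing step above can be sketched in plain Python. This is only an illustration, not the template's actual Switch node configuration: it assumes the message event carries a `files` list whose entries have a `mimetype` field (as Slack file objects do), and it routes on the first file only. The function name `classify_message` is hypothetical.

```python
from typing import Any, Dict


def classify_message(event: Dict[str, Any]) -> str:
    """Return 'voice', 'picture', 'video', or 'text' for a Slack message event.

    Assumption: routing is based on the MIME type of the first attached file.
    The real Switch node in the workflow may use different conditions.
    """
    files = event.get("files") or []
    if not files:
        # No attachment: treat the message as plain text.
        return "text"

    mimetype = str(files[0].get("mimetype", "")).lower()
    if mimetype.startswith("audio/"):
        return "voice"
    if mimetype.startswith("image/"):
        return "picture"
    if mimetype.startswith("video/"):
        return "video"

    # Unknown attachment types fall back to the text branch.
    return "text"
```

Falling back to the text branch for unrecognized attachments keeps the bot responsive even when a user uploads something the workflow has no analyzer for.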

AI Agent Processing: The LLM of your choosing examines the message and, based on the system prompt, generates an output

Output: The AI output is sent back as a Slack message
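For readers curious what the output step amounts to outside n8n, here is a minimal sketch of the JSON body that Slack's `chat.postMessage` method accepts. In the template the Slack node handles this for you; the helper name `build_reply` and the choice to reply in a thread are assumptions made for illustration.

```python
from typing import Dict, Optional


def build_reply(channel: str, text: str, thread_ts: Optional[str] = None) -> Dict[str, str]:
    """Build a chat.postMessage payload for the AI Agent's answer.

    `channel` is the Slack channel ID, `text` is the AI output, and
    `thread_ts` (optional) makes the reply land in the original thread.
    """
    payload = {"channel": channel, "text": text}
    if thread_ts:
        payload["thread_ts"] = thread_ts
    return payload
```

Passing the timestamp of the triggering message as `thread_ts` keeps the conversation grouped under the user's original mention; omit it to post at the top level of the channel.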

How to use:

1. Create your Slack bot and generate an access token

This is the longest part of the guide. If it’s hard to follow, feel free to search YouTube for “How to install Slack AI agent” or something similar.

2. Create Credentials in n8n

3. Add Bot Token to HTTP Request nodes

4. Change ALL Slack nodes to your Slack Workspace and Channel

5. Create an LLM access token

(The process differs per LLM; search Google for your LLM’s name plus “API”.)
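Step 3 above adds the bot token to the HTTP Request nodes. Assuming those nodes download files attached to Slack messages, the token is sent as a Bearer token in the `Authorization` header, since Slack's private file URLs (`url_private`) reject unauthenticated requests. The sketch below only builds the request; the function name and the placeholder values in the usage note are hypothetical.

```python
import urllib.request


def build_file_request(url_private: str, bot_token: str) -> urllib.request.Request:
    """Prepare an authenticated GET request for a Slack-hosted file.

    `bot_token` is the bot token from step 1 (Slack bot tokens start
    with "xoxb-"). Nothing is sent until the request is opened.
    """
    return urllib.request.Request(
        url_private,
        headers={"Authorization": f"Bearer {bot_token}"},
    )
```

To actually fetch the file you would pass the result to `urllib.request.urlopen(...)` and read the response bytes. In n8n the equivalent is setting the same `Authorization` header on the HTTP Request node.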

Requirements:

Customizing this workflow:

🔗 Nodes Used

HTTP Request, Slack, AI Agent, Anthropic Chat Model, Simple Memory, OpenAI

📥 Import

Download workflow.json and import into n8n: Workflow menu → Import from File
