πŸ”¬ Beginner AI dataset generator using OpenAI + LangChain in n8n

⚑ 2,003 views Β· πŸ”¬ Document Extraction & Analysis

Description

This n8n workflow dynamically generates a realistic sample dataset based on a single topic you provide. It uses OpenAI (via LangChain) and n8n’s built-in nodes to:

  1. Generate structured JSON data for 5 columns with 3–5 values each
  2. Flatten that data into a single text blob
  3. Infer meaningful column names via a second AI call
  4. Pivot, split, merge, and rename columns automatically
  5. Output a clean, labeled dataset ready for export or further processing

βš™οΈ Prerequisites

  1. OpenAI API Key

  2. LangChain nodes enabled in your n8n instance

πŸ₯‡ Step 1: Set Up OpenAI Credential

  1. Go to OpenAI API Keys
  2. Create and copy your key
  3. In n8n: Credentials β†’ New β†’ OpenAI API β†’ paste key as β€œOpenAi account”

πŸ₯ˆ Step 2: Manual Trigger

πŸ₯‰ Step 3: Set Topic

✨ Step 4: Generate Structured Data

πŸ”§ Step 5: Parse AI Output

πŸ”„ Step 6: Flatten Data

🧠 Step 7: Generate Column Names

πŸ”’ Step 8: Pivot Names Row

πŸͺ“ Step 9: Split Columns

πŸ”— Step 10: Merge Rows

🏷️ Step 11: Rename Columns

πŸ”— Step 12: Final Output


🏁 Done! You now have a fully AI-driven, labeled dataset generated from a single topicβ€”no external services needed. Easily extend by adding a Google Sheets or HTTP node to export.

πŸ“¬ Need Help or Want to Customize This?

πŸ“§ robert@ynteractive.com
πŸ”— LinkedIn

πŸ”— Nodes Used

AI Agent, OpenAI Chat Model, Structured Output Parser, Think Tool

πŸ“₯ Import

Download workflow.json and import into n8n: Workflow menu β†’ Import from File

πŸ“– Importing guide Β· πŸ”‘ Credential setup