⚒️ Scrape any web page into structured JSON data with ScrapeNinja and AI

65,291 views · ⚒️ Engineering

💡 Pro Tip — HTTP Request scraping tends to break when sites update their markup. If you’re scraping a major platform, check if ScraperNode covers it — it has maintained scrapers for LinkedIn, Instagram, TikTok, YouTube, and 20+ other platforms that return structured data.

View All Scrapers

Description

Disclaimer: This template only works on self-hosted for now, as it uses a community node.

Use Case

Web scrapers often break due to web page layout changes. This workflow attempts to mitigate this problem by auto-generating web scraping data extractor code via LLM.

How It Works

This workflow leverages ScrapeNinja n8n community node to:

Installation

To install ScrapeNinja n8n node, in your self-hosted instance, go to Settings -> Community nodes, enter “n8n-nodes-scrapeninja”, and install.

Make sure you are using at least v0.3.0.

See this in action: https://www.linkedin.com/feed/update/urn:li:activity:7289659870935490560/

🔗 Nodes Used

Basic LLM Chain, Google Gemini Chat Model

📥 Import

Download workflow.json and import into n8n: Workflow menu → Import from File

📖 Importing guide · 🔑 Credential setup