Essential multipage website scraper with Jina.ai

18,059 views · Market Research & Insights

Description

Essential Multipage Website Scraper with Jina.ai

Use responsibly and follow local rules and regulations

This n8n workflow automates multi-page website scraping with Jina.ai and stores the scraped content in Google Drive. Here’s how it works:

Main Features

The workflow automatically scrapes multiple pages from a website’s sitemap and saves each page’s content as a separate Google Drive document.
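To make the sitemap step concrete, here is a minimal Python sketch of how page URLs could be pulled out of a sitemap. It is not taken from the workflow itself, which does this inside n8n nodes. The function name `extract_page_urls` and the example sitemap are hypothetical, and the sketch assumes a standard sitemaps.org `<urlset>` document.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol for <urlset> documents.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def extract_page_urls(sitemap_xml: str) -> list[str]:
    """Return the text of every <loc> element found in a sitemap document."""
    root = ET.fromstring(sitemap_xml)
    return [
        loc.text.strip()
        for loc in root.findall(".//sm:loc", SITEMAP_NS)
        if loc.text  # skip empty <loc/> elements
    ]


# Hypothetical sitemap, used only for illustration.
EXAMPLE_SITEMAP = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.com/</loc></url>
  <url><loc>https://example.com/blog/scraping-basics</loc></url>
</urlset>"""

page_urls = extract_page_urls(EXAMPLE_SITEMAP)
```

Each URL in `page_urls` would then correspond to one scraped page and one saved document.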

Key Components

Input Configuration

Scraping Process

Storage Integration

Usage Instructions

  1. Set your target website’s sitemap URL in the “Set Website URL” node
  2. Configure the “Filter By Topics or Pages” node to select specific content
  3. Adjust the “Limit” node (default: 20 pages) to control batch size
  4. Connect your Google Drive account
  5. Run the workflow to begin automated scraping
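
The filter and limit steps above can be sketched in Python as follows. This is an illustration only: `select_pages` and its keyword-substring matching are assumptions, since the actual conditions in the “Filter By Topics or Pages” node are not shown here. The default of 20 mirrors the “Limit” node default.

```python
def select_pages(urls, keywords, limit=20):
    """Keep URLs containing any keyword (case-insensitive), capped at `limit`.

    Substring matching is an assumed stand-in for the workflow's filter node.
    """
    wanted = [keyword.lower() for keyword in keywords]
    matches = [
        url for url in urls
        if any(keyword in url.lower() for keyword in wanted)
    ]
    # Slicing preserves sitemap order and enforces the batch size.
    return matches[:limit]


# Hypothetical URLs, used only for illustration.
candidate_urls = [
    "https://example.com/blog/scraping-basics",
    "https://example.com/about",
    "https://example.com/blog/Agents-101",
    "https://example.com/pricing",
]

selected = select_pages(candidate_urls, keywords=["blog"], limit=20)
```

Lowering `limit` is the simplest way to keep a first test run small before scraping a full sitemap.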

Additional Features

The workflow requires no API key for Jina.ai, making it accessible for immediate use while maintaining responsible scraping practices.
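
As a rough illustration of a keyless request, the sketch below assumes the workflow’s HTTP Request node calls Jina.ai’s public Reader endpoint, which takes the target page URL appended to `https://r.jina.ai/`. The node’s actual configuration is not shown in this description, so treat the endpoint choice, the helper names `reader_url` and `fetch_page_text`, and the `User-Agent` value as assumptions.

```python
from urllib.request import Request, urlopen

# Assumed endpoint: Jina.ai's public Reader prefix.
JINA_READER_PREFIX = "https://r.jina.ai/"


def reader_url(page_url: str) -> str:
    """Build the Reader URL by prefixing the target page URL."""
    return JINA_READER_PREFIX + page_url


def fetch_page_text(page_url: str, timeout: float = 30.0) -> str:
    """Fetch the Reader's text rendering of a page; no API key header is sent."""
    request = Request(
        reader_url(page_url),
        headers={"User-Agent": "multipage-scraper-sketch"},  # hypothetical value
    )
    with urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8")


# Building the URL needs no network access; fetching does.
example_request_url = reader_url("https://example.com/blog/scraping-basics")
```

Keyless access is typically rate limited, so pausing between requests is a reasonable precaution when scraping many pages.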

Nodes Used

HTTP Request, Google Drive, Filter

Import

Download workflow.json and import into n8n: Workflow menu → Import from File
