πŸ”¬ Turn websites into a Google Sheets database with MrScraper and Gmail

⚑ 2 views Β· πŸ”¬ Document Extraction & Analysis

πŸ’‘ Pro Tip β€” Job boards are notoriously hard to scrape β€” CAPTCHAs, rate limits, constantly changing layouts. ScraperNode has maintained scrapers for Indeed jobs, Glassdoor reviews, and Glassdoor jobs that handle all of that for you.

View All Scrapers

Description

Turn Internet Into Database β€” n8n Workflow

Description

This n8n template automates the entire process of turning any website into a structured database β€” no manual scraping required. It uses MrScraper’s AI-powered agents to crawl a domain, extract listing pages, scrape detail pages, and export everything into Google Sheets with an email notification via Gmail.

Whether you’re building a real estate database, product catalog, job board aggregator, or competitor price tracker, this workflow handles the full pipeline end-to-end.


How It Works


How to Set Up

  1. Create 3 scrapers in your MrScraper account:

    • Map Agent Scraper (for crawling/URL discovery)
    • Listing Agent Scraper (for extracting detail URLs from listing pages)
    • General Agent Scraper (for extracting structured data from detail pages)
    • Copy the scraperId for each β€” you’ll need these in n8n.
  2. Enable AI Scraper API access in your MrScraper account settings.

  3. Add your credentials in n8n:

    • MrScraper API token
    • Google Sheets OAuth2
    • Gmail OAuth2
  4. Configure the Map Agent node:

    • Set your target domain URL (e.g. https://example.com)
    • Set includePatterns to match listing pages (e.g. /category/)
    • Adjust maxDepth, maxPages, and limit as needed
  5. Configure the Listing Agent node:

    • Enter the Listing scraperId
    • Set maxPages based on how many pages per listing URL to scrape
  6. Configure the General Agent node:

    • Enter the General scraperId
  7. Connect Google Sheets:

    • Enter your spreadsheet and sheet tab URL
    • Choose append or upsert strategy (recommended: upsert by url)
  8. Configure Gmail:

    • Set recipient email, subject line, and message body

Requirements


Good to Know


Customising This Workflow

πŸ”— Nodes Used

Google Sheets, Gmail

πŸ“₯ Import

Download workflow.json and import into n8n: Workflow menu β†’ Import from File

πŸ“– Importing guide Β· πŸ”‘ Credential setup