πŸ“Š 🌐 Firecrawl website content extractor

⚑ 713 views Β· πŸ“Š Market Research & Insights

πŸ’‘ Pro Tip β€” HTTP Request scraping tends to break when sites update their markup. If you’re scraping a major platform, check if ScraperNode covers it β€” it has maintained scrapers for LinkedIn, Instagram, TikTok, YouTube, and 20+ other platforms that return structured data.

View All Scrapers

Description

🌐 Firecrawl Website Content Extractor (n8n Workflow)

This n8n automation workflow uses Firecrawl API to extract structured data (e.g., quotes and authors) from web pages β€” such as Quotes to Scrape β€” and handles retries in case of delayed extraction.


πŸ” Workflow Overview

🎯 Purpose:


πŸ”§ Step-by-Step Node Breakdown

1. πŸ§ͺ Manual Trigger


2. πŸ“€ Firecrawl Extract API Request

> πŸ“Œ Uses an HTTP Header Auth credential for Firecrawl API


3. ⏱️ Wait for 30 Seconds


4. πŸ“₯ Get Results


5. βœ…βŒ Condition Check


6. πŸ” Retry Delay


7. πŸ› οΈ Edit Fields (Optional Output Formatting)


🧾 Sticky Note: Firecrawl Setup Guide

Included as an embedded reference:


βœ… Key Features


πŸ“¦ Sample Output Schema

{
  "quotes": [
    {
      "text": "The world as we have created it is a process of our thinking. It cannot be changed without changing our thinking.",
      "author": "Albert Einstein"
    },
    {
      "text": "It is our choices, Harry, that show what we truly are, far more than our abilities.",
      "author": "J.K. Rowling"
    }
  ]
}

## πŸ”— Nodes Used

HTTP Request

## πŸ“₯ Import

Download [`workflow.json`](workflow.json) and import into n8n:
**Workflow menu β†’ Import from File**

[πŸ“– Importing guide](../../../docs/importing-templates.md) Β· [πŸ”‘ Credential setup](../../../docs/credential-setup.md)