πŸ‘₯ Automated structured data extract & summary via Decodo + Gemini & Google Sheets

⚑ 148 views Β· πŸ‘₯ HR & Recruitment

πŸ’‘ Pro Tip β€” HTTP Request scraping tends to break when sites update their markup. If you’re scraping a major platform, check if ScraperNode covers it β€” it has maintained scrapers for LinkedIn, Instagram, TikTok, YouTube, and 20+ other platforms that return structured data.

View All Scrapers

Description

Who this is for

This workflow is designed for:

It is ideal for anyone who wants to transform any public URL into structured data + clean summaries automatically.

What problem this workflow solves

Web content is often unstructured, verbose, and inconsistent, making it difficult to:

This workflow solves the problem of turning arbitrary web pages into machine-readable JSON and human-readable summaries, without custom scrapers or manual parsing logic.

What this workflow does

The workflow integrates Decodo, Google Gemini, and Google Sheets to perform automated extraction of structured data.

Here’s how it works step-by-step:

  1. Input Setup

    • The workflow begins when the user executes it manually or passes a valid URL.
    • The input includes url.
  2. Profile Extraction with Decodo

Uses Google Gemini to:

  1. JSON Parsing & Merging

    • The Code Node cleans and parses the JSON output from the AI for reliable downstream use.
    • The Merge Node combines both structured data and the AI-generated summary.
  2. Data Storage in Google Sheets

    • The Google Sheets Node appends or updates the record, storing the structured JSON and summary into a connected spreadsheet.
  3. End Output

    • A unified, machine-readable data in JSON + an executive-level summary suitable data analysis or downstream automation.

Setup Instructions

Prerequisites

If you are new to Decode, please signup on this link visit.decodo.com

image.png

n8n Decodo

Setup Steps

  1. Import the workflow into your n8n instance.

  2. Configure Credentials

    • Add your Decodo API credentials in the Decodo node.
    • Connect your Google Gemini (PaLM) credentials for both AI nodes.
    • Authenticate your Google Sheets account.
  3. Edit Input Node

    • In the Set the Input Fields node, replace the default URL with your desired profile or dynamic data source.
  4. Run the Workflow

    • Trigger manually or via webhook integration for automation.
    • Verify that structured profile data and summary are written to the linked Google Sheet.

How to customize this workflow to your needs

You can easily extend or adapt this workflow:

Modify Structured Output

Improve Summarization

Change Output Destination

Add Validation or Filtering

Scale the Workflow

Summary

This workflow provides a powerful, generic solution for converting unstructured web pages into structured, AI-enriched datasets.

By combining Decodo for scraping, Google Gemini for intelligence, and Google Sheets for persistence, it enables repeatable, scalable, and production-ready data extraction without custom scrapers or brittle parsing logic.

πŸ”— Nodes Used

Google Sheets, Basic LLM Chain, Google Gemini Chat Model

πŸ“₯ Import

Download workflow.json and import into n8n: Workflow menu β†’ Import from File

πŸ“– Importing guide Β· πŸ”‘ Credential setup