📊 Scrape, structure, and store news data using Decodo, Gemini AI and Google Sheets

⚡ 249 views · 📊 Market Research & Insights

Description

Automatically scrape, structure, and log forum or news content using Decodo and Google Gemini AI. This workflow extracts key details like titles, URLs, authors, and engagement stats, then appends them to a Google Sheet for tracking and analysis.

Who's it for?

Ideal for data journalists, market researchers, or AI enthusiasts who want to monitor trending topics across specific domains.

How it works

  1. Trigger: The workflow runs on a schedule.
  2. Data Setup: Defines the target forum URLs and geolocation.
  3. Scraping: Extracts raw text data using the Decodo API.
  4. AI Extraction: Gemini parses and structures the scraped text into clean JSON.
  5. Data Storage: Each news item is appended or updated in Google Sheets.
  6. Logging: Records scraping results for monitoring and debugging.
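
Steps 4 and 5 can be sketched outside n8n as follows. This is a minimal Python illustration, not the workflow's actual implementation: the field names (`title`, `url`, `author`, `engagement`), the helper names `parse_items` and `upsert_rows`, and the choice of `url` as the matching key are all assumptions, and an in-memory list stands in for the Google Sheet.

```python
import json

# Hypothetical field names; the workflow's real schema may differ.
REQUIRED_FIELDS = ("title", "url")
OPTIONAL_FIELDS = ("author", "engagement")


def parse_items(llm_output: str) -> list[dict]:
    """Parse the model's JSON text and keep only well-formed news items."""
    data = json.loads(llm_output)
    if not isinstance(data, list):
        raise ValueError("expected a JSON array of news items")
    items = []
    for entry in data:
        if not isinstance(entry, dict):
            continue
        # Skip entries missing a required field instead of guessing values.
        if any(not entry.get(field) for field in REQUIRED_FIELDS):
            continue
        item = {field: entry[field] for field in REQUIRED_FIELDS}
        for field in OPTIONAL_FIELDS:
            item[field] = entry.get(field)
        items.append(item)
    return items


def upsert_rows(rows: list[dict], items: list[dict], key: str = "url") -> tuple[int, int]:
    """Append new items and update existing ones in place, matched on `key`.

    Returns (appended, updated). `rows` stands in for the sheet's rows.
    """
    index = {row[key]: i for i, row in enumerate(rows)}
    appended = updated = 0
    for item in items:
        if item[key] in index:
            # Only overwrite with values the model actually supplied.
            changes = {k: v for k, v in item.items() if v is not None}
            rows[index[item[key]]].update(changes)
            updated += 1
        else:
            index[item[key]] = len(rows)
            rows.append(dict(item))
            appended += 1
    return appended, updated


if __name__ == "__main__":
    sheet = [{"title": "Old title", "url": "https://example.com/a",
              "author": "ann", "engagement": 3}]
    raw = ('[{"title": "New title", "url": "https://example.com/a", "engagement": 10},'
           ' {"title": "Second", "url": "https://example.com/b", "author": "bob"},'
           ' {"title": "No URL"}]')
    print(upsert_rows(sheet, parse_items(raw)))
```

Matching on a stable key such as the URL is what lets a scheduled run refresh engagement numbers for known items without duplicating rows. Entries that lack a required field are dropped rather than stored half-empty.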

How to set up

🔗 Nodes Used

Google Sheets, Schedule Trigger, Basic LLM Chain, Structured Output Parser, Google Gemini Chat Model

📥 Import

Download workflow.json and import it into n8n: Workflow menu → Import from File
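
Before importing, it can help to confirm that the downloaded file is valid JSON and to see which node types it contains. The sketch below assumes the usual n8n export layout, a top-level `nodes` array whose entries carry a `type` field. The helper name `summarize_workflow` and the file location are hypothetical.

```python
import json
from collections import Counter
from pathlib import Path


def summarize_workflow(workflow: dict) -> Counter:
    """Count nodes by type in an exported workflow dictionary."""
    nodes = workflow.get("nodes", [])
    return Counter(node.get("type", "unknown") for node in nodes)


if __name__ == "__main__":
    # Assumes workflow.json sits in the current directory.
    path = Path("workflow.json")
    if path.exists():
        workflow = json.loads(path.read_text(encoding="utf-8"))
        for node_type, count in summarize_workflow(workflow).most_common():
            print(f"{count:>3}  {node_type}")
    else:
        print("workflow.json not found")
```

If the file fails to parse, `json.loads` raises an error before anything is imported, which is usually easier to diagnose than a failed import inside n8n.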
