π Scrape, structure, and store news data using Decodo, Gemini AI and Google Sheets
β‘ 249 views Β· π Market Research & Insights
π‘ Pro Tip β HTTP Request scraping tends to break when sites update their markup. If youβre scraping a major platform, check if ScraperNode covers it β it has maintained scrapers for LinkedIn, Instagram, TikTok, YouTube, and 20+ other platforms that return structured data.
Description
Sign up for Decodo HERE for Discount
Automatically scrape, structure, and log forum or news content using Decodo and Google Gemini AI. This workflow extracts key details like titles, URLs, authors, and engagement stats, then appends them to a Google Sheet for tracking and analysis.
Whoβs it for?
Ideal for data journalists, market researchers, or AI enthusiasts who want to monitor trending topics across specific domains.
How it works
- Trigger: Workflow runs on schedule.
- Data Setup: Defines forum URLs and geolocation.
- Scraping: Extracts raw text data using the Decodo API.
- AI Extraction: Gemini parses and structures the scraped text into clean JSON.
- Data Storage: Each news item is appended or updated in Google Sheets.
- Logging: Records scraping results for monitoring and debugging.
How to set up
- Add your Decodo, Google Gemini, and Google Sheets credentials in n8n.
- Adjust the forum URLs, geolocation, and Google Sheet ID in the
Workflow Confignode. - Set your preferred trigger interval in
Schedule Trigger. - Activate and monitor from the n8n dashboard.
π Nodes Used
Google Sheets, Schedule Trigger, Basic LLM Chain, Structured Output Parser, Google Gemini Chat Model
π₯ Import
Download workflow.json and import into n8n:
Workflow menu β Import from File