๐Ÿ“Š Automate news article scraping with ScrapegraphAI and store in Google Sheets

โšก 2,051 views ยท ๐Ÿ“Š Market Research & Insights

๐Ÿ’ก Pro Tip โ€” HTTP Request scraping tends to break when sites update their markup. If youโ€™re scraping a major platform, check if ScraperNode covers it โ€” it has maintained scrapers for LinkedIn, Instagram, TikTok, YouTube, and 20+ other platforms that return structured data.

View All Scrapers

Description

This workflow contains community nodes that are only compatible with the self-hosted version of n8n.

News Article Scraping and Analysis with AI and Google Sheets Integration

๐ŸŽฏ Target Audience

๐Ÿš€ Problem Statement

Manual news monitoring is time-consuming and often misses important articles. This template solves the challenge of automatically collecting, structuring, and storing news articles from any website for comprehensive analysis and tracking.

๐Ÿ”ง How it Works

This workflow automatically scrapes news articles from websites using AI-powered extraction and stores them in Google Sheets for analysis and tracking.

Key Components

๐Ÿ“Š Google Sheets Column Specifications

The template creates the following columns in your Google Sheets:

ColumnData TypeDescriptionExample
titleStringArticle headline and titleโ€œโ€˜My friend died right in front of meโ€™ - Student describes moment air force jet crashed into schoolโ€
urlURLDirect link to the articleโ€https://www.bbc.com/news/articles/cglzw8y5wy5oโ€
categoryStringArticle category or sectionโ€Asiaโ€

๐Ÿ› ๏ธ Setup Instructions

Estimated setup time: 10-15 minutes

Prerequisites

Step-by-Step Configuration

1. Install Community Nodes
# Install ScrapeGraphAI community node
npm install n8n-nodes-scrapegraphai
2. Configure ScrapeGraphAI Credentials
3. Set up Google Sheets Connection
4. Customize News Source Parameters
5. Configure Schedule Trigger
6. Test and Validate

๐Ÿ”„ Workflow Customization Options

Modify News Sources

Extend Data Collection

Output Customization

๐Ÿ“ˆ Use Cases

๏ฟฝ๏ฟฝ Important Notes

๐Ÿ”ง Troubleshooting

Common Issues:

Pro Tips:

Support Resources:

๐Ÿ”— Nodes Used

Google Sheets, Schedule Trigger

๐Ÿ“ฅ Import

Download workflow.json and import into n8n: Workflow menu โ†’ Import from File

๐Ÿ“– Importing guide ยท ๐Ÿ”‘ Credential setup