📊 Cluster webpage topics from Google Sheets to Google Sheets for AI discovery

⚡ 166 views · 📊 Market Research & Insights

💡 Pro Tip: HTTP Request scraping tends to break when sites update their markup. If you're scraping a major platform, check if ScraperNode covers it; it has maintained scrapers for LinkedIn, Instagram, TikTok, YouTube, and 20+ other platforms that return structured data.

📊 Description

Streamline AI-focused SEO research by automatically analyzing URLs stored in Google Sheets, extracting semantic signals from each webpage, and generating high-quality topic clusters for AI discovery. 🤖 This automation fetches URLs weekly, scrapes headings (H1–H6), extracts entities, keywords, topics, and summaries using GPT-4o-mini, and classifies each page into clusters and subclusters optimized for LLM search visibility. It also generates internal linking suggestions for better topical authority and writes all results back into Google Sheets. Perfect for content strategists, SEO teams, and AI-search optimization workflows. 📈🧩
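As a rough illustration of the per-page record this description implies, the sketch below parses a model reply into a flat row that could be written back to a sheet. The field names mirror the ones named in the description; the JSON reply format, the `PageAnalysis` class, the `parse_reply` and `to_row` helpers, the status values, and the column order are all assumptions made for illustration, not details taken from the template.

```python
import json
from dataclasses import dataclass, field


@dataclass
class PageAnalysis:
    """One analyzed URL (hypothetical structure; field names follow the description)."""
    url: str
    title: str = ""
    entities: list[str] = field(default_factory=list)
    keywords: list[str] = field(default_factory=list)
    topics: list[str] = field(default_factory=list)
    summary: str = ""
    cluster: str = ""
    subcluster: str = ""
    internal_links: list[str] = field(default_factory=list)
    status: str = "pending"  # assumed status flag values: pending / done / error


TEXT_FIELDS = ("title", "summary", "cluster", "subcluster")
LIST_FIELDS = ("entities", "keywords", "topics", "internal_links")


def parse_reply(url: str, reply: str) -> PageAnalysis:
    """Parse an assumed JSON reply; flag the record as 'error' if it is unusable."""
    page = PageAnalysis(url=url)
    try:
        data = json.loads(reply)
    except json.JSONDecodeError:
        page.status = "error"
        return page
    if not isinstance(data, dict):
        page.status = "error"
        return page
    for name in TEXT_FIELDS:
        setattr(page, name, str(data.get(name) or "").strip())
    for name in LIST_FIELDS:
        value = data.get(name)
        if isinstance(value, list):
            # Keep only non-empty entries, normalized to stripped strings.
            setattr(page, name, [str(v).strip() for v in value if str(v).strip()])
    page.status = "done"
    return page


def to_row(page: PageAnalysis) -> list[str]:
    """Flatten to one spreadsheet row; list fields become comma-separated text."""
    return [
        page.url,
        page.title,
        ", ".join(page.entities),
        ", ".join(page.keywords),
        ", ".join(page.topics),
        page.summary,
        page.cluster,
        page.subcluster,
        ", ".join(page.internal_links),
        page.status,
    ]
```

Flagging unparseable replies with a status value, rather than raising, keeps one bad page from stopping a whole run; whether the template handles failures this way is not stated in the description.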

What This Template Does

1️⃣ Triggers weekly to process URLs stored in Google Sheets. πŸ“… 2️⃣ Fetches all URL records from the configured sheet. πŸ“₯ 3️⃣ Processes URLs in batches to avoid API overload. πŸ” 4️⃣ Extracts webpage HTML and pulls semantic headings (H1–H6). πŸ“° 5️⃣ Sends headings + URL context to GPT-4o-mini for structured extraction of: β€” title β€” entities β€” keywords β€” topics β€” summary 6️⃣ Generates high-level cluster + subcluster labels for each page. 🧠 7️⃣ Recommends 3–5 internal linking URLs to strengthen topical authority. πŸ”— 8️⃣ Updates Google Sheets with all extracted fields + status flags. πŸ“Š 9️⃣ Repeats the process until all URLs are analyzed. πŸ”„

⭐ Key Benefits

✅ Automates topical clustering for AI search optimization
✅ Extracts entities, keywords, and topics with high semantic accuracy
✅ Strengthens internal linking strategies using AI suggestions
✅ Eliminates manual scraping and analysis work
✅ Enables scalable content audits for large URL datasets
✅ Enhances visibility in AI-driven search systems and answer engines

🧩 Features

Requirements

🎯 Target Audience

🔗 Nodes Used

Google Sheets, HTTP Request, Schedule Trigger, AI Agent, OpenAI Chat Model, Simple Memory

📥 Import

Download workflow.json and import into n8n: Workflow menu → Import from File
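Before importing, it can be useful to check which node types the downloaded file contains. The sketch below assumes the export is a JSON object with a top-level `nodes` array whose entries carry a `type` string, which is how n8n workflow exports are commonly laid out; `load_workflow` and `summarize_workflow` are hypothetical helper names, not part of n8n.

```python
import json
from collections import Counter
from pathlib import Path


def load_workflow(path: str) -> dict:
    """Read an exported workflow file and return it as a dict."""
    data = json.loads(Path(path).read_text(encoding="utf-8"))
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object at the top level")
    return data


def summarize_workflow(data: dict) -> Counter:
    """Count nodes by their `type` string (assumed layout: data['nodes'][i]['type'])."""
    nodes = data.get("nodes", [])
    return Counter(
        node.get("type", "unknown") for node in nodes if isinstance(node, dict)
    )
```

Calling `summarize_workflow(load_workflow("workflow.json")).most_common()` lists each node type with its count, which can then be compared against the nodes named in this listing.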