📊 Automate niche research with Wikipedia, GPT-4o-mini, and Google Sheets

⚡ 70 views · 📊 Market Research & Insights

💡 Pro Tip: HTTP Request scraping tends to break when sites update their markup. If you’re scraping a major platform, check whether ScraperNode covers it; it maintains scrapers for LinkedIn, Instagram, TikTok, YouTube, and 20+ other platforms that return structured data.


Description

Overview

This n8n template automates research on niche topics. It searches for a topic on Wikipedia, scrapes the matching page through ScrapeOps, extracts the history or background section, and uses GPT-4o-mini to generate a concise summary and timeline. The results are saved automatically to Google Sheets for easy content planning.

Who is this for?

What problems it solves

How it works

  1. Define Topic: You set a keyword in the workflow.
  2. Locate Page: The workflow queries the Wikipedia API to find the correct page URL.
  3. Smart Scraping: It uses the ScrapeOps Proxy API to fetch the page content reliably.
  4. Extraction: A Code node parses the HTML to find the “History”, “Origins”, or “Background” section.
  5. AI Processing: GPT-4o-mini summarizes the text and extracts key dates for a timeline.
  6. Storage: The structured data is appended to a Google Sheet.
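
Steps 2 to 4 above can be sketched outside n8n as three small Python functions. This is a minimal illustration, not the template's actual Code node. The `opensearch` action is one Wikipedia API call that returns page URLs, but the template may use a different one. The ScrapeOps endpoint and its `api_key`/`url` parameters are assumptions to confirm against the ScrapeOps documentation. The function names and the `<h2>`-based parsing rule are hypothetical.

```python
import re
from html.parser import HTMLParser
from urllib.parse import urlencode

WIKI_API = "https://en.wikipedia.org/w/api.php"
SCRAPEOPS_PROXY = "https://proxy.scrapeops.io/v1/"  # assumed endpoint; check ScrapeOps docs
SECTION_RE = re.compile(r"\b(history|origins|background)\b", re.IGNORECASE)


def build_search_url(topic: str) -> str:
    """Step 2: opensearch query; its JSON reply includes matching page URLs."""
    params = {"action": "opensearch", "search": topic, "limit": 1,
              "namespace": 0, "format": "json"}
    return f"{WIKI_API}?{urlencode(params)}"


def build_proxy_url(api_key: str, page_url: str) -> str:
    """Step 3: wrap the page URL so the fetch goes through the proxy."""
    return f"{SCRAPEOPS_PROXY}?{urlencode({'api_key': api_key, 'url': page_url})}"


class _SectionParser(HTMLParser):
    """Collects <p> text under the first <h2> whose title matches SECTION_RE."""

    def __init__(self):
        super().__init__()
        self.in_h2 = self.in_p = False
        self.capturing = self.done = False
        self.heading = ""
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag == "h2":
            if self.capturing:          # next section begins: stop for good
                self.capturing, self.done = False, True
            self.in_h2, self.heading = True, ""
        elif tag == "p":
            self.in_p = True

    def handle_endtag(self, tag):
        if tag == "h2":
            self.in_h2 = False
            if not self.done and SECTION_RE.search(self.heading):
                self.capturing = True
        elif tag == "p":
            self.in_p = False
            if self.capturing:
                self.chunks.append("\n")  # keep paragraph breaks

    def handle_data(self, data):
        if self.in_h2:
            self.heading += data
        elif self.capturing and self.in_p:
            self.chunks.append(data)


def extract_section(html: str) -> str:
    """Step 4: return the plain text of the first matching section."""
    parser = _SectionParser()
    parser.feed(html)
    text = re.sub(r"\[\d+\]", "", "".join(parser.chunks))  # drop [1]-style footnotes
    return re.sub(r"[ \t]+", " ", text).strip()
```

In a real run, the URL from `build_search_url` would be fetched first, the page URL from its reply passed to `build_proxy_url`, and the returned HTML handed to `extract_section`. If Wikipedia changes its heading markup, the `<h2>` rule is the part most likely to need adjusting.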

Setup steps (~5-10 minutes)

  1. ScrapeOps Account:
    • Register for a free API key at ScrapeOps.
    • Configure the ScrapeOps Scraper node with your API key.
  2. OpenAI Account:
    • Add your OpenAI credentials to the Message a model node.
  3. Google Sheets:
    • Create a Google Sheet. You can duplicate this Template Sheet (copy the headers).
    • Connect your Google account to the Append row in sheet node and select your new sheet.
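
To make the sheet setup concrete, here is a rough sketch of how one appended row could be shaped. The column headers below are hypothetical, since the Template Sheet's real headers are not listed here. The JSON shape of the model reply (`summary` plus a `timeline` list of `year`/`event` pairs) is likewise an assumption. Rename the keys to match your own sheet and prompt.

```python
import json
from datetime import date

# Hypothetical column headers; replace with the headers in your own sheet.
HEADERS = ["Topic", "Wikipedia URL", "Summary", "Timeline", "Date Added"]


def build_row(topic: str, page_url: str, model_reply: str, today: date) -> dict:
    """Map an assumed JSON model reply onto the sheet columns."""
    data = json.loads(model_reply)
    # Flatten timeline entries into one cell, one "year: event" pair per line.
    timeline = "\n".join(
        f"{entry['year']}: {entry['event']}" for entry in data.get("timeline", [])
    )
    return {
        "Topic": topic,
        "Wikipedia URL": page_url,
        "Summary": data.get("summary", ""),
        "Timeline": timeline,
        "Date Added": today.isoformat(),
    }
```

Keeping the dictionary keys identical to the header row is what lets an append step match values to columns by name, so a renamed header in the sheet needs the same rename here.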

Pre-conditions

Disclaimer

This template uses ScrapeOps as a community node. You are responsible for complying with Wikipedia’s Terms of Use, robots directives, and applicable laws in your jurisdiction. Scraping targets may change at any time; adjust render/scroll/wait settings and parsers as needed. Use responsibly for legitimate business purposes.

🔗 Nodes Used

Google Sheets, HTTP Request, OpenAI

📥 Import

Download workflow.json and import into n8n: Workflow menu → Import from File

📖 Importing guide · 🔑 Credential setup