📊 Extract & summarize Wikipedia data with Bright Data and Gemini AI

⚡ 583 views · 📊 Market Research & Insights

Description

Who this is for?

This workflow automates Wikipedia data extraction: it fetches a page through the Bright Data Web Unlocker, parses and cleans the content, and sends the results to a webhook URL you specify for downstream processing, reporting, or integration.

What problem is this workflow solving?

This workflow addresses the challenges of manually retrieving, structuring, and using data from Wikipedia at scale.

What this workflow does

Workflow Breakdown

Trigger

Bright Data Wikipedia Scraping

Parse & Extract Structured Data

Summarization

Send to Webhook
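
The parse and webhook steps above can be sketched outside n8n as follows. This is a minimal illustration, not the workflow's actual node logic: the names `TextExtractor`, `extract_text`, and `build_webhook_payload`, and the payload field names, are hypothetical. The Bright Data fetch and the Gemini summarization are left out, so the summary is passed in as a plain string.

```python
import json
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible text and skips <script> and <style> contents."""

    SKIPPED = {"script", "style"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside skipped elements.
        if self._skip_depth == 0 and data.strip():
            self.chunks.append(data.strip())


def extract_text(html: str) -> str:
    """Reduce raw page HTML to whitespace-normalized plain text."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


def build_webhook_payload(url: str, text: str, summary: str) -> str:
    """Serialize the result as JSON; the field names are assumptions."""
    return json.dumps({"source": url, "text": text, "summary": summary})
```

In the workflow itself, the cleaned text would feed the summarization step, and the serialized payload would be the body of the final HTTP Request node.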

Setup

How to customize this workflow to your needs

  1. Update Wikipedia URL
  2. Modify Data Extraction Logic
  3. Extend AI Summarization
  4. Extend Summary Webhook Notifier
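
For the first customization step, it can help to generate the target URL from an article title instead of hard-coding it. The helper below is a small sketch under that assumption. `wikipedia_url` is a hypothetical name, and in n8n the resulting string would simply be pasted or mapped into the node that holds the Wikipedia URL.

```python
from urllib.parse import quote


def wikipedia_url(title: str, lang: str = "en") -> str:
    """Build a Wikipedia article URL from a human-readable title."""
    # Wikipedia uses underscores for spaces; percent-encode everything else
    # that is not URL-safe, keeping parentheses readable.
    slug = quote(title.strip().replace(" ", "_"), safe="_()")
    return f"https://{lang}.wikipedia.org/wiki/{slug}"
```

For example, `wikipedia_url("Alan Turing")` yields `https://en.wikipedia.org/wiki/Alan_Turing`.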

🔗 Nodes Used

HTTP Request, Summarization Chain, Basic LLM Chain, Google Gemini Chat Model

📥 Import

Download workflow.json and import into n8n: Workflow menu → Import from File

📖 Importing guide · 🔑 Credential setup