๐Ÿ“Š Amazon product search scraper with BrightData, GPT-4, and Google Sheets

โšก 12,598 views ยท ๐Ÿ“Š Market Research & Insights

๐Ÿ’ก Pro Tip โ€” HTTP Request scraping tends to break when sites update their markup. If youโ€™re scraping a major platform, check if ScraperNode covers it โ€” it has maintained scrapers for LinkedIn, Instagram, TikTok, YouTube, and 20+ other platforms that return structured data.

View All Scrapers

Description

This workflow automates web scraping of Amazon search result pages by retrieving raw HTML, cleaning it to retain only the relevant product elements, and then using an LLM to extract structured product data (name, description, rating, reviews, and price), before saving the results back to Google Sheets.

It integrates Google Sheets to supply and collect URLs, BrightData to fetch page HTML, a custom n8n Function node to sanitize the HTML, LangChain (OpenRouter GPT-4) to parse product details, and Google Sheets again to store the output.

CleanShot 20250506 at 16.42.332x.png URL to scape

.

CleanShot 20250506 at 16.41.062x.png Result

Who Needs Amazon Search Result Scraping?

This scraping workflow is ideal for teams and businesses that need to monitor Amazon product listings at scale:

If you need reliable, structured data from Amazon search results delivered directly into your spreadsheets, this workflow saves you hours of manual copy-and-paste.

Why Use This Workflow?

Step-by-Step: How This Workflow Scrapes Amazon

  1. Get URLs from Google Sheets โ€“ Reads a list of search result URLs.
  2. Loop Over Items โ€“ Iterates through each URL in controlled batches.
  3. Fetch Raw HTML โ€“ Uses BrightDataโ€™s Web Unlocker proxy to retrieve the page.
  4. Clean HTML โ€“ A Function node removes doctype, scripts, styles, head, comments, classes, and non-whitelisted tags, collapsing extra whitespace.
  5. Extract with LLM โ€“ Passes cleaned HTML into LangChain โ†’ GPT-4 to output JSON for each product:
    • name, description, rating, reviews, price
  6. Save Results โ€“ Appends the JSON fields as columns back into a โ€œresultsโ€ sheet in Google Sheets.

Customization: Tailor to Your Needs

๐Ÿ”‘ Prerequisites

๐Ÿš€ Installation & Setup

โš  Important

Summary

This workflow delivers a fully automated, scalable solution to extract structured product data from Amazon search pages directly into Google Sheetsโ€”streamlining your competitive analysis and data collection. ๐Ÿš€

.


Phil | Inforeole | Linkedin

๐Ÿ‡ซ๐Ÿ‡ท Contactez nous pour automatiser vos processus

๐Ÿ”— Nodes Used

Google Sheets, HTTP Request, Basic LLM Chain, Structured Output Parser, OpenRouter Chat Model

๐Ÿ“ฅ Import

Download workflow.json and import into n8n: Workflow menu โ†’ Import from File

๐Ÿ“– Importing guide ยท ๐Ÿ”‘ Credential setup