Scrape product info from website URLs in Google Sheets using Dumpling AI
1,582 views · Market Research & Insights
Pro Tip: HTTP Request scraping tends to break when sites update their markup. If you're scraping a major platform, check whether ScraperNode covers it; it has maintained scrapers for LinkedIn, Instagram, TikTok, YouTube, and 20+ other platforms that return structured data.
Description
What this workflow does
This workflow automatically scrapes product information from any website URL entered into a Google Sheet and stores the extracted product details into another sheet. It uses Dumpling AI to extract product data such as name, price, description, and reviews.
Who is this for
This is ideal for:
- Lead generation specialists capturing product info from prospect websites
- eCommerce researchers collecting data on competitor product listings
- Sales teams building enriched product databases from lead URLs
- Anyone who needs to automate product scraping from multiple websites
Requirements
- A Google Sheet with a column labeled `Website` where URLs will be added
- A second sheet (e.g., `product details`) where extracted data will be saved
- Dumpling AI API access to perform the extraction
- Connected Google Sheets credentials in n8n
How to set up
- Replace the Google Sheet and tab IDs in the workflow with your own.
- Make sure your source sheet includes a `Website` column.
- Connect your Dumpling AI and Google Sheets credentials.
- Make sure the output sheet has the following headers: `productName`, `price`, `productDescription`. (The workflow supports `review`, but it's optional.)
- Activate the workflow to start processing new rows.
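Before activating, it can help to confirm that the output sheet's header row matches what the workflow writes. The snippet below is a minimal sketch of that check, not part of the workflow itself. It assumes header matching is exact and case-sensitive, and `missing_headers` is a hypothetical helper name.

```python
# Headers the setup steps list for the output sheet.
REQUIRED_HEADERS = ["productName", "price", "productDescription"]
# Supported by the workflow but optional.
OPTIONAL_HEADERS = ["review"]


def missing_headers(header_row):
    """Return the required headers that are absent from the sheet's first row.

    Assumption: header names must match exactly (case-sensitive), apart from
    surrounding whitespace, which is ignored here.
    """
    present = {str(cell).strip() for cell in header_row}
    return [h for h in REQUIRED_HEADERS if h not in present]


if __name__ == "__main__":
    # Example: a sheet that is missing the description column.
    print(missing_headers(["productName", "price", "review"]))
```

An empty list means the sheet is ready. Any names returned should be added to the header row before new URLs are processed.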
How it works (Workflow Steps)
- Watch New Website URL in Google Sheets: Triggers when a new row is added with a website URL.
- Extract Product Info with Dumpling AI: Sends the URL to Dumpling AI's extract endpoint using a defined schema for product details.
- Split Extracted Products: Separates multiple products into individual items if the page contains more than one.
- Append Product Info to Google Sheets: Adds the structured results to the specified product details sheet.
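The split and append steps above can be sketched in Python to show how one extraction result becomes several sheet rows. This is an illustration under stated assumptions, not the workflow's actual node configuration. The schema shape, the top-level `products` key, and the helper names `split_products` and `to_row` are hypothetical, and the real request and response format is defined by Dumpling AI and by the schema set in the workflow.

```python
# Hypothetical JSON-schema-style description of the product details to
# extract. Field names mirror the output sheet headers; the exact schema
# format Dumpling AI expects is an assumption.
PRODUCT_SCHEMA = {
    "type": "object",
    "properties": {
        "products": {
            "type": "array",
            "items": {
                "type": "object",
                "properties": {
                    "productName": {"type": "string"},
                    "price": {"type": "string"},
                    "productDescription": {"type": "string"},
                    "review": {"type": "string"},
                },
            },
        }
    },
}

# Column order of the output sheet ("review" is optional).
COLUMNS = ["productName", "price", "productDescription", "review"]


def split_products(extraction):
    """Return one dict per product found in a single extraction result."""
    products = extraction.get("products") or []
    if isinstance(products, dict):
        # A page with a single product may come back as one object.
        products = [products]
    return [dict(p) for p in products]


def to_row(product):
    """Order a product's fields to match COLUMNS; missing values become ''."""
    row = []
    for col in COLUMNS:
        value = product.get(col)
        row.append("" if value is None else str(value))
    return row


if __name__ == "__main__":
    sample = {"products": [{"productName": "Mug", "price": "$12"}]}
    for product in split_products(sample):
        print(to_row(product))
```

Keeping the split and the row mapping as separate functions mirrors the workflow, where splitting happens in one step and appending to the sheet in the next.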
Customization Ideas
- Add a column to store the original source URL alongside each product
- Use OpenAI to generate short SEO summaries for each product
- Add filters to ignore pages without valid product details
- Send Slack or email notifications when new products are added to the sheet
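Two of these ideas, storing the source URL and filtering out pages without valid product details, can be combined in one small transformation. The sketch below is one possible approach. The `sourceUrl` column name, the rule that a valid product needs both a name and a price, and the helper names are all assumptions to adjust to your own sheet.

```python
def _text(value):
    """Normalize a field to a stripped string; None becomes ''."""
    return "" if value is None else str(value).strip()


def is_valid_product(product):
    """Assumed rule: a product is valid if it has a non-empty name and price."""
    return bool(_text(product.get("productName"))) and bool(_text(product.get("price")))


def with_source(products, source_url):
    """Drop invalid products and tag the rest with the page they came from.

    'sourceUrl' is a hypothetical column name; the output sheet would need a
    matching header for the value to be stored.
    """
    return [{**p, "sourceUrl": source_url} for p in products if is_valid_product(p)]


if __name__ == "__main__":
    products = [
        {"productName": "Mug", "price": "$12"},
        {"productName": "Cup"},  # no price, so it is filtered out
    ]
    print(with_source(products, "https://example.com/shop"))
```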
Nodes Used
Google Sheets, HTTP Request, Google Sheets Trigger
Import
Download workflow.json and import into n8n:
Workflow menu → Import from File