🔬 Extract Amazon book data & generate purchase reports with Decodo Scraper

442 views · 🔬 Document Extraction & Analysis

💡 Pro Tip — HTTP Request scraping tends to break when sites update their markup. If you’re scraping a major platform, check if ScraperNode covers it — it has maintained scrapers for LinkedIn, Instagram, TikTok, YouTube, and 20+ other platforms that return structured data.

View All Scrapers

Description

Decodo Scraper API Workflow Template (n8n Automation Amazon Book Purchase Report)

Watch the demo video below:

Watch the video > This workflow demos how to use Decodo Scraper API to crawl any public web page (headless JS, device emulation: mobile/desktop/tablet), extract structured product data from the returned HTML, generate a purchase-ready report, and automatically deliver it as a Google Doc + PDF to Slack/Drive.

🚀 Try Decodo — Web Scraping & Data API (Coupon: TRUNG)

Decodo Logo Decodo is a powerful public data access platform offering managed web scraping APIs and proxy infrastructure to collect structured web data at scale. It handles proxies, anti-bot protection, JavaScript rendering, retries, and global IP rotation—so you can focus on data, not scraping complexity. Why Decodo

Who’s it for

How it works / What it does

  1. Trigger – Manually run the workflow.
  2. Edit Fields (manual) – Provide inputs:
    • targetUrl (e.g., an Amazon category/search/listing page)
    • deviceType (desktop | mobile | tablet)
    • Optional: maxItems, notes, reportTitle, reportOwner
  3. Scraper API Request (HTTP Request → POST)
    Calls Decodo Scraper API with:
    • URL to crawl, headless JS enabled
    • Device emulation (UA + viewport)
    • Optional waitFor / executeJS to ensure late-loading content is captured
  4. HTML Response Parser (Code/Function or HTML node)
    Pulls the HTML string from Decodo response and normalizes it (strip scripts/styles, collapse whitespace).
  5. Product Analyzer Agent (LLM + Structured Output Parser)
    Prompts an LLM to extract structured “book” objects from the HTML: The Structured Output Parser enforces a strict JSON schema and drops malformed items.
  6. Build 📚 Book Purchase Report (Code/LLM)
    Converts the JSON array into a Markdown (or HTML) report with:
    • Executive summary (top picks, average price/rating)
    • Table of items (rank, title, author, price, rating, link)
    • “Recommended to buy” shortlist (rules configurable)
    • Notes / owner / timestamp
  7. Configure Google Drive Folder (manual)
    Choose/create a Drive folder for output artifacts.
  8. Create Document File (Google Docs API)
    Creates a Doc from the generated Markdown/HTML.
  9. Convert Document to PDF (Google Drive export)
    Exports the Doc to PDF.
  10. Upload report to Slack
    Sends the PDF (and/or Doc link) to a chosen Slack channel with a short summary.

How to set up

1 Prerequisites

3 Nodes configuration (high level)

Edit Fields (Set node) Scraper API Request (HTTP Request → POST) HTML Response Parser (Code node) Product Analyzer Agent Build Book Purchase Report (Code/LLM) Create Document File Convert to PDF Upload to Slack

Requirements

How to customize the workflow

🔗 Nodes Used

HTTP Request, Slack, Google Drive, AI Agent, OpenAI Chat Model, Structured Output Parser

📥 Import

Download workflow.json and import into n8n: Workflow menu → Import from File

📖 Importing guide · 🔑 Credential setup