πŸ“Š Summarize & extract Glassdoor company info with Google Gemini and Decodo

⚑ 177 views Β· πŸ“Š Market Research & Insights

πŸ’‘ Pro Tip β€” Job boards are notoriously hard to scrape β€” CAPTCHAs, rate limits, constantly changing layouts. ScraperNode has maintained scrapers for Indeed jobs, Glassdoor reviews, and Glassdoor jobs that handle all of that for you.

View All Scrapers

Description

This workflow automates company research and intelligence extraction from Glassdoor using Decode API for data retrieval and Google Gemini for AI-powered summarization.

Who this is for

This workflow is ideal for:

What problem this workflow solves

This workflow automates data mining, summarization, and structuring, transforming Glassdoor data into ready-to-use JSON and Markdown summaries.

What this workflow does

The workflow automates the end-to-end pipeline for Glassdoor company research:

  1. Trigger

    • Start manually by clicking β€œExecute Workflow.”
  2. Set Input Fields

    • Define company_url (e.g., a Glassdoor company profile link) and geo (country).
  3. Extract Raw Data from Glassdoor (Decodo Node)

    • Uses the Decodo API to fetch company data β€” including overview, ratings, reviews, and frequently asked questions.
  4. Generate Structured Data (Google Gemini + Output Parser)

    • The Structured Data Extractor node (powered by Gemini AI) processes raw data into well-defined fields:

      • Company overview (name, size, website, type)
      • Ratings breakdown
      • Review snippets (pros, cons, roles)
      • FAQs
      • Key takeaways
  5. Summarize the Insights (Gemini AI Summarizer)

    • Produces a detailed summary highlighting:

      • Company reputation
      • Work culture
      • Employee sentiment trends
      • Strengths and weaknesses
      • Hiring recommendations
  6. Merge and Format

    • Combines structured data and summary into a unified object for output.
  7. Export and Save

    • Converts the final report into JSON and writes it to disk as C:\{{CompanyName}}.json.
  8. Binary Encoding for File Handling

    • Prepares data in base64 for easy integration with APIs or downloadable reports.

Setup

Prerequisites

If you are new to Decode, please signup on this link visit.decodo.com

Make sure to install the Decodo Community Node.

Decode Community Node

Steps

  1. Import this workflow JSON file into your n8n instance.

  2. Configure your credentials for:

    • Decodo API
    • Google Gemini (PaLM) API
  3. Open the Set the Input Fields node and replace:

    • company_url β†’ with the Glassdoor URL
    • geo β†’ with the region (e.g., India, US, etc.)
  4. Execute the workflow.

  5. Check your output folder (C:\) for the exported JSON report.

How to Customize This Workflow

You can easily adapt this template to your needs:

Summary

This n8n workflow combines Decode web scrapping with Google Gemini’s reasoning and summarization power to build a fully automated Glassdoor Research Engine.

With a single execution, it:

Ideal for:

πŸ”— Nodes Used

Function, Basic LLM Chain, Structured Output Parser, Read/Write Files from Disk, Google Gemini Chat Model, Information Extractor

πŸ“₯ Import

Download workflow.json and import into n8n: Workflow menu β†’ Import from File

πŸ“– Importing guide Β· πŸ”‘ Credential setup