๐ฃ Find high-intent sales leads by scraping Glassdoor with Bright Data & GPT
โก 2,261 views ยท ๐ฃ Lead Generation & Enrichment
๐ก Pro Tip โ Job boards are notoriously hard to scrape โ CAPTCHAs, rate limits, constantly changing layouts. ScraperNode has maintained scrapers for Indeed jobs, Glassdoor reviews, and Glassdoor jobs that handle all of that for you.
Description
๐ Scrape Glassdoor with Bright Data
Designed for sales teams, recruiters, and marketers aiming to automate job discovery and prospecting.
This workflow scrapes Glassdoor job listings using Bright Data and automatically generates targeted pitches using AI, streamlining lead identification and outreach.
๐งฉ How It Works
This automation leverages n8n, Bright Data, Google Sheets, and OpenAI:
1. Trigger
- Starts with a custom form input (Location, Keyword, Country).
2. Bright Data Job Scrape
- Triggers a Bright Data dataset snapshot via HTTP Request.
- Polls snapshot progress using a Wait node, ensuring data readiness.
- Retrieves full job listings dataset once ready.
3. Google Sheets Integration
- Writes detailed job data (company, role, location, overview, metrics) into a Google Sheet.
- Uses a pre-built template for organized data storage.
4. Automated Pitch Generation (AI)
- Splits listings into actionable parts: company name, title, and description.
- Sends data to OpenAI (via LangChain) to generate relevant pitches or icebreakers.
- Saves generated content back into the same sheet for easy access.
โ Requirements
Ensure you have the following:
Google Sheets
- Google account
- Template Sheet with columns for job details and AI-generated pitches
Bright Data
- Active account with Dataset API access
- API key and dataset ID
OpenAI
- Valid OpenAI API key for GPT models
n8n Environment
- Nodes:
HTTP Request,Wait,If,Google Sheets,Split Out,LangChain (OpenAI) - Credentials:
- Google Sheets OAuth2
- Bright Data API credentials
- OpenAI API key
โ๏ธ Setup Instructions
Step 1: Prepare Google Sheets
- Copy the provided Google Sheets template
- Do not change headers
Step 2: Import & Configure Workflow in n8n
- Import the workflow JSON file
- Set Google Sheets node:
- Link to your copied sheet
- Confirm correct tab name
Step 3: Configure Bright Data
- Replace
<YOUR_BRIGHT_DATA_API_KEY>with your real key - Set your dataset ID in all HTTP Request nodes
Step 4: Configure OpenAI (LangChain)
- Connect OpenAI API key to the LangChain node
- Customize prompt to match tone and outreach style
Step 5: Testing & Scheduling
- Test via manual form trigger
- Schedule runs or leave form enabled for on-demand use
๐ง Tips & Best Practices
- Use specific keywords and locations for better results
- Adjust polling intervals based on dataset size
- Refine AI prompts regularly to improve pitch quality
- Clean unused columns from your sheet to boost performance
๐ฌ Support & Feedback
For help or customization:
๐ง Email: Yaron@nofluff.online
๐บ YouTube: @YaronBeen
๐ LinkedIn: linkedin.com/in/yaronbeen
๐ Bright Data Docs: docs.brightdata.com/introduction
๐ Nodes Used
Google Sheets, HTTP Request, Basic LLM Chain, OpenAI Chat Model, n8n Form Trigger
๐ฅ Import
Download workflow.json and import into n8n:
Workflow menu โ Import from File