📊 Real-time extraction of job, company, and salary details via Bright Data MCP & OpenAI

⚡ 880 views · 📊 Market Research & Insights


Description


Notice

Community nodes can only be installed on self-hosted instances of n8n.

Who this is for

This workflow automates real-time extraction of job descriptions and salary information from job listing pages using Bright Data MCP, then analyzes the content with OpenAI GPT-4o mini.

This workflow is ideal for:

What problem is this workflow solving?

Traditional scraping of job portals can be challenging due to cluttered content, anti-scraping measures, and inconsistent formatting. Manually analyzing salary ranges and job descriptions is tedious and error-prone.

This workflow solves the problem by:

What this workflow does

Components & Flow

Input Nodes

MCP Client Operations

Extracts full job description as structured Markdown content
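At the protocol level, an MCP client invokes a server tool by sending a JSON-RPC 2.0 tools/call request. The Python sketch below builds such a request for illustration only. The tool name scrape_as_markdown and its url argument are assumptions about the Bright Data MCP Server's tool list, and the job URL is a placeholder. Inside n8n, the MCP Client node builds this message for you.

```python
import json


def build_tool_call(tool_name: str, arguments: dict, request_id: int = 1) -> str:
    """Serialize an MCP tools/call request as a JSON-RPC 2.0 message."""
    message = {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }
    return json.dumps(message)


# Assumed tool name and placeholder URL; list the server's tools to confirm.
request = build_tool_call(
    "scrape_as_markdown",
    {"url": "https://example.com/jobs/12345"},
)
print(request)
```

Listing the server's available tools first is the safer way to confirm the exact tool name and argument schema before wiring it into the workflow.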

OpenAI GPT-4o mini Nodes

Salary Information Extractor

Job Description Refiner

Company Information Extractor

Merge Node

Aggregate node
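To make the Merge and Aggregate steps concrete, here is a minimal Python sketch of the same idea outside n8n: the results of the three extractor branches are appended into one list, then folded into a single record. The field names (salary, job_description, company) and the sample values are hypothetical, not the workflow's actual schema.

```python
from typing import Any


def merge_branches(*branches: list[dict[str, Any]]) -> list[dict[str, Any]]:
    """Append the items of every branch into one list, preserving order."""
    merged: list[dict[str, Any]] = []
    for items in branches:
        merged.extend(items)
    return merged


def aggregate(items: list[dict[str, Any]]) -> dict[str, Any]:
    """Fold many items into one record; later keys overwrite earlier ones."""
    record: dict[str, Any] = {}
    for item in items:
        record.update(item)
    return record


# Hypothetical outputs of the three extractor nodes.
salary = [{"salary": {"min": 90000, "max": 120000, "currency": "USD"}}]
description = [{"job_description": "Build and maintain data pipelines."}]
company = [{"company": {"name": "Example Corp"}}]

result = aggregate(merge_branches(salary, description, company))
print(result)
```

Giving each branch its own top-level key avoids collisions when the records are folded together.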

Final Output Handling

The output is handled in three different formats depending on your downstream needs:

Pre-conditions

  1. Familiarity with the Model Context Protocol (MCP) is essential. Please read this blog post: model-context-protocol
  2. You need a Bright Data account and must complete the steps described in the Setup section below.
  3. You need an OpenAI API key for the GPT-4o mini nodes.
  4. You need to install the Bright Data MCP Server, @brightdata/mcp.
  5. You need to install the n8n-nodes-mcp community node.

Setup

  1. Set up n8n locally with MCP Servers by following the instructions at n8n-nodes-mcp.
  2. Install the Bright Data MCP Server, @brightdata/mcp, on your local machine.
  3. Sign up at Bright Data.
  4. Navigate to Proxies & Scraping and create a new Web Unlocker zone by selecting Web Unlocker API under Scraping Solutions.
  5. Name the Web Unlocker proxy zone mcp_unlocker in the Bright Data control panel.
  6. In n8n, configure the OpenAI account credentials.
  7. In n8n, configure the MCP Client (STDIO) account credentials to connect to the Bright Data MCP Server, as shown below.

MCPClientAccount.png

Make sure to enter your Bright Data API token in the Environments textbox above as API_TOKEN=<your-token>
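As a rough illustration of what the STDIO credential amounts to, the Python sketch below parses KEY=value lines like the one entered in the Environments textbox and assembles a launch description for the server. The npx command and the dictionary layout are assumptions made for illustration, not the n8n credential's internal format, and <your-token> remains a placeholder.

```python
def parse_environments(text: str) -> dict[str, str]:
    """Parse one KEY=value pair per line into a dict, skipping blank lines."""
    env: dict[str, str] = {}
    for entry in text.splitlines():
        entry = entry.strip()
        if not entry:
            continue
        # Split on the first "=" only, so values may themselves contain "=".
        key, sep, value = entry.partition("=")
        if not sep or not key.strip():
            raise ValueError(f"expected KEY=value, got {entry!r}")
        env[key.strip()] = value.strip()
    return env


# Assumed launch settings: run the package named in the Setup steps via npx.
server_config = {
    "command": "npx",
    "args": ["@brightdata/mcp"],
    "env": parse_environments("API_TOKEN=<your-token>"),
}
print(server_config)
```

A malformed entry raises ValueError early, which is easier to diagnose than a server that starts without its token.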

How to customize this workflow to your needs

Modify Input Source

Tweak LLM Prompts (Optional)

Change Output Format

🔗 Nodes Used

Function, Google Sheets, HTTP Request, OpenAI Chat Model, Read/Write Files from Disk, Information Extractor

📥 Import

Download workflow.json and import it into n8n: Workflow menu → Import from File

📖 Importing guide · 🔑 Credential setup