📊 Extract named entities from web pages with Google Natural Language API

📊 Market Research & Insights

Description

Who is this for?

What problem is this workflow solving?

Manually identifying and categorizing entities (people, organizations, locations, etc.) on web pages is time-consuming and error-prone. This workflow addresses that by automating entity extraction with Google’s Natural Language API.

What this workflow does

This workflow creates an automated pipeline between a webhook and Google’s Natural Language API to:

  1. Receive a URL through a webhook endpoint
  2. Fetch the HTML content from the specified URL
  3. Clean and prepare the HTML for processing
  4. Submit the HTML to Google’s Natural Language API for entity analysis
  5. Extract entities, including people, organizations, and locations, along with their salience scores
  6. Return the structured entity data through the webhook response
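Outside n8n, the steps above can be sketched in Python. This is an illustration under stated assumptions, not the workflow's actual node configuration: the helper names (`clean_html`, `build_request_body`, `analyze_entities`, `top_entities`) are hypothetical, the cleanup rule is a guess at what "clean and prepare" involves, and authentication through a `key` query parameter assumes an API key is used. The endpoint and the request and response field names follow Google's public `documents:analyzeEntities` REST method.

```python
import json
import re
import urllib.request

# Public REST endpoint for entity analysis (Cloud Natural Language API, v1).
API_URL = "https://language.googleapis.com/v1/documents:analyzeEntities"


def clean_html(html: str) -> str:
    """Hypothetical cleanup step: drop <script>/<style> blocks, collapse whitespace."""
    html = re.sub(r"(?is)<(script|style)\b.*?</\1\s*>", "", html)
    return re.sub(r"\s+", " ", html).strip()


def build_request_body(html: str) -> dict:
    """Build the JSON body for analyzeEntities, sending the page as an HTML document."""
    return {
        "document": {"type": "HTML", "content": html},
        "encodingType": "UTF8",
    }


def analyze_entities(html: str, api_key: str) -> dict:
    """POST the cleaned HTML to the API. Assumes API-key authentication."""
    body = json.dumps(build_request_body(clean_html(html))).encode("utf-8")
    request = urllib.request.Request(
        f"{API_URL}?key={api_key}",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


def top_entities(response: dict, limit: int = 10) -> list[dict]:
    """Return the most salient entities as compact name/type/salience records."""
    entities = sorted(
        response.get("entities", []),
        key=lambda entity: entity.get("salience", 0.0),
        reverse=True,
    )
    return [
        {
            "name": entity["name"],
            "type": entity["type"],
            "salience": entity.get("salience", 0.0),
        }
        for entity in entities[:limit]
    ]
```

With a valid key, `top_entities(analyze_entities(page_html, api_key))` would give a short list sorted by salience, which is roughly the shape a webhook response could return to the caller.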

Setup

Prerequisites:

Google Cloud Setup:

n8n Setup:

Testing:

How to customize this workflow to your needs

Analyzing Specific Entities:

Processing Multiple URLs in Batch:

Enhancing Entity Data:

Additional Notes

❤️ Hueston SEO Team

🔗 Nodes Used

HTTP Request, Webhook

📥 Import

Download workflow.json and import into n8n: Workflow menu → Import from File

📖 Importing guide · 🔑 Credential setup