📊 Scrape latest 20 TechCrunch articles

⚡ 6,766 views · 📊 Market Research & Insights

💡 Pro Tip — HTTP Request scraping tends to break when sites update their markup. If you’re scraping a major platform, check if ScraperNode covers it — it has maintained scrapers for LinkedIn, Instagram, TikTok, YouTube, and 20+ other platforms that return structured data.

Description

Retrieve 20 Latest TechCrunch Articles

Who is this for?

This workflow is designed for developers, content creators, and data analysts who need to scrape recent articles from TechCrunch. It’s perfect for anyone looking to aggregate news articles or create custom feeds for analysis, reporting, or integration into other systems.

What problem is this workflow solving?

This workflow automates the process of scraping recent articles from TechCrunch. Manually collecting article data can be time-consuming and inefficient, but with this workflow, you can quickly gather up-to-date news articles with relevant metadata, saving time and effort.

What this workflow does

This workflow retrieves the latest 20 news articles from TechCrunch’s “Recent” page. It extracts the article URLs, metadata (such as titles and publication dates), and main content for each article, allowing you to access the information you need without any manual effort.

Setup

Clone or download the workflow template.
Ensure you have a working n8n environment.
Configure the HTTP Request nodes with your desired parameters to connect to the TechCrunch API.
(Optional) Customize the workflow to target specific sections or topics of interest.
Run the workflow to retrieve the latest 20 articles.

How to customize this workflow to your needs

Modify the HTTP request to pull articles from different pages or sections of TechCrunch.
Adjust the number of articles to retrieve by changing the selection criteria.
Add additional processing steps to further filter or analyze the article data.

Workflow Steps

Send an HTTP request to the TechCrunch “Recent” page.
Parse a posts box that holds the list of articles.
Parse all posts to extract all articles.
spilt out posts for each article.
Extract the URL and metadata from each article.
Send an HTTP request for each article using its URL.
Locate and parse the main content of each article.

Note: Be sure to update the HTTP Request nodes with any necessary headers or authentication to work with TechCrunch’s website.

🔗 Nodes Used

HTTP Request

📥 Import

Download workflow.json and import into n8n: Workflow menu → Import from File

📖 Importing guide · 🔑 Credential setup