📊 Scrape latest 20 TechCrunch articles

⚡ 6,766 views · 📊 Market Research & Insights

💡 Pro Tip: HTTP Request scraping tends to break when sites update their markup. If you’re scraping a major platform, check whether ScraperNode covers it; it has maintained scrapers for LinkedIn, Instagram, TikTok, YouTube, and 20+ other platforms that return structured data.

Description

Retrieve 20 Latest TechCrunch Articles

Who is this for?

This workflow is designed for developers, content creators, and data analysts who need to scrape recent articles from TechCrunch. It’s a good fit for anyone looking to aggregate news articles or create custom feeds for analysis, reporting, or integration into other systems.

What problem is this workflow solving?

This workflow automates the process of scraping recent articles from TechCrunch. Manually collecting article data can be time-consuming and inefficient, but with this workflow, you can quickly gather up-to-date news articles with relevant metadata, saving time and effort.

What this workflow does

This workflow retrieves the latest 20 news articles from TechCrunch’s “Recent” page. It extracts the article URLs, metadata (such as titles and publication dates), and main content for each article, allowing you to access the information you need without any manual effort.

Setup

  1. Clone or download the workflow template.
  2. Ensure you have a working n8n environment.
  3. Configure the HTTP Request nodes with the parameters you need to reach the TechCrunch website (the workflow parses HTML pages rather than calling an API).
  4. (Optional) Customize the workflow to target specific sections or topics of interest.
  5. Run the workflow to retrieve the latest 20 articles.

How to customize this workflow to your needs

Workflow Steps

  1. Send an HTTP request to the TechCrunch “Recent” page.
  2. Parse the posts container that holds the list of articles.
  3. Parse the posts inside that container to extract every article entry.
  4. Split out the posts so that each article becomes its own item.
  5. Extract the URL and metadata from each article.
  6. Send an HTTP request for each article using its URL.
  7. Locate and parse the main content of each article.
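
For readers who want to see the parsing steps outside n8n, here is a minimal Python sketch of steps 2 to 5 and step 7, using only the standard library. It is not the workflow's own logic. The markup it assumes (one `<article>` element per post, containing a title link and a `<time datetime=...>` tag, with body text in `<p>` tags) is hypothetical, as are the names `PostListParser`, `ParagraphParser`, `extract_posts`, and `extract_main_text`. The sample HTML and its URLs are made up for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class PostListParser(HTMLParser):
    """Collect url, title and date from each <article> (assumed markup)."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.posts = []
        self._current = None   # post being built, or None outside <article>
        self._in_link = False  # True while inside the post's first <a>

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "article":
            self._current = {"url": None, "title": "", "published": None}
        elif self._current is None:
            return
        elif tag == "a" and self._current["url"] is None and "href" in attrs:
            # Resolve relative links against the listing page URL.
            self._current["url"] = urljoin(self.base_url, attrs["href"])
            self._in_link = True
        elif tag == "time":
            self._current["published"] = attrs.get("datetime")

    def handle_data(self, data):
        if self._in_link and self._current is not None:
            self._current["title"] += data

    def handle_endtag(self, tag):
        if tag == "a":
            self._in_link = False
        elif tag == "article" and self._current is not None:
            self._current["title"] = self._current["title"].strip()
            self.posts.append(self._current)
            self._current = None


class ParagraphParser(HTMLParser):
    """Collect the text of every <p> element as a stand-in for main content."""

    def __init__(self):
        super().__init__()
        self.paragraphs = []
        self._buf = None  # list of text chunks while inside a <p>

    def handle_starttag(self, tag, attrs):
        if tag == "p":
            self._buf = []

    def handle_data(self, data):
        if self._buf is not None:
            self._buf.append(data)

    def handle_endtag(self, tag):
        if tag == "p" and self._buf is not None:
            text = "".join(self._buf).strip()
            if text:  # skip empty paragraphs
                self.paragraphs.append(text)
            self._buf = None


def extract_posts(listing_html, base_url, limit=20):
    """Steps 2-5: return up to `limit` posts that have a URL."""
    parser = PostListParser(base_url)
    parser.feed(listing_html)
    return [p for p in parser.posts if p["url"]][:limit]


def extract_main_text(article_html):
    """Step 7: join paragraph texts with blank lines."""
    parser = ParagraphParser()
    parser.feed(article_html)
    return "\n\n".join(parser.paragraphs)


# Made-up sample markup, not copied from any real page.
SAMPLE_LISTING = """
<div class="posts">
  <article><a href="/2024/01/01/first-post/">First post</a>
    <time datetime="2024-01-01T09:00:00Z"></time></article>
  <article><a href="/2024/01/02/second-post/">Second post</a>
    <time datetime="2024-01-02T09:00:00Z"></time></article>
</div>
"""
SAMPLE_ARTICLE = "<main><p>Hello <b>world</b>.</p><p> </p><p>Second.</p></main>"

posts = extract_posts(SAMPLE_LISTING, "https://example.com/")
body = extract_main_text(SAMPLE_ARTICLE)
```

Keeping the listing parser and the article parser separate mirrors the workflow's two HTTP requests: if a site changes its listing markup, only the first parser needs adjusting.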

Note: Be sure to update the HTTP Request nodes with any necessary headers or authentication to work with TechCrunch’s website.
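
To make the note about headers concrete, the following Python sketch shows one way to attach custom headers to a page request with the standard library. It only illustrates the idea; in n8n the equivalent settings live in the HTTP Request node's options. The helper names `build_request` and `fetch_html` and the header values (including the `User-Agent` string) are placeholders chosen for this example, not values the site is known to require.

```python
from urllib.request import Request, urlopen

# Hypothetical default headers; the values are placeholders, not requirements.
DEFAULT_HEADERS = {
    "User-Agent": "example-article-scraper/0.1",
    "Accept": "text/html",
}


def build_request(url, extra_headers=None):
    """Return a Request carrying the defaults plus any caller overrides."""
    headers = {**DEFAULT_HEADERS, **(extra_headers or {})}
    return Request(url, headers=headers)


def fetch_html(url, extra_headers=None, timeout=10):
    """Download a page and decode it using the charset the server reports."""
    with urlopen(build_request(url, extra_headers), timeout=timeout) as resp:
        charset = resp.headers.get_content_charset() or "utf-8"
        return resp.read().decode(charset, errors="replace")


# Building the request does not touch the network; fetch_html() would.
req = build_request("https://example.com/", {"Accept-Language": "en"})
```

Splitting request construction from the download makes the headers easy to inspect before anything is sent.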

🔗 Nodes Used

HTTP Request

📥 Import

Download workflow.json and import it into n8n: Workflow menu → Import from File

📖 Importing guide · 🔑 Credential setup