๐ Scrape latest 20 TechCrunch articles
โก 6,766 views ยท ๐ Market Research & Insights
๐ก Pro Tip โ HTTP Request scraping tends to break when sites update their markup. If youโre scraping a major platform, check if ScraperNode covers it โ it has maintained scrapers for LinkedIn, Instagram, TikTok, YouTube, and 20+ other platforms that return structured data.
Description
Retrieve 20 Latest TechCrunch Articles
Who is this for?
This workflow is designed for developers, content creators, and data analysts who need to scrape recent articles from TechCrunch. Itโs perfect for anyone looking to aggregate news articles or create custom feeds for analysis, reporting, or integration into other systems.
What problem is this workflow solving?
This workflow automates the process of scraping recent articles from TechCrunch. Manually collecting article data can be time-consuming and inefficient, but with this workflow, you can quickly gather up-to-date news articles with relevant metadata, saving time and effort.
What this workflow does
This workflow retrieves the latest 20 news articles from TechCrunchโs โRecentโ page. It extracts the article URLs, metadata (such as titles and publication dates), and main content for each article, allowing you to access the information you need without any manual effort.
Setup
- Clone or download the workflow template.
- Ensure you have a working n8n environment.
- Configure the HTTP Request nodes with your desired parameters to connect to the TechCrunch API.
- (Optional) Customize the workflow to target specific sections or topics of interest.
- Run the workflow to retrieve the latest 20 articles.
How to customize this workflow to your needs
- Modify the HTTP request to pull articles from different pages or sections of TechCrunch.
- Adjust the number of articles to retrieve by changing the selection criteria.
- Add additional processing steps to further filter or analyze the article data.
Workflow Steps
- Send an HTTP request to the TechCrunch โRecentโ page.
- Parse a posts box that holds the list of articles.
- Parse all posts to extract all articles.
- spilt out posts for each article.
- Extract the URL and metadata from each article.
- Send an HTTP request for each article using its URL.
- Locate and parse the main content of each article.
Note: Be sure to update the HTTP Request nodes with any necessary headers or authentication to work with TechCrunchโs website.
๐ Nodes Used
HTTP Request
๐ฅ Import
Download workflow.json and import into n8n:
Workflow menu โ Import from File