π Extract & process specific links from sitemap.xml
β‘ 3,316 views Β· π Market Research & Insights
Description
Description
This workflow reads a sitemap.xml file, extracts all URLs, and allows you to filter out specific types of linksβsuch as PDF files, images, or any other contentβbased on your needs.
Who Is This For?
- SEO Specialists looking to analyze specific URLs in their sitemap.
- Developers who need to extract links for automated processing.
- Content Managers filtering out downloadable assets like PDFs or images.
How It Works
- Fetch
sitemap.xmlβ The workflow reads the sitemap file from a given URL. - Extract URLs β Parses all the URLs listed in the sitemap.
- Filter URLs β Use a simple filter to extract only the links you need (e.g., *.pdf).
- Export or Process β The filtered list can be sent via email, stored in a database, or used in another workflow.
Customization
- Edit the Set sitemap URL block and edit the
sitemapUrlvalue to the sitemap you want to fetch. - Edit the Filter URLs block and edit the filter conditions to meet your needs.
π Nodes Used
HTTP Request, Filter
π₯ Import
Download workflow.json and import into n8n:
Workflow menu β Import from File