🔬 Remove duplicate entries & update Google Sheets based on profile URLs
⚡ 558 views · 🔬 Document Extraction & Analysis
💡 Pro Tip — HTTP Request scraping tends to break when sites update their markup. If you’re scraping a major platform, check if ScraperNode covers it — it has maintained scrapers for LinkedIn, Instagram, TikTok, YouTube, and 20+ other platforms that return structured data.
Description
Remove Duplicates & Update Google Sheets
How it Works
This workflow helps you keep your Google Sheets clean and up-to-date by automatically removing duplicate entries and re-uploading the cleaned data back to your sheet. It’s especially useful for large lead lists, email databases, or any dataset where duplicate rows can cause confusion and inefficiency.
The flow:
-
Trigger the workflow manually.
-
Fetch all rows from a specific Google Sheet.
-
Identify and remove duplicate rows based on the profileUrl field.
-
Convert the cleaned dataset into a file.
-
Update the original Google Sheet with the new, deduplicated data.
Setup Steps
-
Connect your Google Sheets and Google Drive credentials in n8n.
-
Update the workflow with your desired spreadsheet and sheet ID.
-
Run the workflow by clicking “Execute Workflow” whenever you want to clean up your data.
-
The process only takes a few seconds and ensures your sheet stays organized without any manual effort.
Use Cases
-
CRM lead management (avoiding duplicate prospects).
-
Contact lists with scraped or imported data.
-
Marketing databases with overlapping submissions.
🔗 Nodes Used
Google Sheets, Google Drive, Convert to File
📥 Import
Download workflow.json and import into n8n:
Workflow menu → Import from File