🎬 Create .SRT subtitles & .LRC lyrics from audio with Whisper AI and GPT-5-nano



Description

Overview

This workflow automates the creation of subtitle (.SRT) and synced lyrics (.LRC) files from audio recordings. Upload your vocal track; Whisper AI transcribes it with word-level timestamps, and GPT-5-nano segments the text into natural, singable lyric lines. An optional quality control step lets you correct the text by hand, after which the workflow re-aligns your edits with the original timestamps.

Key Features

Use Cases

Perfect For

What You’ll Need

Required Setup

How It Works

Automatic Mode (No Quality Check)

  1. Upload your MP3 vocal track to the workflow
  2. Transcription: Whisper AI processes audio with word-level timestamps
  3. Segmentation: GPT-5-nano formats text into natural lyric lines
  4. Generation: Workflow creates .SRT and .LRC files
  5. Download your ready-to-use subtitle files
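
The timing step between segmentation and file generation can be sketched roughly as follows. This is a minimal illustration, not the template's actual node code: it assumes the transcription yields a list of words with `start` and `end` times in seconds, and that the model only inserted line breaks without adding or dropping words. The names `Word`, `Line`, and `time_lines` are hypothetical.

```python
from typing import TypedDict


class Word(TypedDict):
    word: str
    start: float  # seconds
    end: float    # seconds


class Line(TypedDict):
    text: str
    start: float
    end: float


def time_lines(words: list[Word], lines: list[str]) -> list[Line]:
    """Give each lyric line the start of its first word and the end of its last.

    Assumption: the segmented lines contain the transcribed words in the
    same order and count, so words can be consumed line by line.
    """
    timed: list[Line] = []
    cursor = 0  # index of the next unconsumed word
    for text in lines:
        count = len(text.split())
        if count == 0:
            continue  # skip blank lines
        chunk = words[cursor:cursor + count]
        if not chunk:
            break  # ran out of timestamps; remaining lines stay untimed
        timed.append({"text": text, "start": chunk[0]["start"], "end": chunk[-1]["end"]})
        cursor += count
    return timed
```

If the model rewrites or drops words, a simple word count like this drifts, which is one reason a diff-based alignment is the safer choice once text has been edited.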

Manual Quality Control Mode

  1. Upload your MP3 vocal track and enable quality check
  2. Transcription: Whisper AI processes audio with timestamps
  3. Initial Segmentation: GPT-5-nano creates first draft
  4. Download the .TXT file for review
  5. Edit lyrics in any text editor (keep line structure intact)
  6. Re-upload corrected .TXT file
  7. Smart Matching: A diff algorithm aligns your changes with the original timestamps
  8. Download final .SRT and .LRC files with timing carried over from the original transcription
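
The matching step can be approximated with a word-level diff. The sketch below uses Python's standard `difflib.SequenceMatcher` as a stand-in; the workflow's own algorithm is not documented here and may differ. It assumes the original words carry `start` and `end` times in seconds, and the function name `realign` is hypothetical.

```python
from difflib import SequenceMatcher


def realign(original: list[dict], corrected_lines: list[str]) -> list[dict]:
    """Carry timestamps from the original words over to corrected lyric lines.

    `original` holds {"word", "start", "end"} dicts; `corrected_lines` is the
    edited text, one lyric line per entry.
    """
    old_tokens = [w["word"].lower() for w in original]
    new_tokens = [tok.lower() for line in corrected_lines for tok in line.split()]

    # Map each corrected token index to an original word index where possible.
    mapping: dict[int, int] = {}
    matcher = SequenceMatcher(a=old_tokens, b=new_tokens, autojunk=False)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            for offset in range(j2 - j1):
                mapping[j1 + offset] = i1 + offset
        elif tag == "replace":
            # Edited words: pair them positionally, clamped to the old span.
            for offset in range(j2 - j1):
                mapping[j1 + offset] = min(i1 + offset, i2 - 1)
        # "insert" has no original counterpart and "delete" drops old words.

    timed, cursor = [], 0
    for line in corrected_lines:
        count = len(line.split())
        if count == 0:
            continue  # skip blank lines
        hits = [mapping[j] for j in range(cursor, cursor + count) if j in mapping]
        cursor += count
        if hits:
            timed.append({"text": line,
                          "start": original[hits[0]]["start"],
                          "end": original[hits[-1]]["end"]})
        else:
            # A line made only of newly inserted words has no timing source.
            timed.append({"text": line, "start": None, "end": None})
    return timed
```

Corrected spellings inherit the timing of the word they replace, and newly inserted words borrow the timing of their neighbours on the same line. This is also why step 5 asks you to keep the line structure intact: each line needs at least one word that still maps back to the transcription.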

Technical Details

Output Formats

.SRT (SubRip Subtitle)

Standard format for:

.LRC (Lyric File)

Synced lyrics format for:
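
For reference, the two formats differ mainly in how they write timestamps: SRT uses numbered cues with `HH:MM:SS,mmm --> HH:MM:SS,mmm` ranges, while LRC prefixes each line with a single `[mm:ss.xx]` start tag. The sketch below shows one way to render both from the same timed lines. The function names and the `{"text", "start", "end"}` line shape are assumptions for illustration, not the workflow's actual code.

```python
def srt_time(seconds: float) -> str:
    """Format seconds as an SRT timestamp, HH:MM:SS,mmm."""
    ms = round(seconds * 1000)
    hours, ms = divmod(ms, 3_600_000)
    minutes, ms = divmod(ms, 60_000)
    secs, ms = divmod(ms, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def lrc_time(seconds: float) -> str:
    """Format seconds as an LRC tag, [mm:ss.xx] with hundredths of a second."""
    cs = round(seconds * 100)
    minutes, cs = divmod(cs, 6000)
    secs, cs = divmod(cs, 100)
    return f"[{minutes:02d}:{secs:02d}.{cs:02d}]"


def to_srt(lines: list[dict]) -> str:
    """Render numbered SRT cues separated by blank lines."""
    blocks = []
    for index, line in enumerate(lines, start=1):
        span = f"{srt_time(line['start'])} --> {srt_time(line['end'])}"
        blocks.append(f"{index}\n{span}\n{line['text']}\n")
    return "\n".join(blocks)


def to_lrc(lines: list[dict]) -> str:
    """Render one LRC line per lyric line; LRC stores start times only."""
    return "\n".join(f"{lrc_time(line['start'])}{line['text']}" for line in lines) + "\n"
```

Working in whole milliseconds or centiseconds before splitting into fields avoids floating-point rounding artifacts in the output.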

Pro Tips

💡 For Best Results:

⚙️ Customization Options:

Workflow Components

  1. Audio Input: Upload interface for MP3 files
  2. Whisper Transcribe: OpenAI API call with timestamp extraction
  3. Post-Processing: GPT-5-nano segmentation into lyric format
  4. Routing Quality Check: Decision point for manual review
  5. Timestamp Matching: Diff and alignment for corrected text
  6. Subtitles Preparation: JSON formatting for both output types
  7. File Generation: Convert to .SRT and .LRC formats
  8. Download Nodes: Export final files
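
As a rough illustration of the Whisper Transcribe component, the sketch below lists the form fields that OpenAI's transcription endpoint accepts for word-level timestamps and parses the `verbose_json` response into a word list. The exact settings used by this template's HTTP Request node are an assumption, and `extract_words` is a hypothetical helper. No network call is made here.

```python
import json

# OpenAI audio transcription endpoint (multipart/form-data POST with the audio file).
TRANSCRIBE_URL = "https://api.openai.com/v1/audio/transcriptions"

# Form fields sent alongside the file; assumed values, not read from the template.
FORM_FIELDS = {
    "model": "whisper-1",
    "response_format": "verbose_json",    # required for timestamp output
    "timestamp_granularities[]": "word",  # request per-word start/end times
}


def extract_words(response_body: str) -> list[dict]:
    """Pull {"word", "start", "end"} entries out of a verbose_json response."""
    payload = json.loads(response_body)
    return [
        {"word": w["word"].strip(), "start": float(w["start"]), "end": float(w["end"])}
        for w in payload.get("words", [])
    ]
```

The resulting word list is the input that the later segmentation and timestamp matching stages depend on.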

Template Author:

Questions or need help with setup? 📧 Email: xciklv@gmail.com 💼 LinkedIn: https://www.linkedin.com/in/vaclavcikl/

🔗 Nodes Used

HTTP Request, Basic LLM Chain, OpenAI Chat Model, n8n Form Trigger, Convert to File

📥 Import

Download workflow.json and import into n8n: Workflow menu → Import from File
