Generate high-quality audio with Voxtral Small 24B 2507
674 views · Content Creation & Video
Description
Generate High-Quality Audio with Voxtral Small 24B 2507
This workflow integrates the notdaniel/voxtral-small-24b-2507 model from Replicate to generate audio content from provided inputs. It handles API authentication, creates predictions, polls until completion, and outputs the final generated audio file.
Section 1: Trigger & Authentication
- On clicking “execute”: Manually starts the workflow.
- Set API Key: Stores your Replicate API key to authenticate requests.
Benefit: Securely connects your workflow to Replicate’s API and ensures only authorized requests are made.
Section 2: Create Prediction
- Create Prediction: Sends a request to Replicate’s API with parameters like:
  - audio: Input audio file (e.g., a reference sample).
  - max_new_tokens: Maximum number of tokens to generate (controls audio length/complexity).
Benefit: Starts the audio generation process with configurable input and settings.
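As a rough illustration of what the Create Prediction node sends, the sketch below builds the equivalent HTTP request in Python. It assumes Replicate's `POST /v1/predictions` endpoint with a Bearer token; the function name `build_prediction_request`, the `version` placeholder, and the default of 1024 tokens are hypothetical and not taken from the workflow.

```python
import json
import urllib.request

REPLICATE_PREDICTIONS_URL = "https://api.replicate.com/v1/predictions"


def build_prediction_request(api_key, version, audio_url, max_new_tokens=1024):
    """Build (but do not send) the request that creates a prediction.

    `version` is the model version ID shown on the model's Replicate page;
    it is a placeholder here. The default max_new_tokens is an assumption.
    """
    payload = {
        "version": version,
        "input": {
            "audio": audio_url,                # input audio file URL
            "max_new_tokens": max_new_tokens,  # upper bound on generated tokens
        },
    }
    return urllib.request.Request(
        REPLICATE_PREDICTIONS_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` and decoding the JSON body would yield the prediction record, including the ID used for polling.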
Section 3: Polling & Status Tracking
- Extract Prediction ID: Captures the unique prediction ID and endpoint for polling.
- Wait: Pauses for 2 seconds before re-checking.
- Check Prediction Status: Polls Replicate’s API to see if the audio generation is done.
- Check If Complete:
  - If finished: moves forward to process results.
  - If not: loops back to wait and check again.
Benefit: Efficiently manages asynchronous audio generation, ensuring the workflow only proceeds when results are ready.
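The wait-and-check loop can be sketched outside n8n as follows. This is a minimal illustration, not the workflow's actual node logic: `fetch_prediction` is a hypothetical callable that returns the prediction as a dict (for example, by issuing a GET to the prediction's status URL), and the `max_attempts` cap is an added safeguard that the workflow description does not mention. The terminal status names follow Replicate's prediction lifecycle.

```python
import time

# Statuses after which a Replicate prediction no longer changes.
TERMINAL_STATUSES = {"succeeded", "failed", "canceled"}


def poll_prediction(fetch_prediction, interval_seconds=2.0,
                    max_attempts=150, sleep=time.sleep):
    """Wait, check the status, and repeat until the prediction finishes.

    fetch_prediction: hypothetical zero-argument callable returning a dict.
    max_attempts: assumed upper bound so the loop cannot run forever.
    """
    for _ in range(max_attempts):
        sleep(interval_seconds)          # the "Wait" node (2 seconds)
        prediction = fetch_prediction()  # the "Check Prediction Status" node
        if prediction.get("status") in TERMINAL_STATUSES:
            return prediction            # "Check If Complete": finished
    raise TimeoutError("prediction did not finish within max_attempts polls")
```

Injecting `sleep` keeps the function easy to exercise in tests without real delays.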
Section 4: Process Result
- Process Result: Extracts and structures final output data:
  - status (success or failure)
  - output (raw response)
  - metrics (generation statistics)
  - timestamps (created and completed times)
  - audio_url (final generated audio link)
Benefit: Provides a clean, structured output that can be used in follow-up automations (e.g., sending audio to users, storing in a database, or sharing via email).
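For illustration, the structuring step might look like the sketch below. The field names `status`, `output`, `metrics`, `created_at`, and `completed_at` follow Replicate's prediction object; the function name `process_result` and the rule for deriving `audio_url` (use the output directly, or its first element if it is a list) are assumptions, since the description does not show how the node extracts the link.

```python
def process_result(prediction):
    """Reduce a finished prediction dict to the fields listed above."""
    output = prediction.get("output")
    # Assumption: the output is either a single URL string or a list
    # whose first element is the URL.
    if isinstance(output, list):
        audio_url = output[0] if output else None
    else:
        audio_url = output
    return {
        "status": prediction.get("status"),
        "output": output,
        "metrics": prediction.get("metrics", {}),
        "created_at": prediction.get("created_at"),
        "completed_at": prediction.get("completed_at"),
        "audio_url": audio_url,
    }
```

A failed prediction passes through the same function; its `output` and `audio_url` would simply be `None`.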
Workflow Overview
| Section | Purpose | Key Nodes | Benefit |
|---|---|---|---|
| Trigger & Authentication | Start workflow & authenticate | Manual Trigger, Set API Key | Secure execution |
| Create Prediction | Submit audio generation request | Create Prediction | Start model processing |
| Polling & Status Tracking | Monitor prediction progress | Extract Prediction ID, Wait, Check Prediction Status, Check If Complete | Ensures reliable completion |
| Process Result | Format and deliver output | Process Result | Clean audio result ready for use |
Final Benefits
- Secure authentication with Replicate
- Flexible audio generation using voxtral-small-24b-2507
- Reliable polling until results are ready
- Clean and structured audio output
Nodes Used
HTTP Request
Import
Download workflow.json and import into n8n:
Workflow menu → Import from File