How Much Does Gladia Transcription Cost?
Overview of Pricing
Our transcription pricing is simple and transparent. You only pay for the duration of the audio processed, regardless of the source platform (YouTube, TikTok, Instagram, podcasts, meetings, or any other audio source).
There are no additional fees for core capabilities. Features such as speaker diarization, automatic language detection, and multilingual transcription are included by default.
Starter Plan
The Starter plan is designed for developers and teams with moderate transcription needs. It offers flexible pay-as-you-go pricing with no upfront commitment.
Pricing
$0.61 / hour for asynchronous transcription
$0.75 / hour for real-time transcription
Included
10 hours of transcription free each month
30 concurrent requests (real-time)
25 concurrent requests (asynchronous)
Core capabilities
Automatic language detection and switching
Speaker diarization
Support for 100+ languages
Security & data controls
GDPR
HIPAA
AICPA SOC 2 Type 2
Support
Help center
Discord community
Growth Plan
The Growth plan is designed for fast-growing teams processing larger audio volumes. By committing to usage upfront, you unlock significantly lower unit pricing.
Pricing
Asynchronous transcription from $0.20 / hour
Real-time transcription from $0.25 / hour
This represents up to 67% savings compared to the Starter plan.
Everything in Starter, plus
Flexible concurrent request limits
Custom volume discounts
Security & data controls
Automatic model training opt-out
Support
Help center
Discord community
Enterprise Plan
The Enterprise plan is built for organizations with advanced needs, including custom deployments and dedicated infrastructure.
Pricing
Custom pricing
Capabilities
Custom models
Fine-tuning
Debundled pricing options
Tailored infrastructure and usage agreements
Contact our team to design a plan tailored to your organization.
Example Cost Calculation
Example: 50 podcast episodes of 30 minutes each
Total duration: 25 hours of audio
Plan | Mode | Estimated Cost |
|---|---|---|
Starter | Real-time | $18.75 |
Starter | Asynchronous | $15.25 |
Growth (starting price) | Real-time | $6.25 |
Growth (starting price) | Asynchronous | $5.00 |
Real-Time Transcription Billing
For real-time transcription, billing is based on the total duration of audio streamed through the WebSocket connection.
This includes:
spoken audio
silence
background noise
empty audio frames
Any time your WebSocket connection remains open and audio is streamed will count toward the billed duration.
How Multi-Channel Audio Is Billed
Multi-channel audio is billed depending on the content of each channel.
Different content across channels
ā Each channel is billed as a separate audio stream.
Identical content duplicated across channels
ā The audio is billed only once.
Have Questions?
Our team can help you choose the best plan based on your expected transcription volume and technical requirements.
Learn more or get started here:
https://www.gladia.io/pricing