Skip to content
Gladia Help Center home
Gladia Help Center home

How Much Does Gladia Transcription Cost?

Overview of Pricing

Our transcription pricing is simple and transparent. You only pay for the duration of the audio processed, regardless of the source platform (YouTube, TikTok, Instagram, podcasts, meetings, or any other audio source).

There are no additional fees for core capabilities. Features such as speaker diarization, automatic language detection, and multilingual transcription are included by default.

Starter Plan

The Starter plan is designed for developers and teams with moderate transcription needs. It offers flexible pay-as-you-go pricing with no upfront commitment.

Pricing

  • $0.61 / hour for asynchronous transcription

  • $0.75 / hour for real-time transcription

Included

  • 10 hours of transcription free each month

  • 30 concurrent requests (real-time)

  • 25 concurrent requests (asynchronous)

Core capabilities

  • Automatic language detection and switching

  • Speaker diarization

  • Support for 100+ languages

Security & data controls

  • GDPR

  • HIPAA

  • AICPA SOC 2 Type 2

Support

  • Help center

  • Discord community

Growth Plan

The Growth plan is designed for fast-growing teams processing larger audio volumes. By committing to usage upfront, you unlock significantly lower unit pricing.

Pricing

  • Asynchronous transcription from $0.20 / hour

  • Real-time transcription from $0.25 / hour

This represents up to 67% savings compared to the Starter plan.

Everything in Starter, plus

  • Flexible concurrent request limits

  • Custom volume discounts

Security & data controls

  • Automatic model training opt-out

Support

  • Help center

  • Discord community

Enterprise Plan

The Enterprise plan is built for organizations with advanced needs, including custom deployments and dedicated infrastructure.

Pricing

  • Custom pricing

Capabilities

  • Custom models

  • Fine-tuning

  • Debundled pricing options

  • Tailored infrastructure and usage agreements

Contact our team to design a plan tailored to your organization.


Example Cost Calculation

Example: 50 podcast episodes of 30 minutes each

Total duration: 25 hours of audio

Plan

Mode

Estimated Cost

Starter

Real-time

$18.75

Starter

Asynchronous

$15.25

Growth (starting price)

Real-time

$6.25

Growth (starting price)

Asynchronous

$5.00


Real-Time Transcription Billing

For real-time transcription, billing is based on the total duration of audio streamed through the WebSocket connection.

This includes:

  • spoken audio

  • silence

  • background noise

  • empty audio frames

Any time your WebSocket connection remains open and audio is streamed will count toward the billed duration.


How Multi-Channel Audio Is Billed

Multi-channel audio is billed depending on the content of each channel.

Different content across channels
→ Each channel is billed as a separate audio stream.

Identical content duplicated across channels
→ The audio is billed only once.


Have Questions?

Our team can help you choose the best plan based on your expected transcription volume and technical requirements.

Learn more or get started here:
https://www.gladia.io/pricing