Voice & Speech Translation API

Trusted by top teams worldwide

Built for production. Certified for compliance.

Cloud, self-hosted, or on-premises.

ISO 27001

Certified infrastructure

GDPR

EU data processing, DPA ready

99.9% SLA

Status page & incident response

Zero Training

Your data never trains our models

Access 60+ languages

Translate between more than 60 languages. Couldn’t find your language? Reach out, and we’ll talk about adding it.

Arabic

Bulgarian

Chinese

Czech

Danish

Dutch

English

Finnish

French

German

Greek

Hebrew

Hindi

Hungarian

Indonesian

Italian

Japanese

Korean

Polish

Portuguese

Portuguese (Brazilian)

Romanian

Russian

Slovak

Spanish

Swedish

Turkish

Ukrainian

Language auto-detection

Palabra automatically detects and switches between languages in real time, even if a single speaker code-switches mid-conversation.

Tone and context conveyed

By controlling the entire translation pipeline, Palabra can carry over key data from the original speech into the translated output. This preserves tone and conversational context throughout the process, with emotion delivery coming soon.

Voice cloning out of the box

With Palabra, you can automatically generate synthetic voices for each speaker. No manual setup needed.

Ultra-low latency

Palabra delivers speech-to-speech translation in real time with less than one second of delay. Predictive models tailored to each language pair cut lag, while full-stack control from ASR to TTS keeps every stage fast and efficient.

Custom glossaries

Palabra lets you define custom terminology to keep translations accurate and consistent. In real-time sessions, the speech-to-speech translation API applies your glossary rules so key terms are recognized and translated exactly as defined.
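
To make the idea of a glossary rule concrete, here is a minimal sketch of a client-side check that a translated sentence uses the required term. It is an illustration only and not Palabra's glossary mechanism: the `glossary` map, its example entries, and the `findGlossaryViolations` helper are all hypothetical names introduced for this example.

```javascript
// Hypothetical glossary: source-language term -> required target-language term.
const glossary = new Map([
  ["sprint review", "revue de sprint"],
  ["backlog", "backlog"], // a term that must stay untranslated
]);

// Returns the glossary entries whose source term occurs in `sourceText`
// but whose required translation is missing from `translatedText`.
function findGlossaryViolations(sourceText, translatedText, terms) {
  const source = sourceText.toLowerCase();
  const target = translatedText.toLowerCase();
  const violations = [];
  for (const [from, to] of terms) {
    if (source.includes(from.toLowerCase()) && !target.includes(to.toLowerCase())) {
      violations.push({ from, to });
    }
  }
  return violations;
}

// Usage: an empty array means every glossary term was honored.
console.log(
  findGlossaryViolations(
    "The sprint review is on Friday",
    "La revue de sprint est vendredi",
    glossary
  )
);
```

A check like this could run over transcripts after a session to confirm that key terms came through as defined.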

Feedback on our real-time translation services

Saptarshi Chakraborty

Co-Founder & Product Owner at EventLabs

“At EventLabs, we rely on Palabra for real-time translation during conferences and live events. Among all the solutions we’ve tested, Palabra stands out with the highest translation quality and the lowest latency by a significant margin. The platform’s speaker autodetection, differentiating between male and female voices and adapting translations in real time, has noticeably improved the listener experience. For us, Palabra is setting the benchmark for event translation technology.”

Anton Selikhov

CEO at Talo AI

“We built our product entirely on the Palabra API and it’s been an incredible foundation for what we do at Talo. The API’s natural language processing capabilities are reliable and accurate, which allowed us to bring real-time translations and captions to our users without starting from scratch.”

Designed for what you build

Impress your speakers and guests with Live Translation powered by Palabra’s very own language models, offering state-of-the-art accuracy and low latency.

Tech stack support

Real-time speech-to-speech translation streaming API for speech interpretation.

Scalable for any use case

Translate your online streams into multiple languages in real time.

Accurate in noisy environments

Create and manage custom voices for your Voices Collection.

Flexible deployment

Ensure accuracy for your industry with Palabra's custom glossaries.

What Teams Build with Our Speech Translation API

Communication & Collaboration Platforms

to strengthen global reach with seamless multilingual interaction and higher user satisfaction.

Global Call Centers & Customer Support Platforms

to scale multilingual support and win clients who serve global user bases.

Entertainment & Streaming Platforms

to expand global audiences, drive engagement, and reduce translation or interpreter costs.

Social Commerce Platforms

to increase sales conversion and global reach.

How Our Speech to Speech Translation API Works in 4 Steps


Import a ready-made Palabra client.

JavaScript

import { PalabraClient, getMicAudioTrack } from "palabra";

// The options below are filled in over the next steps.
const client = new PalabraClient({
  // clientId: "<API_CLIENT_ID>",
  // clientSecret: "<API_CLIENT_SECRET>",
  // originalTrack: getMicAudioTrack(),
  // translateFrom: "en",
  // translateTo: "fr"
});

Drop your API keys.

JavaScript

import { PalabraClient, getMicAudioTrack } from "palabra";

const client = new PalabraClient({
  clientId: "<API_CLIENT_ID>",
  clientSecret: "<API_CLIENT_SECRET>",
  // originalTrack: getMicAudioTrack(),
  // translateFrom: "en",
  // translateTo: "fr"
});

Pick your source and target languages.

JavaScript

import { PalabraClient, getMicAudioTrack } from "palabra";

const client = new PalabraClient({
  clientId: "<API_CLIENT_ID>",
  clientSecret: "<API_CLIENT_SECRET>",
  originalTrack: getMicAudioTrack(), // microphone audio as the source track
  translateFrom: "en",               // source language
  translateTo: "fr"                  // target language
});

Wire up your UI (e.g., button click handlers).

HTML

<div class="app">
  <div class="transcription"></div>
  <div class="controls">
    <button id="start">Start</button>
    <button id="stop">Stop</button>
  </div>
  <div class="translations"></div>
</div>

JavaScript

import { PalabraClient, getMicAudioTrack } from "palabra";

const client = new PalabraClient({
  clientId: "<API_CLIENT_ID>",
  clientSecret: "<API_CLIENT_SECRET>",
  originalTrack: getMicAudioTrack(),
  translateFrom: "en",
  translateTo: "fr"
});

// Start translating and play the translated audio when "Start" is clicked.
document.getElementById('start')
  .addEventListener('click', () => {
    client.startTranslation();
    client.playTranslationTrack();
  });

// Stop the translation session when "Stop" is clicked.
document.getElementById('stop')
  .addEventListener('click', () => {
    client.stopTranslation();
  });

Speak and hear translations in real time.

If you have any questions, please contact us at [email protected] or book a demo call.

Get Early Access Now

Read Docs

Real-time translation pricing, built around how you use it

Pay Monthly

Pay yearly

save up to 62%

Pay as you go

Text-to-Speech

Natural-sounding speech from text in real time — for voice agents and apps.

TTS

$0.03

/ 1,000 CHARACTERS

Start building

$50 free credits on sign-up

CORE FEATURES

FASTEST TTS IN THE WORLD — TTFA 35 MS

STREAMING INPUT & OUTPUT

ZERO-SHOT VOICE CLONING WITH DEACCENTING

Speech-to-Text

Realtime streaming transcription for voice apps, agents, and pipelines.

STT / ASR

$0.002

/ MIN OF AUDIO

Start building

$50 free credits on sign-up

CORE FEATURES

REALTIME STREAMING (WEBSOCKET)

PRECISE TIMESTAMPS & TURN DETECTION

AUTOMATIC LANGUAGE DETECTION

Speech-to-Speech

Ultra-precise, context-aware voice-to-voice translation in real time.

S2S

$0.04

/ MIN

Start building

$50 free credits on sign-up

CORE FEATURES

PREDICTIVE ALGORITHM — LOWEST LATENCY

60+ LANGUAGES

ZERO-SHOT VOICE CLONING WITH DEACCENTING

Up to 10 concurrent sessions per account.

Zero data retention — audio and text are never stored or used for training
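
As a rough illustration of how the pay-as-you-go rates above combine, the sketch below estimates a bill from usage figures. The rates are the ones listed on this page; the `RATES` object and the `estimateCost` helper are hypothetical names, and the sketch ignores the sign-up credits and any billing rules (rounding, minimums) that the page does not describe.

```javascript
// Pay-as-you-go rates as listed on this page, in USD.
const RATES = {
  ttsPer1000Chars: 0.03, // Text-to-Speech, per 1,000 characters
  sttPerMinute: 0.002,   // Speech-to-Text, per minute of audio
  s2sPerMinute: 0.04,    // Speech-to-Speech, per minute
};

// Estimates the total cost for the given usage (all fields optional).
function estimateCost({ ttsChars = 0, sttMinutes = 0, s2sMinutes = 0 } = {}) {
  const tts = (ttsChars / 1000) * RATES.ttsPer1000Chars;
  const stt = sttMinutes * RATES.sttPerMinute;
  const s2s = s2sMinutes * RATES.s2sPerMinute;
  // Round to four decimals to hide floating-point noise.
  return Math.round((tts + stt + s2s) * 1e4) / 1e4;
}

// Usage: one hour of speech-to-speech translation.
console.log(estimateCost({ s2sMinutes: 60 }));
```

Under these assumptions, an hour of speech-to-speech translation comes to 60 × $0.04 = $2.40.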

Starter

For occasional multilingual calls, presentations, and B2B webinars.

$60

$45

/MO

Capacity

3 hours

Capacity

$20 / hour

$15 / hour

Start free trial

CORE FEATURES

60+ languages

Conversation mode (two-way translation)

Presentation mode (one-to-many translation)

Custom glossaries

Voice cloning & Pre-recorded voices

Noise suppression & Music isolation

Live captions & transcripts

RECOMMENDED FOR YOU

Pro

For regular multilingual meetings, webinars, and presentations.

$200

$150

$113

/MO

Capacity

10 hours

Capacity

$15 / hour

$11.5 / hour

Start free trial

MORE HOURS AND BETTER RATE THAN STARTER

60+ languages

Conversation mode (two-way translation)

Presentation mode (one-to-many translation)

Custom glossaries

Voice cloning & Pre-recorded voices

Noise suppression & Music isolation

Live captions & transcripts

Team

For high-volume translation with the lowest self-serve rate.

$1000

$500

$375

/MO

Capacity

50 hours

Capacity

$10 / hour

$7.5 / hour

Start free trial

Talk to sales

EVERYTHING IN PRO, PLUS:

Multi-seat workspace with roles & permissions

SSO & audit logs

Dedicated account manager

Setup help

Business

For tailored capacity, control, and enterprise-grade support.

Custom

Tailored volume pricing

Talk to sales

EVERYTHING IN TEAM, PLUS:

Custom features and integration development

SLA and priority support

Security and procurement support

Enterprise onboarding

Regional server deployment

AUDIENCE

Unlimited listeners

QR-code access from attendees' own phones

Live translated captions

60+ languages, all running at once

STAGE & VOICES

Single- and multi-stage events

Cloned speaker voices (+ accent control)

Custom glossaries & context-aware delivery

Noise suppression & music isolation

RTMP/SRT and HLS integration

ROLLOUT & SUPPORT

Hands-on setup & onboarding

Dedicated account manager

Multi-seat workspace, roles & permissions

SLA & priority support

Regional servers & custom integrations

AUDIENCE

Unlimited listeners

Live audio & captions translation

60+ output languages

QR code audience access

STREAM & VOICES

RTMP/SRT and HLS integration

Cloned speaker voices (+ accent control)

Speaker diarization

Custom glossaries & context-aware delivery

Noise suppression & music isolation

ROLLOUT & SUPPORT

Multi-seat workspace, roles & permissions

Dedicated account manager

SLA & priority support

Regional server deployment

Custom features & integrations

Answers You Might Need

What industries can benefit most from a real-time voice translation API?

Industries that benefit most include customer service, enterprise software and collaboration, media and entertainment, and consumer apps.

How do I integrate the API into my existing application?

You can integrate the Palabra API into your application by using our SDKs or by connecting directly via WebRTC (for browsers) or WebSockets (for servers).

Which programming languages and SDKs are supported?

Palabra provides SDKs and client libraries for Python and JavaScript. For other languages, Palabra integrates through WebRTC (frontend) and WebSockets (backend).

Is HTML & XML handling supported for translation?

As a real-time speech-to-speech translation solution, Palabra supports audio input only.

Can I customize translations with a glossary?

Yes. Glossaries let you define how Palabra translates specific terms. Once enabled, your glossary applies across all Palabra applications and sessions.

How is data secured during and after translation?

All conversations are encrypted in transit and processed entirely in memory. Palabra does not store voice data on its servers; once audio is translated, it is deleted.

Does the API store or log voice data?

No. By default, Palabra does not store or log user data, nor is user data used to train models.

Can the API be deployed in a private cloud or on-premises environment?

Yes. The Palabra API can run in a private cloud or on-premises, fully under your own security and compliance controls.

What audio formats does Palabra.ai support?

The Palabra.ai WebSocket integration supports these input audio formats: Opus, PCM_S16LE, and WAV. For output, it supports PCM_S16LE and ZLIB_PCM_S16LE.
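
For readers unfamiliar with the PCM_S16LE name, the sketch below shows one common way to produce that layout (signed 16-bit samples, little-endian byte order) from floating-point samples in the range -1 to 1, such as those the Web Audio API provides. The `floatToPcmS16le` helper is a hypothetical name, and the sample rate, channel count, and the way the bytes are sent to Palabra are outside what this page specifies.

```javascript
// Converts float samples (range -1..1) to signed 16-bit little-endian PCM.
function floatToPcmS16le(samples) {
  const buffer = new ArrayBuffer(samples.length * 2); // 2 bytes per sample
  const view = new DataView(buffer);
  for (let i = 0; i < samples.length; i++) {
    // Clamp to the valid range before scaling.
    const s = Math.max(-1, Math.min(1, samples[i]));
    // Negative values scale toward -32768, positive values toward 32767.
    const value = s < 0 ? Math.round(s * 0x8000) : Math.round(s * 0x7fff);
    view.setInt16(i * 2, value, true); // true = little-endian byte order
  }
  return buffer;
}

// Usage: four samples become eight bytes of PCM data.
const pcm = floatToPcmS16le(new Float32Array([0, 1, -1, 0.5]));
console.log(pcm.byteLength);
```

Passing `true` as the third argument of `setInt16` is what fixes the byte order, so the result does not depend on the endianness of the machine running the code.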

What is the maximum audio length supported per request?

Palabra processes real-time audio streams, which by default can run indefinitely.

How does the API maintain accuracy in noisy environments?

Palabra includes integrated noise suppression, so speech remains accurate even in noisy conditions. No additional preprocessing is required.
