2,298 questions with Azure AI Speech tags

Sort by: Updated
0 answers

Issue Creating Azure AI Language Resource in Custom Question Answering Lab

Hello, I am currently working on the Microsoft Applied Skills lab for Custom Question Answering. When attempting to create the Azure AI Language resource, the deployment fails with the following error: RequestDisallowedByPolicy – The resource was…

Azure AI Speech
Azure AI Speech

An Azure service that integrates speech processing into apps and services.

2,298 questions
asked 2026-03-08T05:34:50.2333333+00:00
Pornpra Chumnanvanichkul 0 Reputation points
edited the question 2026-03-08T05:46:08.1366667+00:00
Pornpra Chumnanvanichkul 0 Reputation points
0 answers

CRITICAL ISSUE Azure AI Speech SDK – Numbers getting Added , Deleted and Substituted and sometimes Exceeds too much time while using the microsoft realtime speech to text conginitve services API

We are using Azure Speech Service with the browser Speech SDK for real-time speech-to-text transcription. We are observing an issue when users speak continuous digits. The recognizer sometimes returns a significantly different number of digits than were…

Azure AI Speech
Azure AI Speech

An Azure service that integrates speech processing into apps and services.

2,298 questions
asked 2026-03-06T14:12:10.9733333+00:00
Aravind ks 20 Reputation points
edited the question 2026-03-06T14:14:26.67+00:00
Aravind ks 20 Reputation points
2 answers

Pronunciation Assessment with Language en-GB- Phoneme symbols

I am using the pronunciation assessment API for language en-GB Doing the assessment at phoneme level The documentation does mention this: AccuracyScore: Phoneme level, Syllable level (en-US only), Word level, Full Text level I get a response with empty…

Azure AI Speech
Azure AI Speech

An Azure service that integrates speech processing into apps and services.

2,298 questions
asked 2026-02-18T17:07:02.75+00:00
Anju Aggarwal 0 Reputation points
commented 2026-03-06T11:30:41.6133333+00:00
SRILAKSHMI C 14,815 Reputation points Microsoft External Staff Moderator
2 answers

Please clarify the conflicting information regarding permission to use the free tier of Azure Speech for commercial purposes, such as narration of a YouTube video.

Hello everyone, I had previously asked a question on this forum regarding whether the Free Tier F0 of Azure Speech can be used for commercial purposes such as narration of a YouTube video:…

Azure AI Speech
Azure AI Speech

An Azure service that integrates speech processing into apps and services.

2,298 questions
asked 2026-03-03T07:28:30.2+00:00
KRJ14 0 Reputation points
commented 2026-03-05T01:09:38.88+00:00
Manas Mohanty 14,750 Reputation points Microsoft External Staff Moderator
1 answer

Can the audio generated by Azure Speech Studio's free tier (monthly limit of 500,000 characters) be used for commercial purposes like for example: narration of a youtube video?

Hello! I've searched this Q&A site extensively but found conflicting answers and hence, I thought I should I ask directly. I've Azure Speech Studio's free tier (monthly limit of 500,000 characters) and can I use the audio generated by that for…

Azure AI Speech
Azure AI Speech

An Azure service that integrates speech processing into apps and services.

2,298 questions
asked 2026-03-01T05:39:56.72+00:00
KRJ14 0 Reputation points
commented 2026-03-03T17:40:30.41+00:00
Marcin Policht 81,705 Reputation points MVP Volunteer Moderator
2 answers

Custom Neural Voice (CNV Pro) model in East US and East US 2 is failing to train the model

Custom Neural Voice (CNV Pro) model in East US and East US 2, and the training consistently fails after several hours with an internal/unknown error. The dataset uploads successfully and passes validation, but the training job never completes. It fails…

Azure AI Speech
Azure AI Speech

An Azure service that integrates speech processing into apps and services.

2,298 questions
asked 2026-03-01T17:26:01.8533333+00:00
Ramachandran, Iaiswarya I 20 Reputation points Microsoft Employee
commented 2026-03-03T14:29:12.7+00:00
Ramachandran, Iaiswarya I 20 Reputation points Microsoft Employee
1 answer

Special character ampersand (“&”) breaks word boundaries in Azure Text-to-Speech

Hello, I’m encountering an issue with word boundary events in Azure Text-to-Speech when the input text contains the ampersand character (&). Context Locale: fr-FR Neural French voice (e.g. fr-FR-Remy:DragonHDLatestNeural) Batch synthesis API …

Azure AI Speech
Azure AI Speech

An Azure service that integrates speech processing into apps and services.

2,298 questions
asked 2026-02-26T10:51:25.2866667+00:00
Soulaïman Marsou 0 Reputation points
answered 2026-03-03T00:44:38.1+00:00
SRILAKSHMI C 14,815 Reputation points Microsoft External Staff Moderator
2 answers

Azure AI Foundry agents intermittently failing with JSON parsing error (empty response, no schema changes)

We are experiencing intermittent but increasingly frequent failures when running agents on Azure AI Foundry. Agents that were working correctly in the same environment, with no code or schema changes, suddenly started failing with the following…

Azure AI Speech
Azure AI Speech

An Azure service that integrates speech processing into apps and services.

2,298 questions
asked 2026-02-26T13:01:19.7233333+00:00
Maximiliano Gutierrez 0 Reputation points
edited the question 2026-03-02T18:17:38.46+00:00
Jilakara Hemalatha 10,205 Reputation points Microsoft External Staff Moderator
2 answers

Python code to generate ephemeral token for gpt-4o-mini-transcribe OR gpt-4o-transcribe

Hi Team, We're unable to find ways/python code to generate ephemeral token for gpt-4o-mini-transcribe OR gpt-4o-transcribe. Searched online & there are some references for generating such tokens for realtime API. But none for…

Azure AI Speech
Azure AI Speech

An Azure service that integrates speech processing into apps and services.

2,298 questions
asked 2026-02-25T12:42:59.1733333+00:00
GenixPRO 171 Reputation points
answered 2026-03-02T16:47:22.3466667+00:00
Anshika Varshney 7,970 Reputation points Microsoft External Staff Moderator
2 answers

can some one help, how to config voicelive sdk to recieve animation blendshapes and viseme_id

it try to add this but no animation data recieve. modalities: ["text", "audio", 'animation'], outputAudioTimestampYypes: ["word"], animation: { modelName: "default", outputs:…

Azure AI Speech
Azure AI Speech

An Azure service that integrates speech processing into apps and services.

2,298 questions
asked 2026-02-25T17:09:02.8933333+00:00
Dadong Hu 0 Reputation points
commented 2026-03-02T12:30:00.6933333+00:00
Anshika Varshney 7,970 Reputation points Microsoft External Staff Moderator
1 answer

Has MS abandoned human tech support?

Reading some of the nightmare scenarios on these forums and realizing that human tech support is a thing of the past really alarms me. It's obvious that since companies such as MS are pouring so much into AI, they've abandoned tech support from humans.…

Azure AI Speech
Azure AI Speech

An Azure service that integrates speech processing into apps and services.

2,298 questions
asked 2026-03-01T21:28:29.3033333+00:00
Ed Myers 0 Reputation points
answered 2026-03-02T00:29:40.7033333+00:00
Jerald Felix 10,970 Reputation points
1 answer

Issues with Azure Speech Services: Incorrect transcription of "draft" as "draught" and "£" as "lbs" in UK English

I'm using Azure Speech Services with the language set to UK English, and I've noticed two recurring transcription issues: When I dictate the word "draft", it consistently transcribes as "draught", even when the context clearly favors…

Azure AI Speech
Azure AI Speech

An Azure service that integrates speech processing into apps and services.

2,298 questions
asked 2025-06-12T11:36:24.3566667+00:00
Niki Kariappa 0 Reputation points
answered 2026-03-01T23:36:12.31+00:00
Mike Williams 0 Reputation points
1 answer

Azure: Deactivated Severity: 2 alert-0225185834

2 emails Azure: Deactivated Severity: 2 alert-0225185834

Azure AI Speech
Azure AI Speech

An Azure service that integrates speech processing into apps and services.

2,298 questions
asked 2026-02-25T21:10:39.2966667+00:00
Danny FitzGerald 0 Reputation points
answered 2026-02-25T21:10:48.5166667+00:00
Q&A Assist
1 answer

High Initial Latency with Multi-Language Detection (3+ Languages)

Hello Azure Speech Team, We're experiencing significant initial latency when using Continuous Language Identification with 2+ languages in production. Configuration: Languages: 3 languages (en-IN, te-IN, hi-IN) Mode:…

Azure AI Speech
Azure AI Speech

An Azure service that integrates speech processing into apps and services.

2,298 questions
asked 2026-02-14T10:23:11.7166667+00:00
ello ai 5 Reputation points
commented 2026-02-25T07:32:16.5566667+00:00
SRILAKSHMI C 14,815 Reputation points Microsoft External Staff Moderator
2 answers

gpt-4o-transcribe for real-time speech-to-text transcription ---slow speed

When I try to use gpt-4o-transcribe for real-time speech-to-text transcription, it takes about 1.5-2 seconds for a 2s mp3 file from sending the request to receiving the first token. Are there improved methods or other model options? Additionally,…

Azure AI Speech
Azure AI Speech

An Azure service that integrates speech processing into apps and services.

2,298 questions
asked 2026-02-24T03:40:45.7033333+00:00
yu.lili 0 Reputation points
answered 2026-02-25T06:25:07.45+00:00
Karnam Venkata Rajeswari 280 Reputation points Microsoft External Staff Moderator
1 answer

Custom Avatar Model Training Showing as Processing After 16 Hours

I created a Azure AI Service Resource in West US 2(Test Avatar) and then went to Speech Studio, uploaded all the required training Data and then started the model training. But its showing 1hr left estimated for last 8 Hours.

Azure AI Speech
Azure AI Speech

An Azure service that integrates speech processing into apps and services.

2,298 questions
asked 2026-02-25T02:16:06.2066667+00:00
Trinanjan Majumder 0 Reputation points Microsoft Employee
answered 2026-02-25T03:38:09.5266667+00:00
SRILAKSHMI C 14,815 Reputation points Microsoft External Staff Moderator
3 answers

Transcription using gpt-4o-transcribe with gpt-realtime is failing in useast2

Hello, I am trying to use gpt-4o-transcribe with gpt-realtime in useast2, and it is consistently failing. I am using gpt-realtime with websockets as per the documentation. I am seeing the following event:…

Azure AI Speech
Azure AI Speech

An Azure service that integrates speech processing into apps and services.

2,298 questions
asked 2026-02-12T08:11:59.3233333+00:00
PRABU WEERASINGHE 0 Reputation points
commented 2026-02-24T12:47:18.32+00:00
SRILAKSHMI C 14,815 Reputation points Microsoft External Staff Moderator
2 answers One of the answers was accepted by the question author.

Pricing for Azure Voice Live API

We are evaluating Azure Voice live API for our Contact Center use case, automating with AI. However, we could not find the latest pricing of Azure Voice live API - we want to use Pro version - use Azure speech, GPT 5.2 Chat (or suitable chat models).…

Azure AI Speech
Azure AI Speech

An Azure service that integrates speech processing into apps and services.

2,298 questions
asked 2026-02-17T12:53:48.97+00:00
Sankar Ramakrishnan, Prathap 20 Reputation points
accepted 2026-02-24T05:09:15.0766667+00:00
Sankar Ramakrishnan, Prathap 20 Reputation points
2 answers One of the answers was accepted by the question author.

Function Calling via Foundry Agent in Voice Live API

Below are the quickstarts for foundry agent with Voice Live API, function calling with Voice Live API and foundry agent with function calling, respectively: …

Azure AI Speech
Azure AI Speech

An Azure service that integrates speech processing into apps and services.

2,298 questions
asked 2026-02-03T11:49:44.1133333+00:00
Cem Işık Doğru 40 Reputation points
commented 2026-02-17T07:41:33.95+00:00
Manoj Kumar Ragupathi 0 Reputation points
1 answer

Different English accent is not working

I’m running into something odd with the voice accents in my setup. Whenever I switch to different English accents—like English (US), English (India), English (Australia)—the voice doesn’t actually change. It just keeps sounding like the default English…

Azure AI Speech
Azure AI Speech

An Azure service that integrates speech processing into apps and services.

2,298 questions
asked 2026-01-15T11:08:41.6133333+00:00
Wie Dizon 0 Reputation points
commented 2026-02-13T05:31:55.9466667+00:00
SRILAKSHMI C 14,815 Reputation points Microsoft External Staff Moderator