Azure Real-Time diarization
Hi! I am working on a project in Python, in which I use Azure AI Speech Service. More specifically, I implemented real-time dairization using the azure.cognitiveservices.speech.transcription.ConversationTranscriber class. And now I am working on speaker…
Azure AI Speech
Can we get a confidence score for the AutoDetect the source language using AutoDetectSourceLanguageConfig
I am working on Speech to Text services in Azure. We're using autodetect feature with both specified set and open range using below class. Is there a way that I can get the confidence score for the source language detected. So that only if the score…
Azure AI Speech

Cannot run SPX under dotnet 8 for mac arm64 version
I have followed instructions to install dotnet 8 and the Azure speech CLI (from: https://learn.microsoft.com/en-us/azure/ai-services/speech-service/spx-basics?tabs=windowsinstall%2Cterminal) and can verify my dotnet installation, and I have updated the…
Azure AI Speech

unable to estimate avatar usage
We are not able to correctly understand the usage of a AVATAR Resources on our Azure Resources.
Azure AI Speech

Azure Speech SDK - Formal list of Languages/Locales Supported for Semantic Speech segmentation
Per https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-recognize-speech?pivots=programming-language-csharp Semantic segmentation isn't available for all languages and locales. Can Microsoft provide a list of languages/locales for…
Azure AI Speech

Custom Speech Dataset
So I'm experimenting with a project and I have compiled my dataset, the first step is the Azure Speech Services, TTS> Custom Speech, I wanted to upload my dataset to my project, it contains both Audio and Transcript, I was the one who created the…
Azure AI Speech
Azure AI Speech Recognition Batch Transcription Services are Down
Hello Team, it has been brought to our attention by our users that Azure AI Speech Recognition Batch Transcription Services are down from past 8 hours (beginning 1:30 AM UTC). Can you check and ensure service normalcy at the earliest?
Azure AI Speech

Latency in Azure Speech service
Latency In azure speech service , due to which our speech to text job is failing
Azure AI Speech
Intermittent authentication errors using batch transcription
Hello, I am facing intermittent container authentication issues when transcribing an audio with the batch transcription API. The Speech resource has Storage Blob Data Reader permission on the container where the audio files are stored. Yesterday, I tried…
Azure AI Speech
30 secs timeout on Azure speech to text
Hello, I'm experiencing an issue with Azure Speech-to-Text where, in continuous recognition mode, it outputs a RECOGNIZED result every 30 seconds, regardless of whether speech has stopped. Adjusting settings like Speech_SegmentationSilenceTimeoutMs has…
Azure AI Speech
Azure Speech SDK JavaScript - Silence timeout properties not working for continuous recognition
I'm using the Azure Speech SDK for JavaScript (microsoft-cognitiveservices-speech-sdk) to implement continuous speech recognition, but I'm unable to increase the silence timeout duration. The recognition still stops after the default silence period (~2-3…
Azure AI Speech

Tranlsation application - the synthesized audio text may not perfectly match the original video timing, TTS speed mismatch
Hi Team, I’m developing a language translation application that generates translated video files using Azure Text-to-Speech and .NET (C#). The workflow involves generating audio from translated text and combining it with video visuals. However, I’ve…
Azure AI Speech
Does Azure Pronunciation Assessment handle Hong Kong, Japanese, and other East Asian English accents accurately?
We’re building a language learning app for English speakers in Hong Kong, Japan, and other East Asian countries. We plan to integrate Azure Speech Service — Pronunciation Assessment using PHP (Laravel). My main question is: How well does it handle…
Azure AI Speech
[Setting up STT Resource]: Configure your account
I am new to Azure and have been trying to create resource for STT. However, I am stuck in the subscription process, where it seems I must "configure my acocunt". The subscription wouldn't be marked as complete until I do so, and to do so, there…
Azure AI Speech
Why may the Basic Custom Keyword model be taking over 9 hours to complete for 1 word with 1 prefix?
Hey folks, I have created 1 custom keyword with the word "Hey" as a prefix. There are 2 distinct pronunciations chosen for the word and the prefix and the Model Type is "Basic". In the display it mentions that the model may take…
Azure AI Speech
Cloud Speech To Text stopped working on old Speech SDK version
This issue is really urgent. Cloud Speech To Text stopped working on old Speech SDK version in devices that are currently in customers. The devices are running Android applications that use Microsoft Speech SDK. We've confirmed that upgrading the…
Azure AI Speech

Problem creating SpeechRecognizer with audio stream input using node.js Speech SDK
Using Speech SDK for JavaScript v1.44.0, and following the STT in-memory streaming example, but using the fromEndpoint API to create Recognizer, as recommended in the Release Notes for that SDK version. Node.js is v22 LTS, running in Azure Cloud as an…
Azure AI Speech
Where is data stored when using the fast transcription API?
Dear Microsoft, for a project, we are using the fast transcription API from Azure as a component to transcribe text from audio. We are working with sensitive data and would like to be sure to have control or at least insight into where our data is stored…
Azure AI Speech
I am using Fast Transcription API to get transcripts for my media file, I am getting "Too Many Request" error in first try itself for free tier
I am using a Free resource of Fast Transcription API for testing purpose under Azure AI speech service. I am getting "Too Many Request" error on trying for a first time it self. And this has happen many times to different users when they…
Azure AI Speech
Tried to create a resource for a class instruction but it says it is disallowed by policy
I am taking a course (about Speech to Text) on my own and i need help: the class says to create a resource in a sandbox but when I do it says Resource 'learn-account-14303' was disallowed by policy. I have no idea how to resolve this. Can someone help…