Azure AI Speech

0 answers

Azure Real-Time diarization

Hi! I am working on a project in Python, in which I use Azure AI Speech Service. More specifically, I implemented real-time dairization using the azure.cognitiveservices.speech.transcription.ConversationTranscriber class. And now I am working on speaker…

asked

Karyna Khinevich 0

0 answers

Can we get a confidence score for the AutoDetect the source language using AutoDetectSourceLanguageConfig

I am working on Speech to Text services in Azure. We're using autodetect feature with both specified set and open range using below class. Is there a way that I can get the confidence score for the source language detected. So that only if the score…

asked

Midhilesh Momidi 0

edited a comment

Manas Mohanty 6,285 Microsoft External Staff Moderator

1 answer

Cannot run SPX under dotnet 8 for mac arm64 version

I have followed instructions to install dotnet 8 and the Azure speech CLI (from: https://learn.microsoft.com/en-us/azure/ai-services/speech-service/spx-basics?tabs=windowsinstall%2Cterminal) and can verify my dotnet installation, and I have updated the…

asked

Noah Scott 0

answered

Amira Bedhiafi 34,491 Volunteer Moderator

2 answers

unable to estimate avatar usage

We are not able to correctly understand the usage of a AVATAR Resources on our Azure Resources.

asked

Alessandro Brizzolesi 40

commented

Manas Mohanty 6,285 Microsoft External Staff Moderator

2 answers

Azure Speech SDK - Formal list of Languages/Locales Supported for Semantic Speech segmentation

Per https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-recognize-speech?pivots=programming-language-csharp Semantic segmentation isn't available for all languages and locales. Can Microsoft provide a list of languages/locales for…

asked

VS 0

answered

Prashanth Veeragoni 5,485 Microsoft External Staff Moderator

1 answer

Custom Speech Dataset

So I'm experimenting with a project and I have compiled my dataset, the first step is the Azure Speech Services, TTS> Custom Speech, I wanted to upload my dataset to my project, it contains both Audio and Transcript, I was the one who created the…

asked

Sharmaine Erika Delgado 25

commented

Sharmaine Erika Delgado 25

1 answer

Azure AI Speech Recognition Batch Transcription Services are Down

Hello Team, it has been brought to our attention by our users that Azure AI Speech Recognition Batch Transcription Services are down from past 8 hours (beginning 1:30 AM UTC). Can you check and ensure service normalcy at the earliest?

asked

Panini Devs 0

answered

Amira Bedhiafi 34,491 Volunteer Moderator

2 answers

Latency in Azure Speech service

Latency In azure speech service , due to which our speech to text job is failing

asked

Apoorv Kumar 0

commented

Ravada Shivaprasad 550 Microsoft External Staff Moderator

1 answer

Intermittent authentication errors using batch transcription

Hello, I am facing intermittent container authentication issues when transcribing an audio with the batch transcription API. The Speech resource has Storage Blob Data Reader permission on the container where the audio files are stored. Yesterday, I tried…

asked

Yan Gonçalves 0

commented

Pavankumar Purilla 8,660 Microsoft External Staff Moderator

1 answer

30 secs timeout on Azure speech to text

Hello, I'm experiencing an issue with Azure Speech-to-Text where, in continuous recognition mode, it outputs a RECOGNIZED result every 30 seconds, regardless of whether speech has stopped. Adjusting settings like Speech_SegmentationSilenceTimeoutMs has…

asked

Nandhu TS 0

commented

Ravada Shivaprasad 550 Microsoft External Staff Moderator

1 answer

Azure Speech SDK JavaScript - Silence timeout properties not working for continuous recognition

I'm using the Azure Speech SDK for JavaScript (microsoft-cognitiveservices-speech-sdk) to implement continuous speech recognition, but I'm unable to increase the silence timeout duration. The recognition still stops after the default silence period (~2-3…

asked

MI Sajid 0

answered

Amira Bedhiafi 34,491 Volunteer Moderator

1 answer

Tranlsation application - the synthesized audio text may not perfectly match the original video timing, TTS speed mismatch

Hi Team, I’m developing a language translation application that generates translated video files using Azure Text-to-Speech and .NET (C#). The workflow involves generating audio from translated text and combining it with video visuals. However, I’ve…

asked

Aravind R 0

commented

Aravind R 0

1 answer

Does Azure Pronunciation Assessment handle Hong Kong, Japanese, and other East Asian English accents accurately?

We’re building a language learning app for English speakers in Hong Kong, Japan, and other East Asian countries. We plan to integrate Azure Speech Service — Pronunciation Assessment using PHP (Laravel). My main question is: How well does it handle…

asked

Darsh Al 20

accepted

Darsh Al 20

0 answers

[Setting up STT Resource]: Configure your account

I am new to Azure and have been trying to create resource for STT. However, I am stuck in the subscription process, where it seems I must "configure my acocunt". The subscription wouldn't be marked as complete until I do so, and to do so, there…

asked

nimesh.s 0

commented

Pavankumar Purilla 8,660 Microsoft External Staff Moderator

1 answer

Why may the Basic Custom Keyword model be taking over 9 hours to complete for 1 word with 1 prefix?

Hey folks, I have created 1 custom keyword with the word "Hey" as a prefix. There are 2 distinct pronunciations chosen for the word and the prefix and the Model Type is "Basic". In the display it mentions that the model may take…

asked

Shreyas Chitransh 20

accepted

Shreyas Chitransh 20

2 answers

Cloud Speech To Text stopped working on old Speech SDK version

This issue is really urgent. Cloud Speech To Text stopped working on old Speech SDK version in devices that are currently in customers. The devices are running Android applications that use Microsoft Speech SDK. We've confirmed that upgrading the…

asked

Tomasz Stomski 20

answered

Amira Bedhiafi 34,491 Volunteer Moderator

1 answer

Problem creating SpeechRecognizer with audio stream input using node.js Speech SDK

Using Speech SDK for JavaScript v1.44.0, and following the STT in-memory streaming example, but using the fromEndpoint API to create Recognizer, as recommended in the Release Notes for that SDK version. Node.js is v22 LTS, running in Azure Cloud as an…

asked

Michael Pickering 0

commented

Michael Pickering 0

1 answer

Where is data stored when using the fast transcription API?

Dear Microsoft, for a project, we are using the fast transcription API from Azure as a component to transcribe text from audio. We are working with sensitive data and would like to be sure to have control or at least insight into where our data is stored…

asked

Lenting, Coen 20

edited a comment

Lenting, Coen 20

1 answer

I am using Fast Transcription API to get transcripts for my media file, I am getting "Too Many Request" error in first try itself for free tier

I am using a Free resource of Fast Transcription API for testing purpose under Azure AI speech service. I am getting "Too Many Request" error on trying for a first time it self. And this has happen many times to different users when they…

asked

Nikita Khandare 60

accepted

Nikita Khandare 60

2 answers

Tried to create a resource for a class instruction but it says it is disallowed by policy

I am taking a course (about Speech to Text) on my own and i need help: the class says to create a resource in a sandbox but when I do it says Resource 'learn-account-14303' was disallowed by policy. I have no idea how to resolve this. Can someone help…

asked

Linda Manganaro 20

commented

Linda Manganaro 20

Filter

Content

2,075 questions with Azure AI Speech tags

Azure Real-Time diarization

Can we get a confidence score for the AutoDetect the source language using AutoDetectSourceLanguageConfig

Cannot run SPX under dotnet 8 for mac arm64 version

unable to estimate avatar usage

Azure Speech SDK - Formal list of Languages/Locales Supported for Semantic Speech segmentation

Custom Speech Dataset

Azure AI Speech Recognition Batch Transcription Services are Down

Latency in Azure Speech service

Intermittent authentication errors using batch transcription

30 secs timeout on Azure speech to text

Azure Speech SDK JavaScript - Silence timeout properties not working for continuous recognition

Tranlsation application - the synthesized audio text may not perfectly match the original video timing, TTS speed mismatch

Does Azure Pronunciation Assessment handle Hong Kong, Japanese, and other East Asian English accents accurately?

[Setting up STT Resource]: Configure your account

Why may the Basic Custom Keyword model be taking over 9 hours to complete for 1 word with 1 prefix?

Cloud Speech To Text stopped working on old Speech SDK version

Problem creating SpeechRecognizer with audio stream input using node.js Speech SDK

Where is data stored when using the fast transcription API?

I am using Fast Transcription API to get transcripts for my media file, I am getting "Too Many Request" error in first try itself for free tier

Tried to create a resource for a class instruction but it says it is disallowed by policy