An Azure service that integrates speech processing into apps and services.
Loading subscriptions
I have set up an account and I am trying to use the Azure speech studio. But when I go there, it just gets stuck on 'Loading subscriptions'. Is it to do with multiple account conflicts or something?
Azure AI Speech
Azure Open AI Realtime GA Model not working
I have developed a conversational AI agent using Azure OpenAI Realtime Preview model which is getting depracted in April. All the audio functions were working properly. When I moved this to Realtime GA model when ever I start speaking its terminating.…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
Azure Fast Transcription – Inability to enforce British spelling (en‑GB) without post‑processing
Hello, I am currently using Azure Speech – Fast Transcription with the locale explicitly set to en-GB, on a Speech resource deployed in UK South. In most cases, the recognition quality is good. However, I consistently observe that some words are…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
CRITICAL ISSUE Azure AI Speech SDK – Numbers getting Added , Deleted and Substituted and sometimes Exceeds too much time while using the microsoft realtime speech to text conginitve services API
We are using Azure Speech Service with the browser Speech SDK for real-time speech-to-text transcription. We are observing an issue when users speak continuous digits. The recognizer sometimes returns a significantly different number of digits than were…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
Question about temas premium ai features- regarding data privacy
Im currently Using Teams on my university account. im interested in using the teams premium ai features in meetings, but need to know where the Ai is processing the data. Is the ai running locally on my teams app/ my computer or in the cloud? also what…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
where can I find API section in Speech Studio?
I am developing an app for Arabic language self learning want to integrate speech capabilities into my website but I am not finding the API section in Speech Studio
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
Custom Avatar Quality not Upto the Mark
I recently create my own custom avatar using Azure Speech Studio. Even with the training video recordings being done in a studio with a professional camera and a green screen background, when an Avatar video (batch) is generated, the hands often become…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
Speech Studio "Text to Speech" not respecting <break> markup
The text to speech renderer fails to apply the "break" markup in the Audio Content Creation interface of the Speech Studio. I haven't tried other markup. Yesterday, it didn't work with RyanMultinationalNeural, but worked with AndrewNeural. Now…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
Cohere Rerank v4.0 Fast returns 500 error in Azure AI Foundry when using DefaultAzureCredential
Hi, I am experiencing persistent 500 Internal Server Error issues when using Cohere Rerank v4.0 Fast in Azure AI Foundry. I have carefully followed the guidance provided in similar threads: Verified the exact deployment name:…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
Speech support for Pashto language
Do I understand the documentation correctly that translation services for Pashto language are text only? Is there any support for speech-to-text or speech synthesis for Pashto? What about Dari?
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
How much compute hours is required to fine tune approximately 5-10 hours of audio data on custom speech to text fine tune
I'd like to know if there is any relevancy or blogs available to get to know the hours of audio that'd be processed based on custom speech training hours. For e.g. 10 hours of audio equivalent to 2-3 hours of compute hours. anything available to get…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
How to adjust Azure speech to interpret audio effectively
Hello I am using Azure speech to build my English speech clarity tool but what I am finding is that no matter what I do with interpretating or putting guardrails, I cannot build it when Azure presumes or guesses what sound or word it hears thus inflating…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
Azure AI Foundry agents intermittently failing with JSON parsing error (empty response, no schema changes)
We are experiencing intermittent but increasingly frequent failures when running agents on Azure AI Foundry. Agents that were working correctly in the same environment, with no code or schema changes, suddenly started failing with the following…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
"Can I use the Azure Speech-to-Text fast transcription REST API for short audio to perform pronunciation assessment? How do I use it?
My problem is that the mode right now is too expensive for my work, which is $1.3 per hour. I want to try to use fast transcription mode to to perform pronunciation assessment,which may finally cost $0.66 per hour. Can I? Here is the example code from…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
Issue Creating Azure AI Language Resource in Custom Question Answering Lab
Hello, I am currently working on the Microsoft Applied Skills lab for Custom Question Answering. When attempting to create the Azure AI Language resource, the deployment fails with the following error: RequestDisallowedByPolicy – The resource was…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
Add Persian/Farsi as a language option for Language Identification (LID) in Speech Service
Wondering if it is possible to use or add Persian/Farsi as a language option for Language Identification (LID) in Speech Service? It is an option in almost all other Speech Service capabilities - is adding it on the roadmap for Azure AI?
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
Speech Studio Text to Speech - Silence Not Working
I'm trying to add silence in my text to speech files (see image below), but the silence tags will not actually generate the specified silence where I input them when I preview the audio. The silence doesn't show up in the exported output either. Am I…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
Deploy Azure AI Speech with CognitiveResource or Microsoft Foundry
Someone any information on if we should deploy AI Speech via microsoft.cognitiveservices/accounts or new via MicrosoftFoundry? Will the microsoft.cognitiveservices/accounts be deprecated? The old speech studio at least doesn't seem to support…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
Special character ampersand (“&”) breaks word boundaries in Azure Text-to-Speech
Hello, I’m encountering an issue with word boundary events in Azure Text-to-Speech when the input text contains the ampersand character (&). Context Locale: fr-FR Neural French voice (e.g. fr-FR-Remy:DragonHDLatestNeural) Batch synthesis API …
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
Pronunciation Assessment with Language en-GB- Phoneme symbols
I am using the pronunciation assessment API for language en-GB Doing the assessment at phoneme level The documentation does mention this: AccuracyScore: Phoneme level, Syllable level (en-US only), Word level, Full Text level I get a response with empty…
Azure AI Speech
An Azure service that integrates speech processing into apps and services.