Why have they changed Vivienne Multilingual's voice?

Question

Why have they changed Vivienne Multilingual's voice?

Albert MG 0

Since March 17th, Vivienne's multilingual voice has changed; it's no longer the same. It sounds deeper, and it's not the same in Spanish (Spain) or English (US). The voice the model has isn't the multilingual one; it's the default one, and it sounds terrible. It's as if something has been misconfigured. This isn't just a personal issue; I've tried it in Azure, and even the same voice that appears in Clipchamp, and it's definitely not the same.

0 comments

1 answer

Your answer

Answer 1

VivienneMultilingualNeural is part of the Azure Speech multilingual and HD voice family, which is actively being updated and improved. These voices are based on large language models and are explicitly documented as evolving over time, including changes to quality, prosody, and sometimes timbre.

Key points relevant to the change being observed:

Multilingual and HD voices are versioned and updated
- Azure Speech release notes show ongoing updates to multilingual and Dragon HD voices (for example, multiple rounds of “quality improved and bugs fixed,” new HD versions, and Omni variants).
- HD and “Latest” model suffixes (such as DragonHDLatestNeural, DragonHDOmniLatestNeural, and other Latest voices) are specifically called out as models that can be updated over time for better quality and features. This can result in audible changes in pitch, tone, and expressiveness while keeping the same voice name.
Multilingual voices share a unified model across many locales
- Multilingual voices (including VivienneMultilingualNeural) are backed by a single multilingual model that supports many locales from the same base voice.
- When that underlying multilingual model is updated, the change affects all supported languages at once (for example, Spanish and English), which matches the behavior being described.
Voice behavior can differ between “standard” and HD/Omni variants
- Release notes show that some voices now have Dragon HD or Dragon HD Omni variants (for example, …:DragonHDLatestNeural or …:DragonHDOmniLatestNeural) that maintain the same persona but with different acoustic behavior.
- If an integration (such as Clipchamp or an Azure region/endpoint) has been switched to a newer “Latest” or HD/Omni-backed implementation behind the same display name, the audible result can sound deeper or more expressive even though the name has not changed.
“Latest” models are explicitly not fixed
- Documentation for personal/custom voices notes that …LatestNeural models are updated from time to time and that behavior can change with those updates; a fixed version requires using a versioned model name instead of Latest.
- The same principle applies to base multilingual/HD voices: when using a Latest-style or non-versioned multilingual voice, changes like the one being observed are expected over time as the service is improved.

Given the above, the change in VivienneMultilingualNeural’s sound since mid‑March is consistent with a backend model update rather than a misconfiguration on a single tenant. The same updated behavior would appear across Azure Speech and any integrated products (such as Clipchamp) that call the same voice name.

To restore a previous sound, the only supported path would be to:

Use a versioned voice model if/when Microsoft exposes one for that persona, or
Create a custom/personal voice based on a base model whose version is controlled, then keep that version fixed instead of relying on a Latest/evolving model.

For an issue that feels like a regression (for example, “sounds terrible” or clearly worse quality), the recommended next step is to open a support ticket with Azure Support, referencing the specific voice name, region, and approximate date when the change was noticed, so the product group can review the recent deployment for that voice.

References:

Share via

Why have they changed Vivienne Multilingual's voice?

1 answer

Your answer