Are Closed Captions and Real-Time Transcription Billed Separately or Together?

Daniel Li 20 Reputation points
2025-12-24T21:57:41.2966667+00:00

I'm asking because I've noticed that these two features seem very similar in both functionality and outputs.

If Closed Captions and real-time transcription are enabled simultaneously on the same Azure Communication Services meeting, will Azure charge twice, or is the speech recognition billed only once?

Do Closed Captions and real-time transcription share the same Speech-to-Text session, or are they metered separately?

Why is there a cost difference between the two features?

I also noticed that real-time transcription returns only final results, while Closed Captions return both partial and final results. Is it possible to get partial results for real-time transcription as well?

References:

https://learn.microsoft.com/en-us/azure/communication-services/how-tos/call-automation/real-time-transcription-tutorial?pivots=programming-language-csharp

https://learn.microsoft.com/en-us/azure/communication-services/concepts/voice-video-calling/closed-captions

| Service | Price |
| --- | --- |
| Closed Captions | $0.021/min ($1.26/hour) |
| Standard Transcription (Real-time Transcription) | $1 per hour |
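
To make the comparison concrete, here is a minimal sketch of the arithmetic behind the rates quoted above. It assumes the two features are metered independently, which is exactly what this question asks and is not confirmed. The function names are illustrative only and are not part of any Azure SDK.

```python
from decimal import Decimal

# Rates as quoted in this question; check the current pricing pages before relying on them.
CLOSED_CAPTIONS_PER_MIN = Decimal("0.021")       # USD per minute
REALTIME_TRANSCRIPTION_PER_HOUR = Decimal("1")   # USD per hour


def closed_captions_cost(minutes: int) -> Decimal:
    """Cost of Closed Captions for a meeting of the given length."""
    return CLOSED_CAPTIONS_PER_MIN * minutes


def transcription_cost(minutes: int) -> Decimal:
    """Cost of real-time transcription, converting the hourly rate to minutes."""
    return REALTIME_TRANSCRIPTION_PER_HOUR * minutes / 60


def combined_cost_if_metered_separately(minutes: int) -> Decimal:
    """Total cost ASSUMING both features are billed independently (unconfirmed)."""
    return closed_captions_cost(minutes) + transcription_cost(minutes)


if __name__ == "__main__":
    for label, fn in [
        ("Closed Captions", closed_captions_cost),
        ("Real-time transcription", transcription_cost),
        ("Both, if metered separately", combined_cost_if_metered_separately),
    ]:
        print(f"{label}: ${fn(60):.2f} per 60 minutes")
```

If speech recognition is instead billed only once when both features are enabled, the combined figure would not apply, and the actual charge would depend on which meter is used.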
Azure Communication Services