APIM OpenAI loadbalancer - requests go to only two different locations

70708851 20 Reputation points
2025-04-14T14:03:28.8233333+00:00

Hello Community,

my question is regarding to Azure APIM, which I use as a Azure OpenAI loadbalancer.

My aoai backends are from regions -

Sweden Central

Germany West Central

Poland Central

Italy North

All of them have the identical model deployments Data Zone Standard.

User's image

(pic: The request was sent to GWC region but was redirected to the West Europe cluster)

When I run the APIM test "POST Creates a completion for a text message", the requests always end up in just two regions – West Europe and Sweden Central.

Which brings me to my question: why is that? Why the requests never go to Germany West Central, Poland, or Italy? Could it be due to low availability in those regions?

Thank you.

Azure API Management
Azure API Management
An Azure service that provides a hybrid, multi-cloud management platform for APIs.
2,371 questions
{count} votes

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.