Share via

We are getting OpenAI Internal Server errors with no useful details

Alex Rosen 10 Reputation points
2026-01-06T16:37:01.8666667+00:00

Our application shows 13 errors like the below within 110 minutes (9:20 AM EST to 11:10 AM EST). Is this a transient issue or something more significant?

Not all requests are failing. The failures do not seem to be only with a single prompt. The prompts and deployment did not change within 8 hours before the errors began.

openai.InternalServerError: Error code: 500 - {'error': {'message': 'The server had an error processing your request. Sorry about that! You can retry your request, or contact us through an Azure support request at: https://go.microsoft.com/fwlink/?linkid=2213926 if you keep seeing this error. (Please include the request ID b2*** in your email.)', 'type': 'server_error', 'param': None, 'code': None}}

I can share more request IDs if that will help.

It has now been 25 minutes since the last error. That is the longest period of no errors since this began. Please investigate to confirm this issue was transient.

Azure OpenAI Service
Azure OpenAI Service

An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.


3 answers

Sort by: Most helpful
  1. Alex Rosen 10 Reputation points
    2026-01-06T17:06:36.46+00:00

    Azure should investigate immediately to determine how widespread the problem is. The status page does not show any issues for Azure OpenAI Service in East US 2, but it appears that there is one.

    This intermittent error continues. It's now been happening for over 100 minutes.

    1 person found this answer helpful.
    0 comments No comments

  2. Alex Rosen 10 Reputation points
    2026-01-06T19:53:27.8066667+00:00

    I increased retries from the client's default (3) to 5 and added timeouts. The Azure OpenAI Service availability returned to 100 right around the time this was pushed to production. It is unclear why we have not had these issues until today and whether the increased retries and timeouts should be necessary. They may stop the Azure OpenAI Service from having Internal Server Errors, but it seems that they also increase the latency.Screenshot 2026-01-06 144627

    AzureOpenAI(
            azure_endpoint=azure_endpoint,
            api_key=azure_key,
            api_version="2025-04-01-preview",
            max_retries=5,
            timeout=httpx.Timeout(
                600.0,      # Total timeout: 10 minutes
                connect=10.0,  # Connection timeout: 10 seconds
                read=300.0,    # Read timeout: 5 minutes
                write=30.0     # Write timeout: 30 seconds
            )
        )
    

  3. Anshika Varshney 9,740 Reputation points Microsoft External Staff Moderator
    2026-01-06T19:23:32.6066667+00:00

    Hi Alex Rosen,

    Thanks for reporting this. The OpenAI “Internal Server Error” (500) usually indicates a temporary platform-side issue or an interruption while the model is processing the request. It is not caused by your configuration.

    Here’s what you can try:

    1. Retry the request after a short time These errors often resolve automatically when the backend stabilizes.

    2. Reduce the request size If your request includes very large prompts, long conversations, or large attached documents, try sending a smaller input. Large payloads can sometimes cause transient 500 errors.

    3. Check for known service incidents If the issue continues, verify whether there is an active outage or degradation in your region.

    This behavior typically indicates a temporary internal service condition, and based on similar reports, it should stabilize soon.

    Let me know if you still see errors after retrying. I’ll be happy to help further.

    Thankyou!


Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.