Share via

Batch jobs are stuck on "validating" and then all fail afterwards

Kiptoo Towett 0 Reputation points Microsoft Employee
2026-03-13T12:27:54.6866667+00:00

User's image

Batch jobs are stuck on "validating" and then all fail afterwards

Azure OpenAI Service
Azure OpenAI Service

An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.

{count} votes

1 answer

Sort by: Most helpful
  1. Manas Mohanty 15,795 Reputation points Microsoft External Staff Moderator
    2026-03-26T20:10:57.3833333+00:00

    Hi Kiptoo Towett

    Thank you for confirming that issue is mitigated from backend now.

    Re-attached above as work around in case of temporary regional outages.Reduce full batch to mini batches

    1. Test in other regions (East US and Sweden Central are in demand regions)
    2. Leverage multiple batch deployment to reduce latency and throughput.
    3. Rest API commands were shared to cancel existing batch job if it is stalled for long. https://learn.microsoft.com/en-us/rest/api/azureopenai/batch/cancel?view=rest-azureopenai-2024-10-21&tabs=HTTP

    We can setup monitor the events from log analytics workspace or set up a monitor in SDK.

    if the processing time is more than usual processing time, it should execute above steps as precautionary measure.

    Please take a minute to accept this answer if you found my recommendation helpful.

    Thank you.

    0 comments No comments

Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.