Share via

The services in ACI are often impacted, which will cause a timeout.

Liang Zhang 0 Reputation points Microsoft Employee
2026-06-03T07:36:01.25+00:00

1.There is a sharp fluctuation in the execution time curve at the beginning of every hour.
User's image

2.Sometimes, several server instances suddenly operated more slowly, but they automatically resumed afterward as follows.
User's image

Azure Container Instances

1 answer

Sort by: Most helpful
  1. AI answer

    2026-06-03T07:36:32.21+00:00

    The described behavior matches intermittent performance issues on the compute side rather than a pure network fault.

    In similar Azure containerized workloads, intermittent timeouts and sharp spikes in latency are commonly caused by:

    1. Component or application performance problems inside the container (for example, CPU or memory pressure, garbage collection, or restarts), which manifest as timeouts at the client.
    2. Platform limits such as SNAT port exhaustion when many outbound connections are opened in a short time window.

    From the available guidance:

    • Intermittent timeouts with an otherwise healthy network are typically linked to component performance issues. The recommended approach is to inspect resource usage and health from the “inside out”: check CPU/memory usage and container restarts, and review logs around the time of the spikes to see whether the application is recycling or entering a CrashLoopBackOff‑like pattern.
    • For services that make many outbound calls, sudden latency spikes and timeouts can also be caused by exhausting SNAT ports. A rapid succession of client or container requests to the same backend host and port can consume the preallocated SNAT ports. Until ports are reclaimed (after several minutes), new connections may hang or time out.

    Based on this, the next steps are:

    1. During the top‑of‑hour spikes, inspect container metrics (CPU, memory, restarts) and logs to see whether the application is under heavy load, recycling, or throwing errors.
    2. If the containers call external services, review connection patterns (for example, many short‑lived connections to the same endpoint) and consider connection pooling or reducing connection churn to avoid SNAT exhaustion.
    3. Correlate timeout periods with any scheduled jobs or batch workloads that run at the beginning of each hour, as these may be triggering the bursts in resource usage or outbound connections.

    References:

    AI-generated content may be incorrect. Read our transparency notes for more information.

    Was this answer helpful?

Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.