An Azure service that provides customers with a serverless container experience.
The described behavior matches intermittent performance issues on the compute side rather than a pure network fault.
In similar Azure containerized workloads, intermittent timeouts and sharp spikes in latency are commonly caused by:
- Component or application performance problems inside the container (for example, CPU or memory pressure, garbage collection, or restarts), which manifest as timeouts at the client.
- Platform limits such as SNAT port exhaustion when many outbound connections are opened in a short time window.
From the available guidance:
- Intermittent timeouts with an otherwise healthy network are typically linked to component performance issues. The recommended approach is to inspect resource usage and health from the “inside out”: check CPU/memory usage and container restarts, and review logs around the time of the spikes to see whether the application is recycling or entering a CrashLoopBackOff‑like pattern.
- For services that make many outbound calls, sudden latency spikes and timeouts can also be caused by exhausting SNAT ports. A rapid succession of client or container requests to the same backend host and port can consume the preallocated SNAT ports. Until ports are reclaimed (after several minutes), new connections may hang or time out.
Based on this, the next steps are:
- During the top‑of‑hour spikes, inspect container metrics (CPU, memory, restarts) and logs to see whether the application is under heavy load, recycling, or throwing errors.
- If the containers call external services, review connection patterns (for example, many short‑lived connections to the same endpoint) and consider connection pooling or reducing connection churn to avoid SNAT exhaustion.
- Correlate timeout periods with any scheduled jobs or batch workloads that run at the beginning of each hour, as these may be triggering the bursts in resource usage or outbound connections.
References: