Hello @Xiwei Zhou,
I am converting the comment to an answer so that it helps the community find a better solution.
Issue:
The DeepSeek-V3 and DeepSeek-V3-0324 models were returning timeouts or internal server errors for non-streaming (stream=False) requests. However, streaming (stream=True) requests continued to work as expected.
Solution:
This issue appears to be transient and related to temporary backend unavailability. If you encounter similar errors with non-streaming requests (such as timeouts or internal server errors), please try the following steps:
- Retry the request after a short wait, as the service may take some time to become fully available after deployment.
- If the issue persists beyond a reasonable period, capture the apim-request-id from the response headers and share it with Azure support for further investigation.
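The two steps above can be sketched in Python as follows. This is only a minimal illustration, not an official client: the RequestFailed exception, the call_with_retry helper, the set of retryable status codes, and the delay values are all assumptions made for the example. Only the apim-request-id header name comes from the steps above. The send argument stands for whatever function issues your non-streaming request.

```python
import time

# Assumption: status codes treated as transient (timeouts / internal server errors).
RETRYABLE_STATUSES = {408, 500, 502, 503, 504}


class RequestFailed(Exception):
    """Hypothetical error carrying the HTTP status and response headers."""

    def __init__(self, status, headers=None):
        super().__init__(f"HTTP {status}")
        self.status = status
        self.headers = headers or {}


def call_with_retry(send, max_attempts=4, base_delay=2.0, sleep=time.sleep):
    """Call send() and retry transient failures with exponential backoff.

    If every attempt fails, raise an error that includes the last
    apim-request-id seen, so it can be shared with Azure support.
    """
    last_request_id = None
    for attempt in range(1, max_attempts + 1):
        try:
            return send()
        except RequestFailed as err:
            if err.status not in RETRYABLE_STATUSES:
                raise  # not a transient failure, so retrying will not help
            last_request_id = err.headers.get("apim-request-id")
            if attempt == max_attempts:
                break
            # Wait 2s, 4s, 8s, ... before the next attempt.
            sleep(base_delay * 2 ** (attempt - 1))
    raise RuntimeError(
        f"Request still failing after {max_attempts} attempts; "
        f"apim-request-id={last_request_id}"
    )
```

In practice, send would wrap your own HTTP or SDK call and raise RequestFailed with the response status and headers when the request fails.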
Please accept the solution and give feedback by clicking Yes.
Thank you