Add configurable timeout for passthrough endpoints#2
Add configurable timeout for passthrough endpoints#2tanmaykm wants to merge 2 commits intov1.82.1_with_jh_fixesfrom
Conversation
|
@claude review this |
mortenpi
left a comment
There was a problem hiding this comment.
I see the timeout value passed through a lot of the functions, but where do we actually get the initial value? It should be consistent with the existing timeouts, right?
There was a problem hiding this comment.
Why the package-lock.json changes?
|
I would also suggest contributing this to the upstream.
If it's unrelated, then probably better to have it as a separate PR, with its own motivation. |
|
Yes, will remove the ConnectionTimeoutError/SocketTimeoutError and the package-lock changes. The initial timeout value is retained as default in |
But we'd like it to use the |
- Fix hardcoded 600-second timeout on pass-through endpoints that caused TimeoutError for long-running streaming requests (e.g. Anthropic API calls with extended thinking/tool use) - Add timeout field to PassThroughGenericEndpoint config, allowing per-endpoint timeout configuration - Added tests
8f1341e to
946465d
Compare
Makes the Anthropic adapter path respect --request_timeout, litellm_settings.request_timeout, and per-deployment timeout settings. This matches the pattern used by every other handler in the file (responses API, completion, embedding, etc.
|
Closing and working on a different fix |
Ref: https://github.com/JuliaComputing/JuliaHub/issues/21622