
Set connecttimeout with user option (Fixes #318) #321

Open
wants to merge 3 commits into main

Conversation

CorradoLanera

The suggestion in #318 (comment) by @hadley solves #318: my reported query succeeded, as did all the other queries of mine that had previously failed.

I've:

  • added the corresponding code in the same place as `timeout`;
  • included the corresponding NEWS item.
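
For reference, the connect timeout maps onto libcurl's `connecttimeout` option, which httr2 exposes through `req_options()`. A minimal sketch of the idea, where the `getOption()` names and default values are hypothetical stand-ins rather than the exact code in this PR:

```r
library(httr2)

# Sketch: bound the connection phase separately from the total timeout.
# The option names "ellmer.timeout_s" / "ellmer.connecttimeout_s" are
# hypothetical, shown only to illustrate a user-configurable default.
req <- request("https://api.openai.com/v1/chat/completions") |>
  req_timeout(getOption("ellmer.timeout_s", 60)) |>
  req_options(connecttimeout = getOption("ellmer.connecttimeout_s", 10))
```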

Collaborator

@atheriel left a comment


Looks right to me.

@atheriel
Collaborator

Many of the snapshots need to be updated with this change, unfortunately.

Co-authored-by: Aaron Jacobs <[email protected]>
@hadley
Member

hadley commented Feb 12, 2025

@atheriel what are you checking for in the tests when you print the request object? Maybe it would be better to test it with req_dry_run()?

@atheriel
Collaborator

I was largely testing if the body and headers look right. Very open to suggestions on a better way to do that.

@hadley
Member

hadley commented Feb 12, 2025

@atheriel maybe with req_dry_run()? There are two current drawbacks to using it:

  1. It's slow because it starts and stops an httpuv server on each run.
  2. It includes a few headers that are known to change each run (like Date).

It wouldn't be hard to fix those problems in httr2.
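
For context, a snapshot test built on `req_dry_run()` might look roughly like this; the request contents are invented for illustration, and as noted above, Date-style headers would still vary between runs:

```r
library(httr2)
library(testthat)

test_that("request body and headers look right", {
  req <- request("http://example.com/v1/chat/completions") |>
    req_headers(Authorization = "Bearer <redacted>") |>
    req_body_json(list(model = "gpt-4o", stream = FALSE))

  # req_dry_run() sends the request to a local httpuv server and
  # prints the raw HTTP message, which expect_snapshot() can capture.
  expect_snapshot(req_dry_run(req))
})
```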

@hadley
Member

hadley commented Mar 6, 2025

I'm not convinced that this is the correct solution because it means that (e.g.) chat_openai(base_url = "https://doesntexist.com")$chat("Hi") will take 60s to time out.

@schelhorn

schelhorn commented Mar 17, 2025

I just wanted to add as a user perspective here: the flexible timeout setting is very important when using local models (as for instance served by llama.cpp's llama-server).

The reason is that even with Metal execution on an M1 Max, larger models such as the new gemma-3-27b need more than the default 60-second {httr2} timeout to generate a complete response when tool use via {ragnar} is specified.

This is because, as far as I could find out, tool use does not work with a streaming response, so I have to use ellmer::chat_openai with echo = 'none'. Consequently, llama-server sends back the response only once it has been fully generated, which often takes longer than 60 seconds for a 2048-token response, and in the meantime {httr2} times out.

So I (and probably other local LLM users) would certainly appreciate some sort of configurable option here, since unfortunately neither {httr2} nor {curl} allows setting a global timeout option in the user's R session.
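
As a stopgap under those constraints, the total timeout can at least be raised per request when building one with httr2 directly; a sketch, with the 300-second bound chosen arbitrarily for illustration:

```r
library(httr2)

# Slow, non-streaming local generation: raise libcurl's total timeout
# for this one request instead of relying on a global setting.
req <- request("http://localhost:8080/v1/chat/completions") |>
  req_body_json(list(model = "gemma-3-27b", stream = FALSE)) |>
  req_timeout(300)
```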

@hadley
Member

hadley commented Mar 17, 2025

@schelhorn I get that, but I'm not convinced that this is the right timeout to set: even if it takes llama-server a long time to generate the complete response, it should still connect quickly.
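
The distinction here corresponds to two separate libcurl options, which httr2 lets you set independently; a sketch with illustrative values:

```r
library(httr2)

req <- request("http://localhost:8080/v1/chat/completions") |>
  # Bound only the TCP/TLS connection phase: a dead host fails fast.
  req_options(connecttimeout = 10) |>
  # Bound the whole transfer: generation may legitimately take minutes.
  req_timeout(600)
```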
