Feature: image input for VLMs #34

yagil · 2025-03-09T16:24:19Z

Add the Python equivalent of lmstudio-js's image input support: https://lmstudio.ai/docs/typescript/llm-prediction/image-input

ncoghlan · 2025-03-10T04:04:52Z

Breaking down the individual pieces of this:

Add prepare_image to the files session API
Make the client files session attribute public
Add a top-level prepare_image convenience API for easy interactive use
Make the FileHandle type public
Make the images parameter in Chat.add_user_message public, but without the implicit local file input support
Drop the implicit local file handling from Chat instances entirely (as we're going to take a different approach to the chat session history management convenience API)
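
For illustration, the pieces above might fit together roughly as follows. This is a minimal sketch rather than the final API: it assumes the top-level prepare_image, Chat.add_user_message(..., images=...), and a top-level llm() model accessor are exposed under those names, and describe_image is a hypothetical helper invented for this example. The SDK import is deferred to call time so the snippet loads even where the SDK is not installed.

```python
from typing import Any


def describe_image(
    image_path: str,
    model_key: str,
    prompt: str = "Describe this image.",
) -> Any:
    """Send one local image plus a text prompt to a vision-capable model.

    Hypothetical helper: the lmstudio names used below are assumptions
    based on the API pieces listed in this issue.
    """
    # Deferred import so this module can be loaded without the SDK.
    import lmstudio as lms

    # Explicit preparation step: turn the local file into a file handle.
    image_handle = lms.prepare_image(image_path)

    # Attach the handle explicitly; the chat does no implicit file loading.
    chat = lms.Chat()
    chat.add_user_message(prompt, images=[image_handle])

    # Assumed model accessor; the key must name a loaded or loadable VLM.
    model = lms.llm(model_key)
    return model.respond(chat)
```

Keeping the preparation step explicit matches the last item in the list: the chat history only ever receives ready-made file handles, so it never needs to read local files itself.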

ncoghlan · 2025-03-11T15:56:11Z

#36 makes the APIs for adding file handles to chat history instances public.

Passing file handles to Chat.from_history/.add_entry/.append still requires the full multi-part user message format for now. It seems preferable to wait for lmstudio-ai/lmstudio-js#270 to be considered first, rather than independently replicating the required input types on the Python side.

ncoghlan · 2025-03-11T16:36:20Z

While not directly part of this issue, #37 makes the file preparation interface public, which will be used as the base for the image preparation interface.

ncoghlan · 2025-03-11T17:37:00Z

#38 adds the image-specific preparation APIs.

ncoghlan · 2025-03-13T15:27:25Z

The draft SDK docs PR is up at lmstudio-ai/docs#48.

ncoghlan self-assigned this Mar 10, 2025

ncoghlan added a commit that referenced this issue Mar 11, 2025

Remove implicit file handle caching from history.Chat

ae434a3

Preparatory refactoring for #34

ncoghlan mentioned this issue Mar 11, 2025

Remove implicit file handle caching from history.Chat #36

Merged

ncoghlan added a commit that referenced this issue Mar 11, 2025

Remove implicit file handle caching from history.Chat (#36)

5aeb367

Preparatory refactoring for #34

ncoghlan added a commit that referenced this issue Mar 11, 2025

Add image preparation APIs

aecd0f4

Implements the remaining components of #34

ncoghlan added a commit that referenced this issue Mar 11, 2025

Add image preparation APIs

e4f2456

Also make debug logging for file uploads less noisy. Implements the remaining components of #34

ncoghlan mentioned this issue Mar 11, 2025

Add image preparation APIs #38

Merged

ncoghlan added a commit that referenced this issue Mar 11, 2025

Add image preparation APIs (#38)

972f35b

Also make debug logging for file uploads less noisy. Implements the remaining components of #34

ncoghlan added a commit that referenced this issue Mar 11, 2025

Publish file and image prep convenience APIs

833bafe

Part of #34

ncoghlan mentioned this issue Mar 11, 2025

Publish file and image prep convenience APIs #39

Merged

ncoghlan added a commit that referenced this issue Mar 11, 2025

Publish file and image prep convenience APIs (#39)

27f1b0b

Part of #34
