
[FEATURE] Add Xinference implementation #1179 #25

Merged
merged 21 commits into langchain4j:main on Dec 5, 2024

Conversation

alvinlee518 (Contributor)

Issue

Contributes to langchain4j/langchain4j#1179

Change

Support for Xorbits Inference; a brief usage sketch follows the list below.

  • Added a XinferenceChatModel
  • Added a XinferenceEmbeddingModel
  • Added a XinferenceImageModel
  • Added a XinferenceLanguageModel
  • Added a XinferenceScoringModel
  • Added a XinferenceStreamingChatModel
  • Added a XinferenceStreamingLanguageModel
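
For orientation (not part of this PR's text), here is a minimal usage sketch of the new chat model, assuming a builder-style API consistent with langchain4j's other model integrations; the endpoint and model name are placeholders:

```java
import dev.langchain4j.community.model.xinference.XinferenceChatModel;
import dev.langchain4j.model.chat.ChatLanguageModel;

public class XinferenceQuickstart {

    public static void main(String[] args) {
        // Placeholder endpoint and model name; builder options are assumed to
        // follow the pattern of other langchain4j model integrations.
        ChatLanguageModel model = XinferenceChatModel.builder()
                .baseUrl("http://localhost:9997") // Xinference's default port
                .modelName("qwen2-instruct")      // any chat model launched in Xinference
                .build();

        System.out.println(model.generate("Hello!"));
    }
}
```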

General checklist

  • There are no breaking changes
  • I have added unit and integration tests for my change
  • I have manually run all the unit tests in all modules, and they are all green
  • I have manually run all integration tests in the module I have added/changed, and they are all green

Checklist for adding new maven module

  • I have added my new module in the root pom.xml and langchain4j-community-bom/pom.xml

Martin7-1 added the labels enhancement (New feature or request), P3 (Medium priority), and theme: model (Issues/PRs related to model) on Nov 27, 2024
Martin7-1 (Collaborator)

@alvinlee518 Hi! Thank you for your contribution. I will review it ASAP.

BTW, it looks like Xinference supports running in a Docker container. Would it be possible to use Testcontainers in the ITs?
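
A minimal sketch of what that might look like, assuming the public xprobe/xinference image and Xinference's default port 9997 (both assumptions, not confirmed in this thread):

```java
import org.testcontainers.containers.GenericContainer;

// Assumed image name, port, and startup command; adjust to whatever the module actually uses.
GenericContainer<?> xinference = new GenericContainer<>("xprobe/xinference:latest")
        .withExposedPorts(9997)
        .withCommand("xinference-local", "-H", "0.0.0.0");
xinference.start();

// Build the base URL the model classes under test would point at.
String baseUrl = "http://" + xinference.getHost() + ":" + xinference.getMappedPort(9997);
```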

Martin7-1 (Collaborator) commented Nov 28, 2024

BTW, could you make all static method calls use static imports (e.g. InternalXinferenceHelper.toTools -> toTools)? Just for consistency with other modules.
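
For example (the call site here is illustrative, not from the diff):

```java
// Before: qualified static call
var tools = InternalXinferenceHelper.toTools(toolSpecifications);

// After, with
// import static dev.langchain4j.community.model.xinference.InternalXinferenceHelper.toTools;
var tools = toTools(toolSpecifications);
```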

Also, could you please add tests for ChatModelListener, like OllamaChatModelListenerIT?
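
A rough sketch of such a test, assuming the builder exposes a listeners(...) setter like other modules and using a hypothetical environment variable for the endpoint:

```java
import java.util.concurrent.atomic.AtomicReference;

import dev.langchain4j.community.model.xinference.XinferenceChatModel;
import dev.langchain4j.model.chat.ChatLanguageModel;
import dev.langchain4j.model.chat.listener.ChatModelListener;
import dev.langchain4j.model.chat.listener.ChatModelRequestContext;
import org.junit.jupiter.api.Test;

import static java.util.Collections.singletonList;
import static org.assertj.core.api.Assertions.assertThat;

class XinferenceChatModelListenerIT {

    @Test
    void should_invoke_listener_on_request() {
        // Capture the request context the listener receives.
        AtomicReference<ChatModelRequestContext> captured = new AtomicReference<>();
        ChatModelListener listener = new ChatModelListener() {
            @Override
            public void onRequest(ChatModelRequestContext context) {
                captured.set(context);
            }
        };

        ChatLanguageModel model = XinferenceChatModel.builder()
                .baseUrl(System.getenv("XINFERENCE_BASE_URL")) // hypothetical env var
                .modelName("qwen2-instruct")                   // placeholder model name
                .listeners(singletonList(listener))            // assumed setter, as in other modules
                .build();

        model.generate("hello");

        assertThat(captured.get()).isNotNull();
    }
}
```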

alvinlee518 (Contributor, Author)

> BTW, could you make all static method calls use static imports (e.g. InternalXinferenceHelper.toTools -> toTools)? Just for consistency with other modules.
>
> Also, could you please add tests for ChatModelListener, like OllamaChatModelListenerIT?

Okay, got it.

alvinlee518 (Contributor, Author)

> @alvinlee518 Hi! Thank you for your contribution. I will review it ASAP.
>
> BTW, it looks like Xinference supports running in a Docker container. Would it be possible to use Testcontainers in the ITs?

It's a bit tricky for me to test the Docker container locally, but I can try adding it.

alvinlee518 (Contributor, Author)

@Martin7-1 The Xinference Testcontainers setup requires GPU support. How can I configure it in GitHub Actions to ensure the tests pass?

Martin7-1 (Collaborator)

Since we're using the free tier, GitHub Actions does not provide GPUs... Is there any way to run Xinference on CPU (such as a latest-cpu image)? BTW, could we use smaller models to reduce the IT cost? (I don't know how long your tests take with a GPU locally, but on CPU they may take much longer, so we should make the models as small as possible.)

If the tests take too long to run on CPU, maybe I'll consider disabling Xinference's ITs in GitHub Actions and just running them locally.

Xinference's ITs are like Ollama's ITs in that both take a long time to execute, and I'm also trying to figure out how to optimise them so they work well in GitHub Actions. You can try the ideas I suggested above, but if they don't work, that's fine; I'd run those test cases locally. I think I will review this next week :)
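
If it comes to that, one common pattern is to gate the IT behind an opt-in environment variable (the variable name below is hypothetical):

```java
import org.junit.jupiter.api.condition.EnabledIfEnvironmentVariable;

// Runs only when XINFERENCE_IT is set to "true",
// e.g. on a local machine with Docker and a GPU; skipped in GitHub Actions.
@EnabledIfEnvironmentVariable(named = "XINFERENCE_IT", matches = "true")
class XinferenceChatModelIT {
    // ...
}
```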

alvinlee518 (Contributor, Author)

@Martin7-1 I have switched the image to the CPU version. When executed in GitHub Actions, the entire run takes approximately 10 minutes. ImageModel, VisionModel, and the tool-call streaming functions rely on GPUs and can be validated locally.

Martin7-1 (Collaborator)

> @Martin7-1 I have switched the image to the CPU version. When executed in GitHub Actions, the entire run takes approximately 10 minutes. ImageModel, VisionModel, and the tool-call streaming functions rely on GPUs and can be validated locally.

Thank you! Will try to review it this week.

Martin7-1 (Collaborator) left a comment

@alvinlee518 Thank you!

alvinlee518 requested a review from Martin7-1 on December 4, 2024
Martin7-1 (Collaborator) left a comment

@alvinlee518 Thank you! About the ITs: maybe we should give up committing a new image, because too many images will exceed the GitHub Actions disk space limit. WDYT?

Martin7-1 (Collaborator) left a comment

Thank you! Would you mind removing all method-level final modifiers where they're not really needed?
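
That is, dropping final on locals and parameters where it carries no semantic weight; the method below is purely illustrative:

```java
// Before (illustrative):
static String joinNames(final List<String> names) {
    final StringBuilder sb = new StringBuilder();
    for (final String name : names) {
        sb.append(name);
    }
    return sb.toString();
}

// After: identical semantics, less noise
static String joinNames(List<String> names) {
    StringBuilder sb = new StringBuilder();
    for (String name : names) {
        sb.append(name);
    }
    return sb.toString();
}
```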

alvinlee518 (Contributor, Author)

> @alvinlee518 Thank you! About the ITs: maybe we should give up committing a new image, because too many images will exceed the GitHub Actions disk space limit. WDYT?

@Martin7-1 The new image is intended to avoid re-downloading the model. If you're concerned about the disk space limit, I can remove it.
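
If the committed image is dropped, a possible alternative for local runs is Testcontainers' opt-in container reuse, which keeps the container, and the model it has downloaded, alive between runs; note it doesn't help on ephemeral CI runners. A sketch, with the image tag taken from the discussion above:

```java
import org.testcontainers.containers.GenericContainer;

// Requires testcontainers.reuse.enable=true in ~/.testcontainers.properties.
GenericContainer<?> xinference = new GenericContainer<>("xprobe/xinference:latest-cpu")
        .withExposedPorts(9997) // assumed default Xinference port
        .withReuse(true);       // container survives across local test runs
xinference.start();
```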

lixw and others added 2 commits on December 5, 2024:
2. remove final on local variables and parameters
Martin7-1 (Collaborator) left a comment

@alvinlee518 Thank you! LGTM. Could you please add docs to the langchain4j main repo?

alvinlee518 (Contributor, Author)

> @alvinlee518 Thank you! LGTM. Could you please add docs to the langchain4j main repo?

Okay, I'll add it later.

Martin7-1 merged commit 481a1e2 into langchain4j:main on Dec 5, 2024
4 checks passed
langchain4j pushed a commit to langchain4j/langchain4j that referenced this pull request on Jan 8, 2025:
> ## Issue
> Closes [langchain4j-community #25](langchain4j/langchain4j-community#25)
>
> ## Change
> Add `langchain4j-community-xinference` document.
>
> ## General checklist
> * [x] There are no breaking changes
> * [ ] I have added unit and integration tests for my change
> * [ ] I have manually run all the unit and integration tests in the module I have added/changed, and they are all green
> * [ ] I have manually run all the unit and integration tests in the [core](https://github.com/langchain4j/langchain4j/tree/main/langchain4j-core) and [main](https://github.com/langchain4j/langchain4j/tree/main/langchain4j) modules, and they are all green
>
> * [x] I have added/updated the [documentation](https://github.com/langchain4j/langchain4j/tree/main/docs/docs)
> * [ ] I have added an example in the [examples repo](https://github.com/langchain4j/langchain4j-examples) (only for "big" features)
> * [ ] I have added/updated [Spring Boot starter(s)](https://github.com/langchain4j/langchain4j-spring) (if applicable)

Co-authored-by: lixw <>