
Difference in output when running via Transformers.js and when hosting on Hugging Face #46

Open
jtmuller5 opened this issue Feb 10, 2024 · 1 comment

Comments

@jtmuller5

jtmuller5 commented Feb 10, 2024

I created an application that uses the UAE-large-V1 model inside Transformers.js and was able to embed sentences in a browser without issues. The model would return a single vector for a single input:

import { pipeline } from "@xenova/transformers";

const extractor = await pipeline("feature-extraction", "WhereIsAI/UAE-Large-V1", {
  quantized: true,
});

const result = await extractor(text, { pooling: "mean", normalize: true });

When I hosted the model on Hugging Face using their Inference Endpoints solution, it no longer works as expected. Instead of returning a single vector, it returns a variable number of 1024-dimensional vectors.

Sample input:

{
   "inputs":  "Where are you"
}

This returns a list of lists of lists of numbers.

Is there a way to make the hosted model return a single vector? And why does the model act differently based on where it's hosted?

@SeanLee97
Owner

That is strange. It should return a single vector, because you have specified mean pooling.

You could ask for help in the Transformers.js project, because I am unfamiliar with it. Sorry about that.
