
Add a non-token approach for OpenAI #16

Open
martinezpl wants to merge 4 commits into main

Conversation

@martinezpl commented May 6, 2023

Hello! As per the interest shown in #2, I'd like to propose a variation for OpenAI models. It makes it possible to use OpenAI models with jsonformer without changing existing code.

Summary

  • Added the possibility of filling JSONs through calls to a non-chat OpenAI completion model (seems to work best with curie)
  • Two new classes: OpenAIModel and JsonformerNoTokens, which is essentially the original stripped of the tokenizer
  • Instead of using logits in generate_boolean and generate_array, we get the next most likely token from logprobs (see the sketch below)
  • As a substitute for stopping criteria, the stop sequence ", is used to limit the model's response to a single value
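
To make the logprobs idea concrete, here is a minimal sketch (not the PR's exact code) of how a boolean can be picked from top logprobs and how the stop sequence limits a completion to a single value, using the 2023-era openai-python (< 1.0) Completion API; the helper names and the token heuristic are illustrative.

```python
import openai

# Minimal sketch of the logprobs idea, not the PR's exact code.
# Uses the 2023-era openai-python (< 1.0) Completion API.
def generate_boolean_sketch(prompt: str, model: str = "text-curie-001") -> bool:
    response = openai.Completion.create(
        model=model,
        prompt=prompt,
        max_tokens=1,
        temperature=0.0,
        logprobs=5,  # return the top-5 candidate tokens with their logprobs
    )
    # top_logprobs[0] maps candidate tokens to logprobs for the first generated position
    top = response["choices"][0]["logprobs"]["top_logprobs"][0]
    # Simplified heuristic: pick whichever of true/false is more probable among the candidates.
    true_lp = max((lp for tok, lp in top.items() if tok.strip().lower().startswith("t")), default=float("-inf"))
    false_lp = max((lp for tok, lp in top.items() if tok.strip().lower().startswith("f")), default=float("-inf"))
    return true_lp > false_lp

def generate_string_sketch(prompt: str, model: str = "text-curie-001") -> str:
    # The stop sequence '",' ends the completion after a single string value.
    response = openai.Completion.create(
        model=model,
        prompt=prompt,
        max_tokens=64,
        temperature=0.3,
        stop=['",'],
    )
    return response["choices"][0]["text"].strip().strip('"')
```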

Why no chat?

I found the chat model to be more querulous (As an AI model I cannot blablabla...), prompt-dependent, and slow.
The solution proposed here seems to work best with text-curie-001, as it's super fast and cheap.

Perhaps somebody can figure out an effective way to utilise the chat model, but I can't see any option other than prompting it to generate the whole JSON at once, which runs completely counter to the concept of this project.

Why no tokens?

I spent some time trying to continue operating on tokens while using the API, but I encountered two issues:

  • the tokenization available in tiktoken does not seem to preserve word boundaries; for example, encoding "colors" gives two separate tokens which, decoded one by one, give "col ors". It can't work like that. (See the sketch below.)
  • chat models do not accept tokens as input

And of course, because the models run remotely, we have no access to the generation process. From my perspective, that alone renders all token operations pointless here. I've still left the tokenizer class in, just in case.
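
A quick way to see the word-boundary issue (the encoding name and the exact split are assumptions and vary by model):

```python
import tiktoken

# Illustration of the word-boundary issue; the exact split depends on the encoding.
enc = tiktoken.get_encoding("r50k_base")  # assumption: a GPT-3-era encoding
tokens = enc.encode("colors")
print(tokens)                              # more than one token id
print([enc.decode_single_token_bytes(t).decode() for t in tokens])  # e.g. ['col', 'ors']
# Decoding the ids together recovers the word, but per-token decoding loses the boundary.
print(enc.decode(tokens))                  # 'colors'
```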

How to run it?

Make sure you have the OPENAI_API_KEY environment variable set.

poetry install
poetry run python tests/test_openai.py

You'll see the JSON being filled. You can change the model used and its temperature in that file when initialising JsonformerNoTokens (see the sketch below).
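
For reference, a hypothetical sketch of what that initialisation might look like; the import path, constructor arguments, and schema are assumptions based on the description above, not the PR's exact code.

```python
# Hypothetical sketch; import path and constructor arguments are assumptions,
# not the PR's exact code.
from jsonformer import JsonformerNoTokens, OpenAIModel

schema = {
    "type": "object",
    "properties": {
        "name": {"type": "string"},
        "age": {"type": "number"},
        "is_student": {"type": "boolean"},
    },
}

model = OpenAIModel("text-curie-001")      # change the model here
jsonformer = JsonformerNoTokens(
    model=model,
    json_schema=schema,
    prompt="Generate a person's information based on the following schema:",
    temperature=0.3,                       # and the temperature here
)
print(jsonformer())
```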

@zhaochenyang20

I am just pondering that, as far as I know, the chat model is an inherently good JSON generator. GPT-3.5-turbo is derived from code-davinci-003 (Codex), which is fine-tuned on a large amount of code and is really capable of generating JSON.

@zhaochenyang20

I do have a deep interest in generating JSON output from OpenAI models. Please feel free to contact me!

@martinezpl
Author

@zhaochenyang20 I agree it is fairly good at it, but the authors of this project don't seem to be convinced that a good prompt for the chat model is enough. I assume that's based on their experience; personally, I don't know.

@Void-n-Null

Very excited to try this out!!!

@moro-n0-kimi commented May 19, 2023

Looking forward to using this to parse plain-text outputs from CoCa into JSON for image captioning.

Update: Getting really good results so far using text-davinci-003

@martinezpl
Author

Awesome to hear that @moro-no-kimi 🥳

@tv-ankur

@moro-no-kimi

Are you using jsonformer with the OpenAI model? If so, is it possible to share the code?

@martinezpl
Author

This is kind of obsolete now with the function calling feature from OpenAI.
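
For comparison, a minimal sketch of that function calling approach using the 2023-era openai-python (< 1.0) ChatCompletion API; the function name and schema are illustrative, not taken from this repo.

```python
import json
import openai

# Minimal sketch of OpenAI function calling (2023-era openai-python < 1.0);
# the function name and schema are illustrative.
response = openai.ChatCompletion.create(
    model="gpt-3.5-turbo-0613",
    messages=[{"role": "user", "content": "Generate a person's information."}],
    functions=[{
        "name": "make_person",
        "description": "Produce a person record.",
        "parameters": {
            "type": "object",
            "properties": {
                "name": {"type": "string"},
                "age": {"type": "number"},
                "is_student": {"type": "boolean"},
            },
            "required": ["name", "age", "is_student"],
        },
    }],
    function_call={"name": "make_person"},  # force the model to call this function
)
arguments = response["choices"][0]["message"]["function_call"]["arguments"]
print(json.loads(arguments))  # the arguments string is JSON matching the schema
```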

@wassname

This is great work, but it would complicate the repo, which is nice and simple.

This list includes quite a few other libraries that support API-only models: https://github.com/wassname/awesome-interpretability/tree/main?tab=readme-ov-file#structured-output
