
WIP: Implement streaming completions #32

Draft · wants to merge 6 commits into main
Conversation

@antaz (Collaborator) commented on Nov 8, 2024

Changes

  • Added a Chunk response to represent a streamed completion chunk
  • Added streaming capability for OpenAI

Examples

    completion = LLM.openai(KEY).complete Message.new("user", "write a small story"), stream: true
    result = ""
    while (chunk = completion.resume)
      result << chunk.choices.first.content
    end
    puts result
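Note that OpenAI's streaming deltas can carry no text for some events (for example, the role-only first chunk or the final finish chunk), so a nil-safe accumulation may be more robust. A small variation on the loop above, under the same assumed chunk interface:

    result = ""
    while (chunk = completion.resume)
      # Guard against chunks whose delta carries no content.
      result << (chunk.choices.first.content || "")
    end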

@0x1eef (Member) left a comment:

Left a couple of comments

    # frozen_string_literal: true

    module LLM
      class Response::Chunk < Response
@0x1eef (Member) commented:

Should we inherit from Response::Completion? It is still a Completion response, right?

@antaz (Collaborator, Author) replied:

It certainly is. I was thinking about it, but got a little hesitant.
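For reference, the inheritance being discussed would look roughly like this (a sketch; it assumes Chunk needs no parsing beyond what Completion already provides):

    # frozen_string_literal: true

    module LLM
      # A streamed chunk is still a completion, so inheriting from
      # Response::Completion reuses its parsing instead of duplicating it.
      class Response::Chunk < Response::Completion
      end
    end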

    @@ -30,8 +30,18 @@ def complete(message, **params)
      params = DEFAULT_PARAMS.merge(params)
      body = {messages: messages.map(&:to_h)}.merge!(params)
      req = preflight(req, body)
      res = request(@http, req)
      Response::Completion.new(res.body, self).extend(response_parser)
      if params[:stream]
@0x1eef (Member) commented on Nov 8, 2024:

We could offload the complexity to another method, for example:

    stream!(req, body) if params[:stream]
    stream_completion!(req, body) if params[:stream]

@antaz (Collaborator, Author) replied:

@0x1eef I'm still not sure how we'd go about this.
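One possible shape for that extraction: a method that returns a Fiber yielding one chunk per server-sent event, which would match the `completion.resume` loop in the PR description. A minimal sketch; the block form of `request`, the SSE framing, and the `Response::Chunk` constructor arguments are all assumptions, not confirmed by this diff:

    # Sketch only: `request(@http, req) { |res| ... }` and the SSE parsing
    # below are assumptions about this codebase, not part of the diff.
    def stream_completion!(req, body)
      Fiber.new do
        request(@http, req) do |res|
          res.read_body do |data|
            data.each_line do |line|
              next unless line.start_with?("data: ")
              payload = line.delete_prefix("data: ").strip
              next if payload == "[DONE]"
              Fiber.yield Response::Chunk.new(payload, self).extend(response_parser)
            end
          end
        end
        nil # a final resume returns nil, which ends the caller's while loop
      end
    end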

@antaz changed the title from "Implement streaming completions" to "WIP: Implement streaming completions" on Nov 8, 2024
    # frozen_string_literal: true

    module LLM
      require_relative "completion"
@0x1eef (Member) commented on Nov 8, 2024:

Can we move this to lib/llm/response.rb instead? The require_relative statements found in that file are properly ordered. For example, it could be like this and the dependencies should be properly met:

    require "json"
    require_relative "response/completion"
    require_relative "response/chunk"
    require_relative "response/embedding"
