
feat: Adding code for sending parameter "response_format" as request payload #2317

Closed

Conversation

FarrukhMasud
Contributor

Related Issues/PRs

For newer models, OpenAI accepts response_format, which can be text or json_object. This PR adds a parameter for this. Additionally, when the developer sets postProcessingOptions in the OpenAIPrompt class, the postProcessing field is redundant and can be made optional. We cannot infer postProcessing from the response format, because json as a response format is not supported by many models, including GPT-4 and GPT-3.5-Turbo. All changes made in this PR are backward compatible and will not break any existing code.

What changes are proposed in this pull request?

In this PR, we are adding:

  1. The ability to set a response_format parameter in the OpenAIPrompt and OpenAIChatCompletion classes.
  2. Unit tests to validate this change.
  3. Code to infer postProcessing from postProcessingOptions, making postProcessing optional.

How is this patch tested?

Unit tests and integration tests are added to validate this change

Does this PR change any dependencies?

  • No. You can skip this section.
  • Yes. Make sure the dependencies are resolved correctly, and list changes here.

Does this PR add a new feature? If so, have you added samples on website?

  • No. You can skip this section.
  • Yes. Make sure you have added samples following the steps below.

@FarrukhMasud
Contributor Author

/azp run


Azure Pipelines successfully started running 1 pipeline(s).

@codecov-commenter

codecov-commenter commented Nov 21, 2024

Codecov Report

Attention: Patch coverage is 92.53731% with 5 lines in your changes missing coverage. Please review.

Project coverage is 84.50%. Comparing base (08aab6a) to head (14d460f).

Files with missing lines Patch % Lines
...zure/synapse/ml/services/openai/OpenAIPrompt.scala 85.71% 3 Missing ⚠️
...apse/ml/services/openai/OpenAIChatCompletion.scala 95.55% 2 Missing ⚠️
Additional details and impacted files
@@           Coverage Diff           @@
##           master    #2317   +/-   ##
=======================================
  Coverage   84.50%   84.50%           
=======================================
  Files         326      326           
  Lines       16755    16810   +55     
  Branches     1480     1498   +18     
=======================================
+ Hits        14158    14205   +47     
- Misses       2597     2605    +8     


@@ -55,11 +132,20 @@ class OpenAIChatCompletion(override val uid: String) extends OpenAIServicesBase(
   override def responseDataType: DataType = ChatCompletionResponse.schema
 
   private[this] def getStringEntity(messages: Seq[Row], optionalParams: Map[String, Any]): StringEntity = {
-    val mappedMessages: Seq[Map[String, String]] = messages.map { m =>
+    var mappedMessages: Seq[Map[String, String]] = messages.map { m =>
Collaborator

@mhamilton723 mhamilton723 Nov 21, 2024


nit: please no vars in code unless required. In Scala, any kind of mutability is highly discouraged. In this case you can return either the original sequence or the original plus your addition from the if statement.

Contributor Author


made mappedMessages val again
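The reviewer's suggestion can be sketched as follows. This is an illustrative standalone helper, not the PR's actual code; the name `appendFormatMessage` and the `Option`-based signature are assumptions made for the example. The point is that returning either the original sequence or the original plus the extra entry keeps the binding a `val`:

```scala
object MessageOps {
  // Sketch: instead of mutating a var, return either the original
  // sequence or the original plus the extra entry. Seq's :+ builds a
  // new sequence; the input is never modified.
  def appendFormatMessage(messages: Seq[Map[String, String]],
                          extra: Option[Map[String, String]]): Seq[Map[String, String]] =
    extra match {
      case Some(m) => messages :+ m
      case None    => messages
    }
}
```

With a helper like this, `val mappedMessages = MessageOps.appendFormatMessage(base, formatDirective)` stays immutable regardless of whether a format directive is present.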

Comment on lines 112 to 142
val responseFormat = new Param[String](
  this, "responseFormat", "The response format from the OpenAI API.")

def getResponseFormat: String = $(responseFormat)

def setResponseFormat(value: String): this.type = {
  if (value.isEmpty) {
    this
  } else {
    val normalizedValue = value.toLowerCase match {
      case "json" => "json_object"
      case other => other
    }

    // Validate the normalized value using the OpenAIResponseFormat enum
    if (!OpenAIResponseFormat.values
      .map(_.asInstanceOf[OpenAIResponseFormat.ResponseFormat].name)
      .contains(normalizedValue)) {
      throw new IllegalArgumentException("Response format must be valid for OpenAI API. " +
        "Currently supported formats are " + OpenAIResponseFormat.values
        .map(_.asInstanceOf[OpenAIResponseFormat.ResponseFormat].name)
        .mkString(", "))
    }

    set(responseFormat, normalizedValue)
  }
}

def setResponseFormat(value: OpenAIResponseFormat.ResponseFormat): this.type = {
  this.setResponseFormat(value.name)
}
Collaborator


this code seems duplicated, can it be abstracted and shared?

Contributor Author


I am now using shared code.
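One way the duplicated normalization/validation logic could be factored into a single shared location is sketched below. This is a hypothetical abstraction, not the PR's actual shared code; the object name `ResponseFormats` and the hardcoded `Supported` set stand in for the `OpenAIResponseFormat` enum used in the real code:

```scala
// Sketch: one shared home for the normalization and validation that was
// duplicated between OpenAIPrompt and OpenAIChatCompletion.
object ResponseFormats {
  // Stand-in for the names exposed by the OpenAIResponseFormat enum.
  val Supported: Set[String] = Set("text", "json_object")

  def normalize(value: String): String = {
    // Accept "json" as shorthand for "json_object", case-insensitively.
    val normalized = value.toLowerCase match {
      case "json" => "json_object"
      case other  => other
    }
    require(Supported.contains(normalized),
      "Response format must be valid for OpenAI API. " +
        s"Currently supported formats are ${Supported.mkString(", ")}")
    normalized
  }
}
```

Both `setResponseFormat` overloads in the two classes could then delegate to a single helper like this instead of each carrying their own copy of the match and validation.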

assert(nonNullCount == 4)
}

test("Basic Usage - Gpt 4o with response format text") {
Collaborator


nice!

Comment on lines 295 to 311
test("setResponseFormat should set the response format correctly for valid values") {
  val prompt = new OpenAIPrompt()
  prompt.setResponseFormat("text")
  prompt.getResponseFormat should be ("text")

  prompt.setResponseFormat("json")
  prompt.getResponseFormat should be ("json_object")

  prompt.setResponseFormat("json_object")
  prompt.getResponseFormat should be ("json_object")

  prompt.setResponseFormat("jSoN")
  prompt.getResponseFormat should be ("json_object")

  prompt.setResponseFormat("TEXT")
  prompt.getResponseFormat should be ("text")
}
Collaborator


these tests look duplicated from above, if the core parameter flexibility is shared between the two we can keep just 1 copy and still have same basic coverage

Contributor Author


removed

@mhamilton723
Collaborator

Awesome! Only minor nits!

…ts can be reused; in this iteration, I am removing this possibility. Now each test creates a new OpenAIPrompt.