<!-- loio2392d9a0504e4380bfa75a5efdb64b6e -->

# Consume Models with the Harmonized API

In this section, we provide a minimal inference call that uses no optional orchestration modules.

A minimal call to orchestration configures only the two required modules: templating and LLM. The curl command below shows how to make such a request.
```
curl --request POST $ORCH_DEPLOYMENT_URL/completion \
  --header 'content-type: application/json' \
  --header "Authorization: Bearer $TOKEN" \
  --header "ai-resource-group: $RESOURCE_GROUP" \
  --data-raw '{
  "orchestration_config": {
    "module_configurations": {
      "templating_module_config": {
        "template": [
          {
            "role": "user",
            "content": "Reply with `{{?text}}` in {{?language}}"
          }
        ],
        "defaults": {
          "language": "English"
        }
      },
      "llm_module_config": {
        "model_name": "gpt-35-turbo-16k",
        "model_params": {
          "max_tokens": 50,
          "temperature": 0.1,
          "frequency_penalty": 0,
          "presence_penalty": 0
        },
        "model_version": "latest"
      }
    }
  },
  "input_params": {
    "text": "Orchestration is Working!",
    "language": "German"
  }
}'
```

This request configures the templating module with a single user message that takes two parameters: `text` and `language`. The `language` parameter also has a default value of English. The LLM module is configured to use gpt-35-turbo-16k in the latest available version, together with a set of model parameters. The `input_params` field contains the values for `text` and `language`; during this request, these values are filled into the prompt sent to the model.

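If you are calling the endpoint from code rather than from a shell, the following Python sketch sends the same payload using the `requests` library. It assumes the same `ORCH_DEPLOYMENT_URL`, `TOKEN`, and `RESOURCE_GROUP` values from the curl example are available as environment variables.

```
# A sketch of the same minimal orchestration request in Python, assuming the
# ORCH_DEPLOYMENT_URL, TOKEN, and RESOURCE_GROUP environment variables are set
# as in the curl example above.
import os

import requests

payload = {
    "orchestration_config": {
        "module_configurations": {
            "templating_module_config": {
                "template": [
                    {
                        "role": "user",
                        "content": "Reply with `{{?text}}` in {{?language}}",
                    }
                ],
                "defaults": {"language": "English"},
            },
            "llm_module_config": {
                "model_name": "gpt-35-turbo-16k",
                "model_params": {
                    "max_tokens": 50,
                    "temperature": 0.1,
                    "frequency_penalty": 0,
                    "presence_penalty": 0,
                },
                "model_version": "latest",
            },
        }
    },
    "input_params": {"text": "Orchestration is Working!", "language": "German"},
}

response = requests.post(
    f"{os.environ['ORCH_DEPLOYMENT_URL']}/completion",
    headers={
        "Authorization": f"Bearer {os.environ['TOKEN']}",
        "ai-resource-group": os.environ["RESOURCE_GROUP"],
    },
    json=payload,  # requests sets the content-type: application/json header
)
response.raise_for_status()
print(response.json()["orchestration_result"]["choices"][0]["message"]["content"])
```
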
The response contains a `request_id`, the module results from each module that was executed, and the `orchestration_result`, which includes the response of the call to the model.

> ### Output Code:
> ```
> {
>   "request_id": "53fc2dcd-399d-4a2b-8bde-912b9f001fed",
>   "module_results": {
>     "templating": [
>       {
>         "role": "user",
>         "content": "Reply with `Orchestration is Working!` in German"
>       }
>     ],
>     "llm": {
>       "id": "chatcmpl-9k8M3djXphXPWh2QkQm1YVtXK4Eki",
>       "object": "chat.completion",
>       "created": 1720782231,
>       "model": "gpt-35-turbo-16k",
>       "choices": [
>         {
>           "index": 0,
>           "message": {
>             "role": "assistant",
>             "content": "Orchestrierungsdienst funktioniert!"
>           },
>           "finish_reason": "stop"
>         }
>       ],
>       "usage": {
>         "completion_tokens": 10,
>         "prompt_tokens": 20,
>         "total_tokens": 30
>       }
>     }
>   },
>   "orchestration_result": {
>     "id": "chatcmpl-9k8M3djXphXPWh2QkQm1YVtXK4Eki",
>     "object": "chat.completion",
>     "created": 1720782231,
>     "model": "gpt-35-turbo-16k",
>     "choices": [
>       {
>         "index": 0,
>         "message": {
>           "role": "assistant",
>           "content": "Orchestrierungsdienst funktioniert!"
>         },
>         "finish_reason": "stop"
>       }
>     ],
>     "usage": {
>       "completion_tokens": 10,
>       "prompt_tokens": 20,
>       "total_tokens": 30
>     }
>   }
> }
> ```

The templating module result contains the user message with the filled-in parameters. The LLM module result contains the response of the model execution. In this example, the LLM module result and the orchestration result are the same. However, they can differ, for example when the output filtering module filters the response.

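As a sketch of how a client might work with that structure, the snippet below reads the individual fields from the parsed JSON body; it assumes the `response` object from the Python example earlier in this section.

```
# Sketch: reading the parsed response body, assuming `response` is the
# requests.Response object from the example above.
result = response.json()

# The templating result: the user message with the parameters filled in.
print(result["module_results"]["templating"][0]["content"])
# -> Reply with `Orchestration is Working!` in German

# The raw LLM output and the final orchestration output. They match here, but
# can differ when optional modules (such as output filtering) alter the response.
llm_content = result["module_results"]["llm"]["choices"][0]["message"]["content"]
final_content = result["orchestration_result"]["choices"][0]["message"]["content"]
print(llm_content == final_content)  # True in this example
```
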
**Related Information**


[Consumption of GenAI Models Using Orchestration – A Beginner's Guide](https://developers.sap.com/tutorials/ai-core-orchestration-consumption.html)

[Libraries and SDKs](libraries-and-sdks-499309d.md "Explore additional SDKs and Libraries, for use with SAP AI Core.")