llama-stack-client-swift brings the inference and agents APIs of Llama Stack to iOS.
Update (January 27, 2025): The llama-stack-client-swift SDK has been updated to version 0.1.0, which works with Llama Stack 0.1.0 (release note).
- Inference & Agents: Leverage remote Llama Stack distributions for inference, code execution, and safety.
- Custom Tool Calling: Provide Swift tools that Llama agents can understand and use.
See here for a quick iOS demo (video) using a remote Llama Stack server for inference.
For a more advanced demo using the Llama Stack Agent API and custom tool calling feature, see the iOS Calendar Assistant demo.
1. Click "Xcode > File > Add Package Dependencies...".

2. Add this repo URL at the top right: `https://github.com/meta-llama/llama-stack-client-swift` and 0.1.0 in the Dependency Rule, then click Add Package. (If you add the dependency via a Package.swift manifest instead of the Xcode UI, see the sketch after this list.)

3. Select and add `llama-stack-client-swift` to your app target.

4. On the first build: Enable & Trust the OpenAPIGenerator extension when prompted.
5. Set up a remote Llama Stack distribution, assuming you have a Fireworks or Together API key, which you can get easily by clicking the link:

   ```
   conda create -n llama-stack python=3.10
   conda activate llama-stack
   pip install --no-cache llama-stack==0.1.0 llama-models==0.1.0 llama-stack-client==0.1.0
   ```

   Then, either:

   ```
   PYPI_VERSION=0.1.0 llama stack build --template fireworks --image-type conda
   export FIREWORKS_API_KEY="<your_fireworks_api_key>"
   llama stack run fireworks
   ```

   or

   ```
   PYPI_VERSION=0.1.0 llama stack build --template together --image-type conda
   export TOGETHER_API_KEY="<your_together_api_key>"
   llama stack run together
   ```

   The default port for `llama stack run` is 5000; you can specify a different port by adding `--port <your_port>` to the end of `llama stack run fireworks|together`.
6. Replace the `RemoteInference` url string below with the host IP and port of the remote Llama Stack distro in Step 5:

   ```swift
   import LlamaStackClient

   let inference = RemoteInference(url: URL(string: "http://127.0.0.1:5000")!)
   ```
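If you manage dependencies through a Package.swift manifest rather than the Xcode package UI (see step 2 above), you can declare the same dependency there. This is a minimal sketch, not taken from this repo: the package/target name `MyLlamaApp` and the minimum platform are placeholders, and the product name `LlamaStackClient` is assumed to match the module imported in Step 6; check the SDK's own Package.swift for the exact product name.

```swift
// swift-tools-version: 5.9
import PackageDescription

let package = Package(
    name: "MyLlamaApp",            // hypothetical package name
    platforms: [.iOS(.v16)],       // assumed minimum platform; adjust to your target
    dependencies: [
        // Same repo URL and version as step 2 of the Xcode instructions.
        .package(url: "https://github.com/meta-llama/llama-stack-client-swift", from: "0.1.0"),
    ],
    targets: [
        .target(
            name: "MyLlamaApp",
            dependencies: [
                // Assumed product name, matching `import LlamaStackClient` in Step 6.
                .product(name: "LlamaStackClient", package: "llama-stack-client-swift"),
            ]
        ),
    ]
)
```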
Below is an example code snippet to use the Llama Stack inference API. See the iOS Demos above for complete code.
```swift
var message = ""

for await chunk in try await inference.chatCompletion(
  request: Components.Schemas.ChatCompletionRequest(
    messages: [
      .user(
        Components.Schemas.UserMessage(
          content: .InterleavedContentItem(
            .text(Components.Schemas.TextContentItem(
              text: userInput,
              _type: .text
            ))
          ),
          role: .user
        )
      )
    ],
    model_id: "meta-llama/Llama-3.1-8B-Instruct",
    stream: true
  )
) {
  switch chunk.event.delta {
  case .text(let s):
    // Accumulate streamed text deltas into the response message.
    message += s.text
  case .image(let s):
    print("> \(s)")
  case .tool_call(let s):
    print("> \(s)")
  }
}
```
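If you only need the final reply rather than per-token updates, the streaming loop can be wrapped in a small helper that collects the text deltas into a single string. The helper below is a hypothetical convenience, not part of the SDK; `completeChat` is an assumed name, and it reuses exactly the request shape shown above.

```swift
import LlamaStackClient

// Hypothetical helper (not part of the SDK): runs one chat completion and
// returns the fully accumulated text instead of streaming it to the caller.
func completeChat(
  _ userInput: String,
  modelId: String = "meta-llama/Llama-3.1-8B-Instruct",
  inference: RemoteInference
) async throws -> String {
  var message = ""
  for await chunk in try await inference.chatCompletion(
    request: Components.Schemas.ChatCompletionRequest(
      messages: [
        .user(
          Components.Schemas.UserMessage(
            content: .InterleavedContentItem(
              .text(Components.Schemas.TextContentItem(
                text: userInput,
                _type: .text
              ))
            ),
            role: .user
          )
        )
      ],
      model_id: modelId,
      stream: true
    )
  ) {
    switch chunk.event.delta {
    case .text(let s):
      message += s.text   // accumulate streamed text deltas
    case .image, .tool_call:
      break               // this sketch ignores non-text deltas
    }
  }
  return message
}

// Usage:
// let reply = try await completeChat("Best quick sushi spot nearby?", inference: inference)
```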