Releases · josStorer/RWKV-Runner

05 Nov 13:27

github-actions

v1.5.0

b8712e0

v1.5.0

Changes

Features

chat attachment is now related to single message (Experimental)
port occupied detection

Upgrades

upgrade to rwkv 0.8.20

Improvements

improve the compatibility between frontend presets and chatgpt api
improve memory usage of state cache

Chores

update ngrok_connect
python38 compatibility
adjust startup process

Fixes

fix log encoding error
fix stop button status of Chat page

Install

Assets 6

27 Oct 06:28

github-actions

v1.4.9

97ae139

v1.4.9

Changes

Features

allow conversation with some document (.pdf, .txt) (Experimental)
add /file-to-text api
allow avatarImg to be local absolute path
base64 preset support

Upgrades

upgrade to rwkv 0.8.16 (DirectML support; rwkv 5.2 no longer needs to ensure custom cuda kernel enabled)
upgrade to webgpu 0.2.2 (WebGPU Mode is now recommended for AMD and Intel Users) (https://github.com/josStorer/ai00_rwkv_server)
upgrade python packages

Improvements

improve cuda kernel compatibility (compute compatibility 5.3, Jetson Nano, MX Series, Nvidia 10 Series+)
RWKVType now no longer relies on the file name (use emb)
improve message interruption and retry for Chat page
update sample.jsonl of LoRA finetune
update api stop strategy for better custom user_name and assistant_name support
edited chat message now is marked as Normal
change default World series prefix to User/Assistant

Chores

update manifest.json (RWKV-5)
update readme and client text description
add pip --no-warn-script-location
mark rwkv raven series as old model
chore

Fixes

fix linux kernel (partial revert 68228a4)
fix the make command on Linux and macOS, no longer need manual operations on the wsl.go file. (#158, #173, #207)

Install

Assets 6

03 Oct 07:21

github-actions

v1.4.8

4f26404

v1.4.8

Changes

latest rwkv-5.2 is now supported (with pre-compiled kernel for windows)
completion page: add format content button
chore

Install

Assets 6

18 Sep 15:20

github-actions

v1.4.7

c0aa6aa

v1.4.7

Changes

custom tokenizer .txt support
add pre-compiled beta cuda kernel (rwkv-beta==0.8.5, 40%+ faster for fp16) (thanks to #180, pre-compiled kernel of RTX 40 Series will be included later)

Install

Assets 6

16 Sep 05:23

github-actions

v1.4.6

f8b764e

v1.4.6

Changes

frontend adaptation for api params (user_name, assistant_name, presystem)
custom tokenizer (#77)
enable right-click context menu
upgrade cuda-beta
revert(2d5456): refresh local models when download complete (for macOS)
improve ui desc
chore

Install

Assets 6

27 Aug 16:15

github-actions

v1.4.5

e0df9ae

v1.4.5

Changes

frontend: update manifest (a lot of new models)
frontend: correct Preset UI description
frontend: add HardwareMonitor (Windows Only)
lora finetune: fix max_epochs (#170)
python-backend: allow message content to be empty
python-backend: extra ChatCompletionBody params (raw, presystem)
python-backend: add default_stop when stop is null
webgpu: fix webgpu_server file permissions of linux and macos
chore

Install

Assets 6

16 Aug 15:46

github-actions

v1.4.4

c9e4ae7

v1.4.4

Changes

webgpu support (AMD, Intel, Nvidia, Apple) (https://github.com/cgisky1980/ai00_rwkv_server)
add rwkv-cuda-beta support (faster)
add misc API (/models and /dashboard/billing/credit_grants)
allow multiple systems
allow completions input to be null

Install

WebGPU Mode

Assets 6

31 Jul 15:03

github-actions

v1.4.3

d7d4f87

v1.4.3

Changes

japanese UI
global penalty
allow custom user_name and assistant_name (/chat/completions API)
update defaultConfigs

Install

Assets 6

29 Jul 11:43

github-actions

v1.4.2

eb25dc6

v1.4.2

Breaking Changes

Due to performance and bandwidth considerations, the /chat/completions and /completions API no longer return the response field. If necessary, you can still uncomment it yourself.
Please note that this is never part of the OpenAI API, it existed previously only for API development convenience. If you follow the OpenAI API specification, you will not be affected in any way.

Changes

improve /chat/completions and /completions API performance (remove response field)
improve default ChatCompletion stop
improve python backend startup speed
update defaultConfigs

Install

Assets 6

28 Jul 14:28

github-actions

v1.4.1

30b6d66

v1.4.1

Changes

add Composition Page (RWKV-Music)
improve RunButton prompt
support for stop array api params
improve embeddings API results
improve python backend startup speed
add support for MIDI RWKV
add midi api
add CPU-120M-Music config
improve sse fetch
update manifest (a lot of new models)
update presets
remove LoraFinetunePrecision fp32
chore

Install

Assets 6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Changes

Features

Upgrades

Improvements

Chores

Fixes

Install

Changes

Features

Upgrades

Improvements

Chores

Fixes

Install

Changes

Install

Changes

Install

Changes

Install

Changes

Install

Changes

Install

WebGPU Mode

Changes

Install

Breaking Changes

Changes

Install

Changes

Install

Releases: josStorer/RWKV-Runner

v1.5.0

Changes

Features

Upgrades

Improvements

Chores

Fixes

Install

v1.4.9

Changes

Features

Upgrades

Improvements

Chores

Fixes

Install

v1.4.8

Changes

Install

v1.4.7

Changes

Install

v1.4.6

Changes

Install

v1.4.5

Changes

Install

v1.4.4

Changes

Install

WebGPU Mode

v1.4.3

Changes

Install

v1.4.2

Breaking Changes

Changes

Install

v1.4.1

Changes

Install