v0.1.0
The three of us - @ishkhan42, @VoVoR, and @ashvardanian have added support for NumPy arrays and Pillow images as RPC arguments, adding a small Client
class along the way! The following example shows how to serve UForm multi-modal vision-language transformers in just a few lines of Python, connecting dozens of SIMD-accelerated libraries under the hood for ultimate performance!
from ujrpc.rich_posix import Server
import ufrom
server = Server()
model = uform.get_model('unum-cloud/uform-vl-multilingual')
@server
def vectorize(description: str, photo: PIL.Image.Image) -> numpy.ndarray:
image = model.preprocess_image(photo)
tokens = model.preprocess_text(description)
joint_embedding = model.encode_multimodal(image=image, text=tokens)
return joint_embedding.cpu().detach().numpy()
A new UKV release and the public rollout of UNSW are also right around the corner!