ModelScope

All

27 repositories

data-juicer
Public
Making data higher-quality, juicier, and more digestible for foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据！
nlp data-science opendata data-visualization pytorch dataset chinese data-analysis llama gpt
Python
•
Apache License 2.0
•186•3.1k•25•16•Updated Dec 18, 2024Dec 18, 2024
DiffSynth-Studio
Public
Enjoy the magic of Diffusion models!
Python
•
Apache License 2.0
•609•6.7k•115•0•Updated Dec 18, 2024Dec 18, 2024
ms-swift
Public
Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 100+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).
agent deploy llama lora liger peft multimodal sft dpo pre-training
Python
•
Apache License 2.0
•409•4.7k•292•9•Updated Dec 18, 2024Dec 18, 2024
evalscope
Public
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
performance evaluation vlm rag llm
Python
•
Apache License 2.0
•36•310•21•1•Updated Dec 18, 2024Dec 18, 2024
dash-infer
Public
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including CUDA, x86 and ARMv9.
cpu cuda llm llm-inference native-engine guided-decoding
C
•
Apache License 2.0
•16•139•5•0•Updated Dec 18, 2024Dec 18, 2024
modelscope-studio
Public
A third-party component library based on Gradio.
Python
•
Apache License 2.0
•8•55•2•0•Updated Dec 18, 2024Dec 18, 2024
3D-Speaker
Public
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
speaker-verification speaker-diarization language-identification voxceleb modelscope campplus eres2net 3d-speaker rdino cnceleb
Python
•
Apache License 2.0
•109•1.3k•1•0•Updated Dec 18, 2024Dec 18, 2024
modelscope
Public
ModelScope: bring the notion of Model-as-a-Service to life.
nlp science cv speech multi-modal python machine-learning deep-learning
Python
•
Apache License 2.0
•739•7.1k•21•7•Updated Dec 18, 2024Dec 18, 2024
FunASR
Public
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model
Python
•
Other
•784•7.4k•205•11•Updated Dec 17, 2024Dec 17, 2024
ClearerVoice-Studio
Public
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
Python
•
Apache License 2.0
•123•1.8k•10•4•Updated Dec 17, 2024Dec 17, 2024
agentscope
Public
Start building LLM-empowered multi-agent applications in an easier way.
agent drag-and-drop chatbot multi-agent multi-modal distributed-agents gpt-4 large-language-models llm llm-agent
Python
•
Apache License 2.0
•343•5.5k•29•18•Updated Dec 12, 2024Dec 12, 2024
PromptScope
Public
Enjoy easier conversations with LLM
prompt multi-modal gpt-4 in-context-learning large-language-models prompt-engineering llms
Python
•
Apache License 2.0
•1•2•0•0•Updated Dec 12, 2024Dec 12, 2024
facechain
Public
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
Jupyter Notebook
•
Apache License 2.0
•860•9.2k•7•2•Updated Dec 10, 2024Dec 10, 2024
scepter
Public
SCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models.
generative-model scedit aigc lar-gen stylebooth
Python
•
Apache License 2.0
•26•439•9•0•Updated Dec 7, 2024Dec 7, 2024
modelscope-agent
Public
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
agent data-science code chatbot android-application multi-agents rag mobile-agents gpts llm
Python
•
Apache License 2.0
•320•2.8k•68•2•Updated Dec 4, 2024Dec 4, 2024
modelscope-classroom
Public
Jupyter Notebook
•
Apache License 2.0
•68•568•0•1•Updated Nov 22, 2024Nov 22, 2024
MemoryScope
Public
Python
•
Apache License 2.0
•32•337•3•0•Updated Nov 21, 2024Nov 21, 2024
comfyscope
Public
Collection of various Comfy components.
Python
•
Apache License 2.0
•1•3•0•2•Updated Nov 20, 2024Nov 20, 2024
richdreamer
Public
[CVPR2024 (Highlight)] RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D. Live Demo：https://modelscope.cn/studios/Damo_XR_Lab/3D_AIGC
Python
•
Apache License 2.0
•18•426•17•0•Updated Sep 27, 2024Sep 27, 2024
motionagent
Public
MotionAgent is your AI assistent to convert ideas into motion pictures.
Python
•
Apache License 2.0
•35•286•3•1•Updated Sep 2, 2024Sep 2, 2024
FunClip
Public
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
speech-recognition speech-to-text gradio video-clip subtitles-generator video-subtitles llm gradio-python-llm
Python
•
MIT License
•432•3.9k•24•2•Updated Aug 22, 2024Aug 22, 2024
lite-sora
Public
An initiative to replicate Sora
Python
•
Apache License 2.0
•6•100•3•0•Updated Apr 10, 2024Apr 10, 2024
normal-depth-diffusion
Public
Python
•
Apache License 2.0
•8•126•5•0•Updated Feb 7, 2024Feb 7, 2024
FunCodec
Public
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
tts speech-synthesis codec speech-to-text audio-generation encodec voicecloning audio-quantization
Python
•
MIT License
•31•374•20•1•Updated Jan 25, 2024Jan 25, 2024
KAN-TTS
Public
KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech
modelscope speech tts speech-synthesis
Python
•
MIT License
•82•498•42•1•Updated Dec 28, 2023Dec 28, 2023
AdaSeq
Public
AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models
natural-language-processing information-extraction chinese-nlp word-segmentation bert sequence-labeling relation-extraction natural-language-understanding entity-typing token-classification
Python
•
Apache License 2.0
•38•426•31•0•Updated Nov 15, 2023Nov 15, 2023
kws-training-suite
Public
Python
•
MIT License
•18•90•7•0•Updated May 26, 2023May 26, 2023