FunAudioLLM

Popular repositories

  1. CosyVoice Public

    Multilingual large voice generation model with full-stack support for inference, training, and deployment.

    Python · 19.4k stars · 2.2k forks

  2. SenseVoice Public

    Multilingual Voice Understanding Model

    Python · 7.4k stars · 694 forks

  3. FunMusic Public

    A fundamental toolkit designed for music, song, and audio generation

    Python · 1.3k stars · 131 forks

  4. ThinkSound Public

    [NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.

    Python · 1.1k stars · 67 forks

  5. Fun-ASR Public

    Fun-ASR is a large end-to-end speech recognition model launched by Tongyi Lab.

    Python · 834 stars · 65 forks

  6. Fun-Audio-Chat Public

    Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.

    Python · 723 stars · 71 forks

Repositories

Showing 10 of 12 repositories
  • FunResearch Public

    This repository is maintained by the Speech Team at Alibaba’s Tongyi Lab and serves as an open-source platform for our cutting-edge research in speech, audio, and NLP technologies. We believe in accelerating scientific progress through transparent collaboration, and we invite the global research community to explore, reproduce, and build upon our work.

    Python 16 Apache-2.0 1 0 0 Updated Jan 30, 2026
  • CosyVoice Public

    Multilingual large voice generation model with full-stack support for inference, training, and deployment.

    Python 19,397 Apache-2.0 2,180 862 18 Updated Jan 29, 2026
  • Fun-Audio-Chat Public

    Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.

    Python 723 Apache-2.0 71 10 1 Updated Jan 29, 2026
  • ThinkSound Public

    [NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.

    Python 1,146 67 32 1 Updated Jan 27, 2026
  • Fun-ASR Public

    Fun-ASR is a large end-to-end speech recognition model launched by Tongyi Lab.

    Python 834 Apache-2.0 65 46 0 Updated Jan 26, 2026
  • FunAudioLLM.github.io Public
    HTML 56 MIT 10 0 1 Updated Jan 21, 2026
  • MME-Emotion Public

    Official repository for the paper “MME-Emotion: A Holistic Evaluation Benchmark for Emotional Intelligence in Multimodal Large Language Models”

    Python 20 MIT 2 1 0 Updated Jan 17, 2026
  • SenseVoice Public

    Multilingual Voice Understanding Model

    Python 7,447 694 169 4 Updated Dec 30, 2025
  • CV3-Eval Public
    Python 172 Apache-2.0 14 7 0 Updated Aug 25, 2025
  • OmniAudio Public
    Python 8 3 0 0 Updated May 21, 2025