Skip to content
@kyutai-labs

kyutai

Kyutai - Open Science AI Lab

Popular repositories Loading

  1. moshi moshi Public

    Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

    Python 9.3k 844

  2. delayed-streams-modeling delayed-streams-modeling Public

    Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

    Python 2.8k 278

  3. pocket-tts pocket-tts Public

    A TTS that fits in your CPU (and pocket)

    Python 1.5k 162

  4. hibiki hibiki Public

    Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits for the end of the source utterance to start translating--- H…

    Rust 1.4k 109

  5. unmute unmute Public

    Make text LLMs listen and speak

    Python 1.1k 188

  6. moshi-finetune moshi-finetune Public

    Python 347 47

Repositories

Showing 10 of 24 repositories
  • pocket-tts Public

    A TTS that fits in your CPU (and pocket)

    kyutai-labs/pocket-tts’s past year of commit activity
    Python 1,467 MIT 162 19 (8 issues need help) 8 Updated Jan 15, 2026
  • tts_longeval Public
    kyutai-labs/tts_longeval’s past year of commit activity
    Python 25 MIT 2 0 0 Updated Jan 15, 2026
  • moshi Public

    Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

    kyutai-labs/moshi’s past year of commit activity
    Python 9,312 Apache-2.0 844 60 13 Updated Jan 8, 2026
  • sphn Public

    python bindings for symphonia/opus - read various audio formats from python and write opus files

    kyutai-labs/sphn’s past year of commit activity
    Rust 74 Apache-2.0 7 1 0 Updated Jan 7, 2026
  • ARC-Encoder Public
    kyutai-labs/ARC-Encoder’s past year of commit activity
    Python 24 Apache-2.0 3 0 0 Updated Jan 5, 2026
  • jax-flash-attn3 Public

    JAX bindings for the flash-attention3 kernels

    kyutai-labs/jax-flash-attn3’s past year of commit activity
    C++ 18 3 0 1 Updated Jan 2, 2026
  • flash-attn3-jax Public

    JAX bindings for the FlashAttention 3 kernels

    kyutai-labs/flash-attn3-jax’s past year of commit activity
    C++ 14 BSD-3-Clause 1 0 0 Updated Dec 27, 2025
  • casa Public

    A vision-language model with an improved cross-attention mechanism for scalable streaming inference

    kyutai-labs/casa’s past year of commit activity
    Python 23 MIT 3 3 0 Updated Dec 24, 2025
  • unmute Public

    Make text LLMs listen and speak

    kyutai-labs/unmute’s past year of commit activity
    Python 1,106 MIT 188 26 (3 issues need help) 0 Updated Dec 24, 2025
  • delayed-streams-modeling Public

    Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

    kyutai-labs/delayed-streams-modeling’s past year of commit activity
    Python 2,764 Apache-2.0 278 34 0 Updated Nov 26, 2025

Most used topics

Loading…