Skip to content
Change the repository type filter

All

    Repositories list

    • moshi

      Public
      Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
      Python
      Apache License 2.0
      6658k367Updated Apr 8, 2025Apr 8, 2025
    • hibiki

      Public
      Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits for the end of the source utterance to start translating--- Hibiki adapts its flow to accumulate just enough context to produce a correct translation in real-time, chunk by chunk.
      Rust
      Apache License 2.0
      7897351Updated Apr 4, 2025Apr 4, 2025
    • Python
      Apache License 2.0
      817920Updated Apr 3, 2025Apr 3, 2025
    • Swift
      MIT License
      68710Updated Mar 31, 2025Mar 31, 2025
    • sphn

      Public
      python bindings for symphonia/opus - read various audio formats from python and write opus files
      Rust
      Apache License 2.0
      55600Updated Mar 26, 2025Mar 26, 2025
    • moshivis

      Public
      Kyutai with an "eye"
      Python
      Apache License 2.0
      2618001Updated Mar 26, 2025Mar 26, 2025
    • kaudio

      Public
      Rust crate for some audio utilities
      Rust
      Apache License 2.0
      02200Updated Mar 8, 2025Mar 8, 2025
    • Proof of concept for running moshi/hibiki using webrtc
      Rust
      Apache License 2.0
      11800Updated Feb 28, 2025Feb 28, 2025
    • JAX bindings for the flash-attention2 kernels
      C++
      0700Updated Jan 16, 2025Jan 16, 2025
    • yomikomi

      Public
      A small rust-based data loader
      Rust
      Apache License 2.0
      02400Updated Dec 10, 2024Dec 10, 2024
    • ogg-table

      Public
      Ogg-vorbis reader with fast random access
      Rust
      Other
      1600Updated Aug 29, 2024Aug 29, 2024
    • JAX bindings for the flash-attention3 kernels
      C++
      11101Updated Aug 6, 2024Aug 6, 2024