moshi

实时语音对话模型

Introduction

Moshi是一个语音文本基础模型和全双工口语对话框架。它使用Mimi ，一种最先进的流式神经音频编解码器。 Mimi 以完全流式传输的方式处理 24 kHz 音频，低至 12.5 Hz 表示，带宽为 1.1 kbps（延迟 80 毫秒，帧大小），但性能比现有的非流式编解码器（如SpeechTokenizer）（50 Hz 、4kbps）或SemantiCodec （50 Hz，1.3kbps）。

Back

Information

Publisher
SeeAISeeAI
Websitehttps://github.com/kyutai-labs/moshi
Published date2024/11/04

More Products

AI工具图像设计

Visit Website

Nano Banana

Details

Nano-Banana is a next-gen AI photo editor that turns short prompts into natural portraits and product shots—pose & hair control, safe face swap, clean background removal, lighting fixes, and crisp 4K upscaling.

AI图像 AI生成 AI工具 AI文本开发工具

AI工具

Visit Website

Free Unlimited Video Face Swap

Details

Free Unlimited Video Face Swap transforms media with advanced AI. Our AI Video Swap Unlimited.

AI助理

AI工具

Visit Website

todaysnews.ai

Details

TodaysNews.ai delivers the world’s top stories in just minutes, curated by AI to cut through the noise. Get smarter, faster updates without endless scrolling — just the facts that matter, sent daily to your inbox.

二次元

moshi

Introduction

Information

Categories

Tags

More Products

Nano Banana

Free Unlimited Video Face Swap

todaysnews.ai

moshi

Introduction

Information

Categories

Tags

More Products

Nano Banana

Free Unlimited Video Face Swap

todaysnews.ai

Newsletter

加入社群