Moshi: A speech-text foundation model for real time dialogue

(github.com)

324 points | by gkucsko a day ago ago

54 comments