Research Scientist @AIatMeta. @centralesupelec, @ENS_ParisSaclay and @UGrenobleAlpes alum. 💬 My opinions are my own | she/her | 🇲🇦
Dec 13, 2024 • 9 tweets • 4 min read
Proud to share our work on Large Concept Models (LCMs)! This is a new direction in language modeling that moves beyond traditional token-level LLMs.
Paper: ai.meta.com/research/publi…
Code: github.com/facebookresear…
1/ LCMs operate at the level of meaning or what we label “concepts”. This corresponds to a sentence in text or an utterance in speech. These units are then embedded into SONAR, a language- and modality-agnostic representation space. github.com/facebookresear…
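To make the "concept = sentence" idea concrete, here is a minimal sketch, assuming a naive punctuation-based sentence splitter and a toy stand-in encoder. The names `split_into_concepts`, `embed_concepts`, and `toy_encoder` are hypothetical and are not part of the LCM or SONAR code; a real pipeline would use a proper sentence segmenter and the actual SONAR encoder.

```python
import re
from typing import Callable, List

Vector = List[float]


def split_into_concepts(text: str) -> List[str]:
    """Split text into sentences, treating each sentence as one 'concept'.

    Assumption: a naive split on whitespace that follows '.', '!' or '?'.
    """
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def toy_encoder(sentence: str) -> Vector:
    """Placeholder encoder (NOT SONAR): returns [char count, word count]."""
    return [float(len(sentence)), float(len(sentence.split()))]


def embed_concepts(text: str, encoder: Callable[[str], Vector]) -> List[Vector]:
    """Map a document to one vector per sentence-level concept."""
    return [encoder(s) for s in split_into_concepts(text)]


sample = "LCMs work on sentences. Each one is a concept! Is that clear?"
concepts = split_into_concepts(sample)
vectors = embed_concepts(sample, toy_encoder)
```

The point of the sketch is only the shape of the data: a document becomes a sequence of sentence-level vectors rather than a sequence of tokens.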
Aug 22, 2023 • 22 tweets • 6 min read
Excited to share our work on SeamlessM4T, an all-in-one Massively Multilingual and Multimodal Machine Translation model.
This 🧵 discusses some of the contributions of SeamlessM4T
SeamlessM4T introduces a multitask UnitY model capable of translating text or speech input into text or speech output. It enables:
- Speech-to-speech translation (S2ST)
- Speech-to-text translation (S2TT)
- Text-to-speech translation (T2ST)
- Text-to-text translation (T2TT)
- Automatic speech recognition (ASR)
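The task list above can be read as a small table of input and output modalities. The following is an illustrative sketch only; the `Task` class, the `TASKS` dictionary, and `tasks_for` are hypothetical names for this example and do not reflect the SeamlessM4T API.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class Task:
    name: str
    source: str       # input modality: "speech" or "text"
    target: str       # output modality: "speech" or "text"
    translates: bool  # False for ASR, which transcribes in the source language


TASKS: Dict[str, Task] = {
    "S2ST": Task("Speech-to-speech translation", "speech", "speech", True),
    "S2TT": Task("Speech-to-text translation", "speech", "text", True),
    "T2ST": Task("Text-to-speech translation", "text", "speech", True),
    "T2TT": Task("Text-to-text translation", "text", "text", True),
    "ASR": Task("Automatic speech recognition", "speech", "text", False),
}


def tasks_for(source: str, target: str) -> List[str]:
    """Return the task codes that take `source` input and produce `target` output."""
    return sorted(
        code for code, t in TASKS.items()
        if t.source == source and t.target == target
    )


speech_to_text = tasks_for("speech", "text")
```

Note that speech input with text output covers two tasks (ASR and S2TT), which differ only in whether the output language changes.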