Antoine Chaffin Profile picture
27, French CS Engineer 💻, PhD in ML 🎓🤖 — Guiding generative models for better synthetic data and building multimodal representations @LightOnIO — 🇫🇷🇬🇧
May 22 12 tweets 4 min read
Agentic RAG is the✨hot✨new✨thing
That is why I am thrilled to announce that @LightOnIO is releasing Reason-ModernColBERT, reaching the top of the popular BRIGHT benchmark and outperforming models more than 45 times its size on reasoning-intensive retrieval!Image Model:
So what is reasoning-intensive retrieval?
As explained in the BRIGHT paper, most existing retrieval tasks leverage lexical or semantic-based retrieval, but in practice, some tasks can require reasoning to determine what the target relevancy is huggingface.co/lightonai/Reas…Image
Apr 30 10 tweets 3 min read
Among all those LLM releases, here is an important retrieval release:
To overcome limitations of awesome ModernBERT-based dense models, today @LightOnIO is releasing GTE-ModernColBERT, the very first state-of-the-art late-interaction (multi-vectors) model trained using PyLate🚀Image Model link: huggingface.co/lightonai/GTE-…
GTE-ModernColBERT is trained on top of the GTE-ColBERT model using knowledge distillation on the MS MARCO dataset and is the first SOTA model trained using PyLate!
Get started with PyLate using the documentation:
lightonai.github.io/pylate/