AI Developer Experience @GoogleDeepMind | prev: Tech Lead at @huggingface, AWS ML Hero 🤗 Sharing my own views and AI News 🧑🏻💻 https://t.co/7IosdlNz22
Dec 11, 2024 • 4 tweets • 2 min read
Here is an 80-line Python script showing how to use the @GoogleDeepMind Gemini 2.0 Flash Live API for real-time conversations. 🗣️
1. Copy the code from the gist 2. "pip install pyaudio websockets" 3. "GEMINI_API_KEY={TOKEN} python live_test.py" 4. Talk to Gemini, it is fun!
SimpleGeminiVoice Client gist.github.com/philschmid/cb8…
Nov 1, 2023 • 6 tweets • 2 min read
How can we teach LLMs to be factual, correct, and more reliable? 🤔
RAG is one approach to adding information to the prompt. But always retrieving can lead to bad responses 😔
Self-RAG proposes a new method to teach LLMs when to retrieve information and how to use it.🤯
🧶
🛠 𝗜𝗺𝗽𝗹𝗲𝗺𝗲𝗻𝘁𝗮𝘁𝗶𝗼𝗻
1️⃣ Create a “critique” dataset that labels when to use retrieval and what is relevant, using, e.g., GPT-4
2️⃣ Train a critique model on the synthetic dataset
3️⃣ Use prompts, the critique model, and a retriever to generate the RAG dataset offline
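Step 3️⃣ can be sketched roughly like this. It is a toy illustration, not the Self-RAG code: the critic and retriever are hypothetical keyword-based stand-ins for the trained critique model and a real search index, and the tag names are only illustrative.

```python
# Toy corpus standing in for a real retrieval index (assumption).
CORPUS = [
    "Llama 2 was released by Meta in July 2023.",
    "Amazon Bedrock became generally available in September 2023.",
]

def critic_needs_retrieval(prompt: str) -> bool:
    """Hypothetical critic: request retrieval only for factual-looking prompts."""
    return any(word in prompt.lower() for word in ("when", "who", "where"))

def retrieve(prompt: str, k: int = 1) -> list[str]:
    """Hypothetical retriever: rank passages by word overlap with the prompt."""
    words = set(prompt.lower().split())
    ranked = sorted(CORPUS, key=lambda p: -len(words & set(p.lower().split())))
    return ranked[:k]

def critic_is_relevant(prompt: str, passage: str) -> bool:
    """Hypothetical critic: a passage is relevant if it shares 2+ words."""
    shared = set(prompt.lower().split()) & set(passage.lower().split())
    return len(shared) >= 2

def build_example(prompt: str, answer: str) -> str:
    """Build one offline training example with critique tags inserted."""
    if not critic_needs_retrieval(prompt):
        return f"{prompt} [No Retrieval] {answer}"
    passage = retrieve(prompt)[0]
    tag = "[Relevant]" if critic_is_relevant(prompt, passage) else "[Irrelevant]"
    return f"{prompt} [Retrieval] <p>{passage}</p> {tag} {answer}"

if __name__ == "__main__":
    print(build_example("When was Llama 2 released?", "In July 2023."))
    print(build_example("Write a haiku about spring", "Blossoms in the rain"))
```

The generator LLM is then trained on examples like these, so it learns to emit the retrieval and critique tags itself.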
Sep 28, 2023 • 9 tweets • 3 min read
Amazon Bedrock is now generally available. Below is a thread with my findings from looking at the published sources and trying it out.
Let's start with a highlight. 🏞
🧶
Quotas for the different models are not adjustable at the moment.
Jul 24, 2023 • 7 tweets • 2 min read
Is Llama 2 special or just a better iteration of Llama 1? 🤔 Over the weekend, I had time to read the paper that Meta released. 📖
Below are some of my findings, which you might have missed📝
🧵 1/6
🧠 A 34B version may come later after more testing
⚖️ The 7B model used a 285x token to parameter ratio, with loss still decreasing.
💰 Training the 7B would cost ~$1M in AWS compute ($5 per A100 GPU-hour, AWS on-demand)
🛫 Llama Chat was started before Llama 2 finished training
🧵2/6
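A quick back-of-the-envelope check of the ratio and cost numbers above. The 2T pretraining tokens and 184,320 A100 GPU-hours for the 7B model are taken from the Llama 2 paper as I recall it (they are not stated in the thread, so treat them as assumptions); the $5 per GPU-hour is the price used above.

```python
# Inputs (the first two are assumptions taken from the Llama 2 paper).
params = 7e9              # 7B parameters
tokens = 2e12             # assumed: 2T pretraining tokens
gpu_hours = 184_320       # assumed: A100 GPU-hours reported for the 7B model
usd_per_gpu_hour = 5.0    # on-demand price used in the thread

# Token-to-parameter ratio and estimated training cost.
ratio = tokens / params
cost_usd = gpu_hours * usd_per_gpu_hour

if __name__ == "__main__":
    print(f"tokens per parameter: {int(ratio)}x")
    print(f"estimated cost: ${cost_usd:,.0f}")
```

With these inputs the ratio comes out at roughly 285x and the cost at a bit over $0.9M, in line with the ~$1M estimate.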
Jun 19, 2023 • 4 tweets • 3 min read
OpenLLaMA 13B was released and is competitive with its original counterpart from Meta AI. 🚀🎉 Two months ago, the OpenLM research initiative started to create a permissively licensed open-source reproduction of Meta AI’s LLaMA! 🛫
👉 huggingface.co/openlm-researc…
🧵 1/4
Last week the team released the 13B weights under Apache 2.0 with evaluations on the lm-evaluation-harness by EleutherAI🔓
OpenLLaMA matches @Meta LLaMA with an avg score of 0.57, making it a perfect replacement for all your commercial use cases🥊
Finally had the time to read the paper “The False Promise of Imitating Proprietary LLMs” in detail. 📚✨ Below are some of my key takeaways: 📝
🔍 Objective:
- The paper aimed to evaluate the effectiveness of models trained on GPT outputs.
🧵 1/4
💻 Implementation:
- Collected datasets imitating ChatGPT for specific tasks or broadly imitating its behavior (0.3M–150M tokens).
- Fine-tuned LLMs (GPT-2 and LLaMA) on these datasets.
- Evaluated with humans and GPT-4 (blind pairwise comparisons with ChatGPT) and on canonical NLP benchmarks.
🧵 2/4
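The blind pairwise comparison can be sketched as below. This is a minimal illustration and not the paper's evaluation code: the function names and the length-based judge are hypothetical, standing in for a human or GPT-4 rater.

```python
import random

def blind_pairwise_win_rate(pairs, judge, seed=0):
    """Return the fraction of comparisons won by the imitation model.

    pairs: list of (imitation_output, reference_output) strings.
    judge(a, b): returns 0 if it prefers a, 1 if it prefers b.
    The display order is shuffled per pair, so the judge cannot
    tell which system produced which output.
    """
    rng = random.Random(seed)
    wins = 0
    for imitation, reference in pairs:
        swapped = rng.random() < 0.5
        a, b = (reference, imitation) if swapped else (imitation, reference)
        choice = judge(a, b)
        imitation_slot = 1 if swapped else 0
        wins += int(choice == imitation_slot)
    return wins / len(pairs)

def longer_is_better(a: str, b: str) -> int:
    """Hypothetical toy judge: prefer the longer answer."""
    return 0 if len(a) >= len(b) else 1

if __name__ == "__main__":
    pairs = [
        ("short", "a much longer answer"),
        ("a very detailed reply", "ok"),
    ]
    print(blind_pairwise_win_rate(pairs, longer_is_better))
```

Shuffling the order per pair is what makes the comparison blind: the judge's position bias cannot systematically favor one system.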
Jun 8, 2023 • 4 tweets • 2 min read
Introducing StarChat Beta β 🤖 Your new coding buddy 🙌 Attention all coders and developers 💻
You can write in plain English, and it will understand your queries, offer explanations, and provide step-by-step guidance to solve coding problems 🤯
👉 huggingface.co/spaces/Hugging…
🧵1/4
StarChat can help you:
🙋🏻♂️ Answer coding questions in over 80 languages, including Python, Java, C++ and more!
🧠 Explain concepts and help debug your code
📊 Generate sample code for data visualizations and plots in Python
💬 Iterate together to solve your coding errors