Latest Twitter Threads by @Yampeleg on Thread Reader App

Jul 3 • 14 tweets • 3 min read

Now the Iranians are asking us to believe that they dug a nuclear bunker 90 m deep inside a mountain

- With an investment of $2 billion (0.5% of GDP)
- Over a decade
- 3 armored floors
- Internal steel blast doors
- Service elevators
- Air-filtration systems

And all of that

just so that a minute before a strike they could take the uranium out for a walk in the sun, because they had "i-n-t-e-l-l-i-g-e-n-c-e from the Pentagon"

They managed to outwit the West and hide enough uranium for ten (!!) atomic bombs.

Iran "beat America".
Nothing to be done.

[But not atomic bombs of their own.
They have no military nuclear program at all.
It's all civilian nuclear.
The civilians love electricity.
You need a lot of electricity. Tons.]

Jun 21 • 8 tweets • 2 min read

It kills me that there are people here who watch Israel

1. Make Hezbollah pay for 5,000 bombs that look like pagers and blow up in their underwear

2. Con Hezbollah's chief of staff, who is cheating on his wife with 4 different women, over the phone into "popping up to the apartment for a short break..", and bomb his house.

3. Make a random bed in Iran explode the second Ismail Haniyeh lies down on it to sleep, somehow.

4. Bomb, one by one, 50 people who cover almost everyone in Hamas's military wing
(with a few more on the way)

5. And go systematically through an ordered list of names of 4,000 murderers from October 7th and assassinate a-l-l of them.

Aug 21, 2024 • 12 tweets • 4 min read

I collected a long list of the latest AI Coding tools.

All the tools are from Reddit. I picked only the ones I saw people talking about recently and being happy with.
(~2 weeks)

I am going to check them all soon enough.
Here they are so you can also try.

(Let's talk about them if you try!)

But.. why..?

The way I see it:

LLMs are great for code completion
(co-pilot)

But not so great when writing code on their own
(chat-gpt)

I am skeptical about chat LLMs producing value for experienced devs.

But I want to be proven wrong!
(take my job already)

Aug 11, 2024 • 19 tweets • 5 min read

I binge-read most of deepseek's papers this weekend.

Here are my notes
(focusing mostly on data)

The papers:
• DeepSeek: arxiv.org/pdf/2401.02954
• DeepSeek-V2: arxiv.org/pdf/2405.04434
• DeepSeekMath: arxiv.org/pdf/2402.03300
• DeepseekCoder: arxiv.org/pdf/2401.14196
• DeepSeek-Coder-V2: arxiv.org/pdf/2406.11931

What's the deal with DeepSeekCoder V2?

First time that an open source model reaches this level.
Outperformed state of the art coding models.

Released in June so it is already widely tested by everyone around the world.

The results are real, this model is among the best!

Aug 8, 2024 • 15 tweets • 5 min read

My final guess about @iruletheworldmo before they tell us:

- It is a model.
- Fetches data from X to its context & interacts with us
- The DMs (at least some) are human
- Has some internal agent loop
- NOT trained on X
- Uncensored
- VERY impressive.

(I might be VERY wrong ! !)

For those that don't know what this is all about:

Published 3 weeks ago:

Project "Strawberry"

An internal reasoning effort at OpenAI as part of the next model's development.

--

Some link this to Q* but this is not confirmed.

[Q* is "what Ilya saw" in last year's drama..]

Aug 7, 2024 • 19 tweets • 4 min read

I'm not sure everyone understands the weight of this.

Imagine that you single-handedly made a discovery that doesn't just change the whole world..

It turns out that..
the discovery wipes out your entire field of work.

And then it turns out that..
it also wipes out all the adjacent fields.

And then it turns out that..
there are no more discoveries left to make after it.

And you do all of this 6 times.

Attention.

> I have so, so much to tell you about this!

Let me start rambling:

Question:
How does ChatGPT know how to speak human language?

Answer:
It learned on its own from examples of "what text written by humans looks like"

This answer sounds very nice, but it tells most people nothing.

So here is some light background:

Jul 23, 2024 • 12 tweets • 2 min read

A turning point in artificial intelligence is expected today.

Summary:
Meta is expected to release its new flagship model to the world today: LLaMA-3.1
One thing led to another.. and it has already leaked..

After a long night I can tell you what it's like:
It is most likely the strongest model in the world.

These are only preliminary results, but if things stay this way the whole world is about to change (no exaggeration)

>> What is unique about this model is that it is an open model.

Until now, the best models were all commercial models.

Models operated by large organizations that give users access through user interfaces (like ChatGPT) or programmatic interfaces (API).

But this model you can download to your own computer.

Mar 10, 2024 • 10 tweets • 2 min read

I think Claude 3 crossed (or closely approached) an interesting threshold:

The "power users" threshold.

For the first time, a model is able to help power users with heavy, complex tasks faster than they could do them themselves.

This is a controversial topic in AI,
Let me try to explain:

I never use GPT-4 for code.

I use it for:
1. Brainstorming ideas.
2. Learning new topics that I don't know about.
3. Reading long texts instead of me (ask-your-pdf)
4. Just easy tasks.

But never for code.
It never helped me with coding.

Jan 24, 2024 • 11 tweets • 3 min read

Gazans going crazy when realizing there are Muslims in the IDF

A Thread 🧵👇

"Are these Muslims like us or what is their religion exactly?"

Oct 22, 2023 • 7 tweets • 2 min read

First week in the army.

Some thoughts:

The IDF is unbelievably different now than what it was during my (5yr) service.

Unrecognisable.

No clutter.

No bloat.

No BS procedures.

Everything (and everyone) is insanely efficient and highly motivated. Young soldiers are not complaining and oftentimes also overdo their roles voluntarily.

More reservists keep showing up.

(Many of whom volunteered and were not supposed to attend otherwise)

Civilians show up every couple of hours with food, clothing and supplies.

Jul 16, 2023 • 8 tweets • 4 min read

The strongest model you didn't know about:

- The best model.
- Trained on the best dataset.

OpenChat V2 x OpenOrca

OpenChat V2, a leading open-source model, was fine-tuned last night on Open-Orca's data.

Link: https://t.co/4hiMTqXUOJ https://t.co/jG9Abqeos0 huggingface.co/openchat/openc…
twitter.com/i/web/status/1…

OpenChat?

A series of powerful open source models leading multiple leaderboards.

The team's latest model, OpenChat V2 (weighted), is currently leading 4 different benchmarks.

Github: github.com/imoneoi/opench…
Model: huggingface.co/openchat/openc…
https://t.co/Fv0793EEQS

Jul 9, 2023 • 8 tweets • 2 min read

Reverse engineering Code-Interpreter

Last night I participated in an interesting twitter space: Reverse Engineering the code-interpreter

Here are some of the most interesting bits: https://t.co/pUudeV5uuX twitter.com/i/web/status/1…

Since the new model is able to run code, many people have already managed to ask it nicely for all the information it has inside the virtual machine it runs on.

Jun 21, 2023 • 8 tweets • 2 min read

I think I get it now. Let's try something out:
.
Comment on this thread with everything you "just guess" about GPT-4 👇👇

Guess: In the dataset: they went through every undergrad major and included exams, tests & textbooks of the subject. [to create a "wow" effect for every educated person no matter the field]

Jun 21, 2023 • 4 tweets • 1 min read

Ladies and Gentlemen. GPT-4 👇

*Ported from HF's code

Jun 20, 2023 • 4 tweets • 2 min read

B R E A K I N G! @realGeorgeHotz Just said:
- GPT-4 has 220B parameters.
- GPT-4 is a mixture of experts with 8 experts.
- GPT-4 is doing 16 times inference (did u mean beams? or just 2 beams per model?)

@realGeorgeHotz It's HUGE!! Can you confirm??
src: latent.space/p/geohot twitter.com/i/web/status/1…

@abacaj look what you have done! I had work to do!

Jun 20, 2023 • 7 tweets • 3 min read

The missing pieces of GPT-4 (Hebrew) 👇
.
Tricks nobody talks about and the future of open language models.
***
> TL;DR: Learning from non-human feedback (!!)
.
A few days ago I gave an interesting talk at the #GenML conference under the same name ("The Missing Pieces of GPT-4").
While we are all waiting… twitter.com/i/web/status/1…

Part 2: The impact of LLaMA
----------
You already know about LLaMA.
A powerful language model from Meta, released as open source this past February.
The model is at the center of a global effort to openly replicate the capabilities of commercial language models,
and so give millions of people all over the world access to this technology.
There are tens of thousands of people… twitter.com/i/web/status/1…

May 2, 2022 • 24 tweets • 5 min read

How to trick AUC into scoring your stupid models higher 👌

👇A - Thread - about gaming the AUC metric!👇

There is a built-in "flaw" with the AUC metric. Today we are going to take it for a spin!

One of the questions coming up as we get stuck on improving individual models further is how to best combine multiple models.

Apart from the simple baselines: Avg, Median or Avg-Median-Rocket-Science-Blending[1].
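
The thread is cut off here, so its actual trick is not shown. As a rough illustration of why AUC can be "gamed" at all, here is a minimal sketch of the Avg and Median baselines plus the well-known property that AUC depends only on how the scores are ordered, not on their values. The `auc` and `blend` helpers and the toy labels and scores are hypothetical, made up for this example, and are not taken from the thread.

```python
from statistics import median


def auc(labels, scores):
    """ROC AUC as the share of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative label")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def blend(score_lists, how="avg"):
    """Combine per-model scores sample by sample with a mean or a median."""
    combine = median if how == "median" else (lambda xs: sum(xs) / len(xs))
    return [combine(column) for column in zip(*score_lists)]


# Hypothetical toy data: six samples scored by two imaginary models.
labels = [0, 0, 1, 1, 0, 1]
model_a = [0.10, 0.40, 0.35, 0.80, 0.20, 0.70]
model_b = [0.30, 0.20, 0.60, 0.55, 0.50, 0.90]

# Cubing is a monotone transform on positive scores: every value changes,
# the ordering does not, so AUC is identical.
cubed_a = [s ** 3 for s in model_a]
assert auc(labels, cubed_a) == auc(labels, model_a)

# Simple baseline blends of the two models.
avg_blend = blend([model_a, model_b], how="avg")
median_blend = blend([model_a, model_b], how="median")
print(auc(labels, model_a), auc(labels, avg_blend), auc(labels, median_blend))
```

Because only the ordering matters, any rescaling that keeps a model's ranking intact leaves its own AUC unchanged, yet it can still change how much weight that model carries inside an average. That is presumably the kind of lever a blending trick would pull, though the excerpt does not confirm which one the thread uses.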
