Recently, @awnihannun asserted that 'According to benchmarks Qwen3.5 4B is as good as GPT 4o.' This drew controversy: Is the 4B just benchmaxxed? How could a 4B be as good as GPT-4o? I tried to test this scientifically. The answer to the question is likely: yes, in most cases.
To test this, I wanted a set of 'in the wild' prompts that would reflect real world usage and not narrow code/STEM tasks - so I went to WildChat (the classic repo for this), grabbed one of the training parquet files, and chose 1000 random deduped prompts. I then ran these prompts through GPT-4o and Qwen3.5 4B at recommended sampling settings.
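The "1000 random deduped prompts" step could look roughly like the sketch below. It is not the author's actual script: the function name `sample_deduped_prompts`, the normalization rule (collapse whitespace, lowercase) and the fixed seed are all assumptions made for illustration, and loading the prompts out of the parquet file is left out.

```python
import random


def sample_deduped_prompts(prompts, k=1000, seed=0):
    """Return up to k unique prompts chosen uniformly at random.

    Hypothetical helper: two prompts count as duplicates when they match
    after collapsing whitespace and lowercasing (an assumed rule).
    """
    seen = set()
    unique = []
    for prompt in prompts:
        key = " ".join(prompt.split()).lower()
        if key and key not in seen:  # skip empty prompts and repeats
            seen.add(key)
            unique.append(prompt)  # keep the first spelling encountered
    rng = random.Random(seed)  # fixed seed so the sample is reproducible
    return rng.sample(unique, min(k, len(unique)))
```

Deduplicating before sampling, rather than after, keeps a prompt that appears many times in the file from being more likely to land in the sample.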
For each answer pair, I then had Claude Opus 4.6 judge which answer was better. Of course, this comes with the normal issues of using an LLM judge - but since Clopus 4.6 is far stronger than both models, was given time to think before answering, and is 2nd on the Judgemark leaderboard, I felt comfy using it here.
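Once the judge has returned a verdict per answer pair, the tally is simple arithmetic. Here is a minimal sketch, assuming each verdict is recorded as the string "A", "B" or "tie" (labels I made up; the thread does not say how verdicts were stored). The Wilson score interval is my addition to show how much uncertainty 1000 comparisons leave, not something the thread reports.

```python
import math
from collections import Counter


def summarize_verdicts(verdicts, z=1.96):
    """Tally pairwise judge verdicts and estimate model A's win rate.

    Ties are counted but excluded from the win rate. The interval is a
    Wilson score interval at the confidence level implied by z
    (1.96 is roughly 95%).
    """
    counts = Counter(verdicts)
    wins_a, wins_b = counts["A"], counts["B"]
    decided = wins_a + wins_b
    if decided == 0:
        raise ValueError("no decided comparisons to summarize")
    p = wins_a / decided
    denom = 1 + z * z / decided
    center = (p + z * z / (2 * decided)) / denom
    half = z * math.sqrt(p * (1 - p) / decided + z * z / (4 * decided ** 2)) / denom
    return {
        "wins_a": wins_a,
        "wins_b": wins_b,
        "ties": counts["tie"],
        "win_rate_a": p,
        "ci": (center - half, center + half),
    }
```

If the interval for the 4B's win rate comfortably includes 0.5, the data is consistent with the two models being roughly level on these prompts.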
1/4 Amid war, destruction, and displacement, people's need for telecommunications and the internet grows: to follow events, monitor warnings and alerts, manage moves out of targeted areas, or check on family and friends… But in Lebanon's case, where the infrastructure was already dilapidated and the state treats telecommunications as a revenue-collection channel rather than a public service, this need becomes an additional burden that drains residents, especially the displaced among them, with no sign of any move by the state to ease that burden and make things easier for people.
So what must the government do without delay?
🚨BREAKING: Claude can now build you a full AI video channel from scratch like a $10K creator consultant (for free).
Here are 10 prompts that take you from zero to a monetized AI video channel in 90 days: (Save for later)
1/ The Niche & Visual Style Finder
You are an AI video strategist who has launched 50+ faceless YouTube channels. Rank the top 10 faceless AI video niches by CPM, audience size, and ease of production. For each, recommend the best AI visual style. Then identify 3 competitor channels per niche with their stats and gaps I could own. End with a one-sentence positioning statement for my top pick.
My interests: [YOUR TOPICS]
AI video is one of the best (and fastest) ways to build a real income stream as a creator 🎬
Let's see how I can explain this so it's clearly understood. I started teaching 31 years ago, the very year the LOGSE came in. Compulsory education was being extended from age fourteen to sixteen. Pupils aged 12 to fourteen left primary schools and moved to secondary schools. 👇
Since education was compulsory from 12 to 16, pupils had to stay in the schoolyard and could not go out into the street. Until then, no secondary school teacher had ever had to do yard duty. 👇
For the same reason, teachers who had been in the profession for years (not my case) had to start covering for absent teachers. Until then, if a teacher didn't come in, pupils were let out into the street for that hour.👇
Right, I'm going to explain how to download films and series, while taking advantage of the fall of the YggTorrent site (since the alternative sites have gone freeleech to draw users in).
Here we go, let me explain.
1/
First of all, it's illegal. Let's be clear on that point. Downloading copyrighted works is wrong, and if you do it, you do it knowingly. Legal offerings exist (Netflix, Prime, Disney, etc.). But I'll grant you, some old stuff can't be found on them.
2/
That said, the go-to site was YggTorrent, and it had become a money machine by fleecing those who didn't use the method I'm going to explain here.
The fall of YggTorrent is explained here if you're interested:
Best week of the whole year lads - I cannot wait 😍
If I wasn’t in Rome for a long weekend for the rugby, I would’ve posted this over the weekend but it’s worked in my favour to be honest now that declarations are out and we can see who’s riding, what the weights are and what the markets are suggesting!
Let me know if you’re with me or against me on any - and if I’ve missed anything worth mentioning relating to any of the races then I’d love to hear it!
Starting with Tuesday:
13:20)
The (Michael O’Sullivan) Supreme
Grade 1
2mile Novice Hurdle (1st season hurdlers)
Douvan (2015), Constitution Hill (2022) and Kopek Des Bordes (2025) are the only 3 favourites to have won this in the last 11 years.
Old Park Star is currently favourite at 5/2. He was nothing special in bumpers (comfortably beaten in each of the 3 runs last season) but my god has he been good over hurdles.
1st hurdle race) Beat Un Sens A La Vie by 3L on hurdle debut, who has since won an easy C4… but has been absolutely pumped by Tutti Quanti by 33L. Does that give the form a big knock? Maybe. Fortune Timmy was 3rd to OPS and has also since won an easy C4… before being beaten at Cheltenham on Trials Day. Another knock? Maybe. Wandering Ego was a 5L 4th to OPS… and has since been beaten by 10L in a C3 and by 2L in a C4. Another knock? Maybe. They are adding up a fair bit.
2nd hurdle race) Won over C&D by 12 lengths from Glance At Midnight (who has since been pumped by a further 15L by Idaho Sun). Another knock? Maybe not, because in this race was Kingston Queen (who has won a C1 over 2m3f on heavy since). But Lisbane Park, who was behind, has since lost in a C3. Maybe I am being too analytical here, judging the form lines that closely.
3rd hurdle race) Absolutely romped home in the Supreme trial at Haydock in January, beating Hurricane Pat (who was on a four-timer after winning a C1 most recently, including spanking Sober Glory). Now that is so impressive. Improving at a rate of knots. Maybe overcomes all of the previous knocks? Hmmm. He seems to be the talk of the town and every man and his dog on here seems to have NAP'd him for the day. Ahhh. My eyes told me yes LTO - the form lines and trends tell me no. Nico in the saddle too... I think I am going to take him on.
Others:
Talk The Talk (9/2) only ran in one bumper but has been good over hurdles for sure. Was very fortunate to beat Ballyfad at DRF and, watching that myself, I’m not sure that he will be winning the Supreme. Did do his best work at the end though so may love the Cheltenham hill, but I'm not 100% sure myself. Strong form lines by beating Ballyfad though (who was 10L ahead of Leader D’Allier the time before). Very hard to dismiss and maybe rightful 2nd favourite but I don't think he would be my choice.
El Cairos (has drifted 4/1 to 15/2 within the last week) has a serious engine on him that’s for sure but wasn’t mesmerising in bumpers and fell when asked the question on hurdle debut. Form lines of his win not that strong either. A big guessing game with him and I am not sure… especially as he only beat Roc Dino by 3L and that one was pumped by Mighty Park by 38L the time before. Hmm. Not for me.
Mighty Park (some money has come - 13/2 in to 5/1 the last week) was UNREAL on his only run. Could he be the one to side with? Maybe. Looks JP's best chance. A guessing game but he was so impressive. That Roc Dino form line is huge. The 64L 4th has won since. The time wasn’t the fastest though… nearly 10 seconds slower than Old Park Star’s runs. I think the sky is the limit with this one though and Willie will have him ready. Would be my main fancy I think.
Sober Glory (11/1) was 18L behind Hurricane Pat, who was 18L behind Old Park Star. Huge swing that. He has won his other 5 races though. Maybe that was just an off day? I am sure he will have fans but not for me.
MyDaddyPaddy (12/1 in to 7/1 the last week) was the favourite for this before losing to Idaho Sun. Didn’t beat much before either. Happy to ignore this one.
Leader D’Allier (14/1 in to 10/1 the last week) looked good LTO when beating nothing but was beaten 10L by Ballyfad the time before and I can't put him ahead of Talk The Talk now after the DRF run.
Maybe lots of pointless notes on each. Does anyone have things they would disagree with or could say to add on to any of the notes?
So many questions surrounding them all.
Mighty Park for me. Looked unreal on his only run. Form lines strong too. Lots saying he won’t win? I think he might, you know. I think he beats OPS into 2nd... and I think El Cairos and Talk The Talk will be chasing them both home.
At 5/1 - I'll be having an each way single on him and in a few multis too.
Anyone agreeing with me here - or is everyone on Old Park Star?
Be honest and sincere, are you doing it for the sake of Allah?
It's a noble endeavour & you have to question whether you're looking to memorise parrot-fashion, where your life doesn't reflect the Quran - or if it's your truth.
I used the Indo-Pak script exclusively.
Consistency in script matters more than people realise; your brain maps the page visually.
Switching scripts mid-journey is basically changing the map halfway through a trip.
Pick one and stick to it - whichever one you're comfortable with.
1/ Global Situation Update; Iran war D+9, Russo-Ukrainian War D+1,475: The Persian Gulf and Ukrainian theaters are currently defined by unprecedented aerospace saturation, high-intensity ground maneuverability, and massive macroeconomic volatility. A thread on the multi-domain kinetic cascade. 🧵 #USIranWar #Epicfury #LionsRoar #TruePromise4 #UkraineWar
2a/ 🇺🇸🇮🇱 Persian Gulf Theater of Operations: The US-Israeli campaign (Epic Fury/Lion's Roar) has entered a grinding, multi-domain attrition phase. The IDF reports launching over 1,600 strike sorties since the operation's inception. Despite this intense bombardment, the Iranian regime's command structure has consolidated.
2b/ 🇺🇸To maintain this unrelenting operational tempo and secure the surrounding maritime corridors, the United States established a robust three-carrier posture: the USS Gerald R. Ford (CVN-78) positioned in the Eastern Mediterranean, the USS Abraham Lincoln (CVN-72) operating in the Arabian Sea, and the USS George Washington (CVN-73) forward-deployed at Yokosuka to maintain deterrence in the Pacific theater. The USS George H.W. Bush (CVN-77) Carrier Strike Group is preparing to deploy to support Operation Epic Fury, likely joining the USS Gerald R. Ford in the Eastern Mediterranean.
@NYCMayor He was the leader of an organization guilty of trespass, vandalism, and holding two Columbia employees hostage, as well as more generally terrorizing Jewish students. Needless to say, trespass, vandalism, and hostage-taking are not protected by the First Amendment.
Some links: At Columbia, an official task force found Jewish students reported not only verbal harassment and ostracism but also being physically targeted and feeling unsafe in the dorms. One campus rabbi advised Jewish students to leave campus for their safety. columbia.edu/content/report…
@NYCMayor Trespass, vandalism, hostage-taking by Columbia University Apartheid Divest, the organization in which he was a leader. nytimes.com/2024/05/08/nyr…
Petitioner: Yes myself. I can deposit my phone here.
CJI: What is your background?
Petitioner: 12th pass
CJI: Which school?
Petitioner: Sanatan Dharm school, Ludhiana
CJI: I will arrange an English exam here in court. If you score 30 marks, I will see it then.
Petitioner: yes yes I can
CJI: either you tell the truth or we impose huge costs and order probe
Petitioner: You can see my phone.
CJI: You have written "fiduciary risk to corporate donors" etc. What does it mean?
Adv: I can refer to the plea
CJI: I am asking last time which lawyer drafted it. You have not done it.
Adv: I have searched AI Tools. I also gifted 4 jackets to a typist.. and he charged 1000 per hour for typing.. Das sir.
CJI: Supreme Court typist made the petition. Call the typist here.
CJI: Looks like the petitioner has lent his shoulders to someone who has drafted a vague, wild petition. The tone, tenor and so-called constitutional principle sought to be raised cannot be the brainchild of the petitioner, who is a small-time trader. We however do not order a roving inquiry for such a frivolous plea, with a stern warning not to file such petitions in the near future.
CJI: Go and make some more sweaters and sell them. If you keep filing PILs like this, you will end up having to pay costs.
Raajmarg Infra Investment Trust wants to raise ₹6,000 crore by selling you the right to collect tolls on five highway stretches across 260 kilometres. This becomes only the 7th InvIT to hit public markets since 2014, but most investors still don't understand what they're actually buying. Let's find out in this thread below.
InvITs are REITs' less-famous cousins. Same trust structure, but instead of office buildings and rent cheques, you're paying for infrastructure assets and their cash flows. Highways, power lines, telecom fibre.
You could buy shares of Larsen & Toubro or IRB Infrastructure, but then you're betting on their business: their ability to win contracts, build assets on time, and manage costs while making a profit.
▶️ First Alice Weidel badmouths those present.
Then she starts eating
Bizarre live link:
▶️ During a television broadcast on the election in Baden-Württemberg, the AfD leader chatted away to herself.
Apparently she did not realize that the microphone was switched on.
Weidel was connected from Stuttgart, and stood out above all for her ostentatious snappiness.
Among other things, she claimed not to know the SPD's general secretary ("I have never seen you before"). vm.tiktok.com/ZNRuHwHgk/
▶️ When it was not her turn, she repeatedly conferred with a man in the background, apparently her spokesperson ▶️ Daniel Tapp.
She also ranted about the other politicians.
While the panel debated, a second conversation thus took shape in the background.