Chubby♨️ Profile picture
Sep 18 7 tweets 2 min read Read on X
1/ DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning

This changes everything: Researchers are teaching large language models to develop independent, deep thinking - without humans dictating every single step of the thought process. A game changer for AI performance and autonomy.

Lets break it down, a 🧵Image
2/ Many models are currently only able to perform well in calculations or solve standard tasks when humans guide them step by step (“chain of thought”) or provide examples. This dependency costs time and money and limits what is possible.
3/ DeepSeek-R1 uses *purely* reinforcement learning (RL): the model receives feedback based *solely* on whether the end result is correct - not on how it got there. Analogous to a child who learns through feedback, not by memorizing all the intermediate steps. Image
4/ Why this is groundbreaking:
Because DeepSeek-R1 *automatically* learns advanced thinking strategies: self-checking, strategy adaptation, reflection. In tests such as math Olympiads, it outperforms RL-trained models that have worked with human thought processes. Image
5/ In the next 1-2 years, we could see models that solve complex scientific, technical, and logical tasks almost as well as experts, but much more efficiently. More automation, better assistance - in code, in research, in teaching.
7/ This could change everything, literally

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Chubby♨️

Chubby♨️ Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @kimmonismus

Sep 15
1/ This new mRNA cancer vaccine changes everything: researchers stimulate the immune system so strongly that tumors in mice simply disappear—not in a targeted manner, but universally. This could redefine cancer treatment.

Cancer could very soon be a thing of the past.

Lets break it down - a 🧵Image
2/ Until now: Cancer treatments have often been tailored or aggressive (surgery, chemotherapy, radiation). Immunotherapy only helps if tumors have certain markers — many remain resistant or respond poorly.
3/ The new vaccine works differently: instead of targeting specific proteins, it alerts the immune system to “virus-like attacks” so that T cells are activated. This is done using mRNA, as in COVID vaccines, but not against a virus, rather to trigger an immune response against tumors in general.
Read 7 tweets
Sep 9
I probably sound like a broken record, repeating myself over and over again. But as we have seen recently, Dario Amodei is doing much the same thing: he is issuing urgent, emphatic warnings.

He says he firmly believes that if the development of AI continues as he believes it will, we will see massive upheavals in the labor market and high double-digit losses in the labor market in a few years.

Specifically: legal, finance, and everything that counts as white-collar work.

I've linked a few videos in the comments from people who have created good clips.

But I want to say one thing: take his words seriously. We need to have the discussion now about what our economy should look like in a few years!
Read 9 tweets
Aug 5
1/ The first big announcement: Google DeepMind has unveiled Genie 3
A new frontier for world models, the first AI that generates interactive 3D worlds in real time at 24 fps.

The breakthrough: minutes of consistency at 720p using only text descriptions. No pre-built 3D models required.
Let's break it down - with some examples:
2/ How does it work? Genie 3 creates worlds frame by frame, remembering up to a minute of past scenes. You enter a text prompt, navigate in real time, and the AI dynamically adapts the world to your movements.
3/ Particularly impressive: “Promptable World Events.” While navigating, you can change the weather, introduce new objects, or spawn characters—all via text commands. This opens up completely new possibilities for training AI agents.
Read 7 tweets
Aug 2
With “Wide Research,” @ManusAI_HQ is bringing about a fundamental change in agentic work.

I instructed the tool to scan 100 scientific articles – including titles, abstracts, authors, journals, year, number of citations, institution, and country. Everything was automatically written into a table, sorted by relevance and impact.

The result: an immediately usable overview of the global research landscape. Ideal for science journalism, strategies in health tech – or simply to have more informed discussions.

Wide Research is rethinking research:

– Parallel scanning instead of individual queries
– Data directly usable for Notion, Excel, or video scripts
– Saves time, reduces costs – without compromising on depth

Here is the prompt:
Use the Wide Research function.

Objective: Conduct a meta-analysis of 100 scientific articles on the topic of “Artificial Intelligence in Medicine.”

For each article, please record the following information:

1. Title of the article

2. Short abstract (2–3 sentences)

3. Name of the author(s) (first name + last name)

4. Year of publication

5. Name of the journal or source

6. Number of citations (if available)

7. Institution or university (if available)

8. Country or region of the main author

9. Link to the original source or DOI

Compile the results in a structured table. Sort by relevance and citation frequency. At the end, provide a brief overview by country/region (e.g., US, China, Germany) of how many studies come from each region.

Target audience: Readers with an interest in science who want to quickly gain an overview of research activities in the field of AI and medicine.
● Large-scale parallel research: Research 100–500 items in a single task.
● Structured real-time output: Get clean tables that are ready to use in scripts, Excel, presentations, or Notion.
● Save hours of time: What used to take days now takes just minutes.
● Incredibly cost-effective: Extensive research at a fraction of the manual cost.
Read 4 tweets
Jun 30
1. Microsoft presents medical superintelligence

Microsoft's LLM is not only designed for multiple-choice questions, but also for real medical diagnoses in realistic scenarios – and outperforms even top models such as o3.

This is a bigger breakthrough than much of what we are currently seeing. Let's take a look:Image
2. Microsoft has taken a decisive step toward medical superintelligence with the MAI Diagnostic Orchestrator, a system that has the potential to fundamentally change everyday medical care. At its core is an orchestrated collaboration between several highly developed AI models such as GPT-4, Gemini, Claude, and others. Instead of relying on a single model, these agents work together like a digital team of experts – discussing, analyzing, and weighing diagnoses in a structured “chain of debate” process. This approach is novel because it replicates complex decision-making processes that were previously the preserve of human medical teams.Image
3. In a large-scale study with over 300 case studies from the New England Journal of Medicine, the system achieved a diagnostic accuracy of over 80%. This is not only four times higher than the participating doctors, but also marks a qualitative leap: the AI was not only more accurate, but also made more economical decisions – with around 20% lower costs because it avoided unnecessary tests.Image
Read 7 tweets
Jun 29
1. Compute is the new gold.

The current breakdown is as follows:

US companies (Amazon, Google, Microsoft)8
~63 %

China (Tencent, Alibaba, Huawei)3
~28 %

Europe (e.g. OVH, Hetzner, Exoscale)6
~4 %

Rest of the world
~6
~5 %

What does it mean? Lets have a look: Image
2. A new digital divide is emerging: Only 32 countries have specialized AI data centers - mainly in the US, China and Europe. Africa, South America and large parts of Asia have little or no access to this infrastructure. This leads to a dramatic imbalance in research, economic development and technological progress. Countries without data centers have to rent expensive resources, are slower and dependent on tech giants from abroad.
3. What is particularly worth emphasizing is that companies such as OpenAI, Microsoft and Google own almost all global AI centers - with billions invested that many countries simply cannot afford. The chips for this come almost exclusively from Nvidia, which is exploited geopolitically. Even countries such as Kenya, which cooperate politically with the USA, are not given free access to GPUs.
Read 5 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(