How large an emergency fund do I need? Do I have enough time to grab lunch before my next meeting?
We intuitively answer questions like these every day. Renowned physicist Enrico Fermi had a particular knack for them, and they have become well known as Fermi Problems.
1/N
Solving Fermi Problems requires recursive decomposition, science/commonsense reasoning, abstraction, and creativity. The inherent complexity of these problems makes them ideal candidates for #AI reasoning.
2/N
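To make "recursive decomposition" concrete, here is a minimal Python sketch applied to the lunch question from the first tweet. It is only an illustration: the `Estimate` class, the way the question is split up, and every number are assumptions made up for this example, not anything taken from the paper or the datasets.

```python
import math
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Estimate:
    """A quantity that is either a guessed constant or built from sub-estimates."""
    name: str
    value: float = 0.0                                   # used only when there are no parts
    parts: List["Estimate"] = field(default_factory=list)
    combine: Callable[[List[float]], float] = sum        # how sub-estimates are merged

    def solve(self) -> float:
        # Base case: a leaf is just a guessed number.
        if not self.parts:
            return self.value
        # Recursive case: solve each sub-question, then combine the answers.
        return self.combine([p.solve() for p in self.parts])


# Hypothetical decomposition; all minutes below are invented guesses.
lunch = Estimate("minutes needed for lunch", parts=[
    Estimate("walking", combine=math.prod, parts=[
        Estimate("one-way walk (min)", 5),
        Estimate("number of trips", 2),
    ]),
    Estimate("waiting in line (min)", 10),
    Estimate("eating (min)", 15),
])

minutes_until_meeting = 45  # assumed
needed = lunch.solve()
enough_time = needed <= minutes_until_meeting
print(f"Need about {needed} min; enough time: {enough_time}")
```

Each sub-question can itself be split further until it bottoms out in a number you are comfortable guessing, which is the recursive part.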
To spur research in this direction, we created two datasets, realFP and synthFP, which collect real-world and templated Fermi problems, respectively. We found that large-scale LMs perform poorly even after fine-tuning, making estimates that can be off by 2+ orders of magnitude.
3/N
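For a sense of what "off by 2+ orders of magnitude" means, one common way to measure the gap is the absolute base-10 logarithm of the ratio between the estimate and the reference answer. The Python sketch below assumes that measure; the function name and the numbers are made up for illustration, and it is not necessarily the scoring used in the paper.

```python
import math


def orders_of_magnitude_off(estimate: float, reference: float) -> float:
    """Absolute log10 ratio between an estimate and a reference (both must be > 0)."""
    if estimate <= 0 or reference <= 0:
        raise ValueError("both values must be positive")
    return abs(math.log10(estimate / reference))


# Hypothetical example: the reference is one million, the estimate is ten thousand.
gap = orders_of_magnitude_off(1e4, 1e6)
print(f"Estimate is off by about {gap:.1f} orders of magnitude")
```

Under this measure, an estimate 100 times too small or 100 times too large is about two orders of magnitude off in both cases.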
Find more details in our paper and the dataset here: allenai.org/data/fermi.
Finally, if you are attending #EMNLP2021, don’t miss Abhinav Kumar’s talk about this work at 12:45pm PT on Monday, November 8!

4/N
