Sam Rodriques

@SGRodriques

Sep 11, 2024 • 6 tweets • 3 min read • Read on X

Introducing PaperQA2, the first AI agent that conducts entire scientific literature reviews on its own.

PaperQA2 is also the first agent to beat PhD and Postdoc-level biology researchers on multiple literature research tasks, as measured both by accuracy on objective benchmarks and assessments by human experts. We are publishing a paper and open-sourcing the code.

This is the first example of AI agents exceeding human performance on a major portion of scientific research, and will be a game-changer for the way humans interact with the scientific literature.

Paper and code are below, and congratulations in particular to @m_skarlinski, @SamCox822, @jonmlaurent, James Braza, @MichaelaThinks, @mjhammerling, @493Raghava, @andrewwhite01, and others who pulled this off. 1/

PaperQA2 finds and summarizes relevant literature, refines its search parameters based on what it finds, and provides cited, factually grounded answers that are more accurate on average than answers provided by PhD and postdoc-level biologists. When applied to answer highly specific questions, like this one, it obtains SOTA performance on LitQA2, the part of LAB-Bench that focuses on information retrieval. 2/

PaperQA2 can also do broad-based literature reviews. WikiCrow, which is an agent based on PaperQA2, writes Wikipedia-style articles that are significantly more accurate on average than actual human-written articles on Wikipedia, as judged by PhD and postdoc-level biologists. We are using WikiCrow to write updated summaries of all 20,000 genes in the human genome. They are still being written, but in the meantime see a preview: wikicrow.ai 3/

We put a lot of effort into making our open-source version excellent. We put together a new system for building metadata for arbitrary PDFs, full-text search, and more. See it here: github.com/Future-House/p… 4/

Also, see our preprint for details here: paper.wikicrow.ai 5/

And, of course, this was all made possible through the wonderful generosity of Eric and Wendy Schmidt, and all of our other funders, including Open Philanthropy for supporting our work on LitQA2, the NSF National AI Resource program, and others! 6/


