@nlpnoah: NLP research and practice ask fundamentally different questions
/1
@nlpnoah: NLP practice asks whether X improves the outcome. NLP research tries to fill in the gaps in the knowledge map.
/2
@nlpnoah: Leaderboards are the dominant frame for presenting research findings. That frame by its very nature puts the winner at the top and pushes everything else out of focus:
/3
@nlpnoah: let's admit to ourselves that "sota" is a verb. and it should be lowercased.
/4
@nlpnoah: depressingly many parallels between leaderboard-driven research and competitive sports
/5
@nlpnoah: The very notion of "negative results" presupposes that very same sports-like frame! Useful as leaderboards are, they are not the only thing we need.
(with follow-up comment from Margot Mieskes: the Insights workshop should be renamed!)
/6
@nlpnoah: here are some alternative frames that might be useful in NLP research.
/7
@nlpnoah: a bonus from focusing on gaining knowledge and not on sota-ing is improved mental health. If you're trying to answer a question, whatever answer you get is a result.
/8
Question: in an exploding field, isn't the simple leaderboard frame partly a coping mechanism for authors trying to reach a broader audience? @nlpnoah: the NLP community is not all that homogeneous. Let's be brave; non-mainstream papers may find a wider audience than we think.
/9
Question: one reason for leaderboards is to enable people to easily compare with prior work. How about we just publish multiple metrics, for maximal reach to future work? @nlpnoah: Might work. We could also release raw system outputs.
/10
#EMNLP2021 ends, but Insights from Negative Results is coming tomorrow! The workshop is hybrid: virtual posters, talks by/for a mix of on-site & online speakers & attendees. Hosts: @JoaoSedoc @shabnamt1 @arumshisky @annargrs
Really proud of the program this year🧵:
8:45 Opening remarks
9:00 🗣️ Invited talk by Bonnie Webber: The Reviewers & the Reviewed: Institutional Memory & Institutional Incentives
A highlight from the fascinating #EMNLP2021 keynote by @StevenBird:
NLP often comes with a set of assumptions about the needs of communities with low-resource languages. But we need to learn what they *actually* need; they may have a completely different epistemology.
/1
AR: this is such a thought-provoking talk, pointing at the missing bridges between language tech and the social sciences, esp. anthropology. As a computational linguist lucky enough to spend a year at @CPH_SODAS, I still don't think I even see the depth of everything we're missing.
/2
An audience question (@bonadossou from @MasakhaneNLP?): how do we increase the volume of NLP research on low-resource languages when such work is not as incentivized? @StevenBird: keep submitting. I've had many rejections. The ACL2022 theme track will be language diversity.
/3
It can't even work, since peer review is only reliable for the clearly bad papers. Decisions on borderline papers are as good as random. This won't "raise the bar"; it will only reinforce the AC/SAC preferences, and likely improve the chances of preprinted papers by famous people.
/2
TLDR: In its current form, peer review is a poorly defined task with apples-to-oranges comparisons and unrealistic expectations. /1
Reviewers resort to heuristics such as reject-if-not-SOTA to cope with uncertainty, so the only way to change that is to reduce the uncertainty. That is at least partly doable: better paper-reviewer matching, unambiguous evaluation criteria, fine-grained tracks, better review forms, etc. /2
Which criteria and forms, exactly? Each field has to find out for itself, through iterative development and experiments. Except that in NLP such work would be hard to publish, so there are no incentives to do it, and no mechanisms to test and compare any solutions. /3
TLDR for those who missed the prior discussion: non-anonymous preprints systematically disadvantage the unknown labs and/or underrepresented communities.
My previous post: hackingsemantics.xyz/2020/anonymity/ /1
To summarize both posts, the unknown/underrepresented authors face the following trade-off:
* anonymous preprints: better acceptance chances;
* arXiv: lower acceptance chances, but more opportunities to promote unpublished work and get invited for talks and interviews.
/3