David Graus
Jun 2, 2020
#recsys dataset + paper: "MIND contains 1M users, 160k English news articles and 15.7M impression logs. Every news article contains rich textual content including title, abstract, body, category and entities." msnews.github.io
"Each impression log contains the click events, non-clicked events and historical news click behaviors of this user before this impression."

Wow! So much data 😳. Awesome! Hope this sparks some interesting research in news recommendation.
With all that data we could probably have done our #UMAP2020 paper's analysis entirely on a public dataset 😅 graus.nu/publications/b…

More from @dvdgrs

Jun 3, 2020
Good idea: "Blendle wants to start making topic bundles." - @AlexanderNL link.medium.com/7hdEwbVP06
Exactly! MORE NEWS PERSONALISATION. "I'm not saying that the news should only be about topics the reader likes. But I do think that the group that does want to consume news that way should be served too…" - @AlexanderNL link.medium.com/7hdEwbVP06
<end of my liveblog of reading a Medium article>
Mar 27, 2020
Stoked that our paper "Beyond Optimizing for Clicks: Incorporating Editorial Values in News Recommendation" with @dsflu and @anca_dmtrch is accepted as a FULL PAPER at #UMAP2020! I am particularly happy with this publication because... 👇 (1/4)
1️⃣ In our paper we show how you can align algorithm design across stakeholders (data scientists + journalists), by effectively modeling an editorial value (dynamicness) in a news recommender
2️⃣ we present (more) empirical proof that #recsys (can) offer(s) users more diverse, serendipitous, and dynamic articles compared to editorially curated lists, and hence (can) help in avoiding, not creating filter bubbles!
Feb 20, 2020
Wow, @Avaaz, really?

8% and 16% of search results to “climate change” and “global warming” respectively "had misinformation about climate change."

Leads to stating: "YouTube is promoting misinformation about climate change to millions."

secure.avaaz.org/campaign/en/yo…
@Avaaz To be honest, those numbers do not sound odd to me. I'm afraid there's too much climate change junk on @YouTube. Does that mean YouTube should start censoring? Imho: no.
@Avaaz @YouTube It reminds me of a situation we had in NL, where @bol_com allegedly "promoted anti-vaxx books". Why? A query for "vaccination" yielded a bunch of anti-vaxx books. Because pro-vaxxers don't write about the benefits of vaccination, but antivaxxers do... translate.google.com/translate?sl=n…
Feb 18, 2020
Hey guys, how is this quaint little craft project by THE MOST IMPORTANT TECH CRITIC IN THE WORLD coming along? (genuinely curious 👀)
In any case it looks unreadable (the-syllabus.com/goods/thematic…), but I'd love to hear from its steady band of loyal users
I'll start with A Maussian bargain, back in a bit.
Oct 24, 2019
I'm just going to lie down dead on the floor for a moment
So Morozov has taken it upon himself to build a recommender system with his non-technical Belarusian arts-major nephew. With taxonomies and categories. Great, back to the AI of the '80s 😂
Aug 27, 2019
The work of @dietmarjannach has turned out to be an invaluable resource in our past few weeks of testing our @FD_Nieuws #recsys, often providing directly applicable insights and frameworks. E.g.: "Measuring the Business Value of Recommender Systems" arxiv.org/pdf/1908.08328…
@dietmarjannach @FD_Nieuws More examples? OK:
Measuring the impact of online personalisation: Past, present and future (2019): sciencedirect.com/science/articl…
@dietmarjannach @FD_Nieuws Beyond Accuracy: Evaluating Recommender Systems by Coverage and Serendipity (2010): citeseerx.ist.psu.edu/viewdoc/downlo…