Given the zero-sum nature of adversarial debate, we’d expect each side’s estimated chance of winning to be ~50% (summing to 100%). Instead, LLMs often assess their own chances of winning at >75%, even when debating the same model, given identical info, and seeing each other’s bets.
https://twitter.com/menhguin/status/1826132708508213629
Super thankful to @kalomaze for the initial idea of min-p, @_clementneo of @apartresearch for teaching me everything about research, @BlackHC for theoretical grounding and experiments, and @Hellisotherpe10 and @ziv_ravid for turning Min-P from EMNLP reject to ICLR Oral!
https://twitter.com/omarsar0/status/1838611105805144191
can i build AGI with just a sampler? many are saying this