How to get URL link on X (Twitter) App
https://twitter.com/janleike/status/1886452525437800874More capable LLMs can be misused to cause more harm. E.g., what if a terrorist can build a weapon of mass destruction with step-by-step guidance from an LLM?
https://twitter.com/nabla_theta/status/1798763600741585066What are sparse auto-encoders (SAEs)?