How to get URL link on X (Twitter) App
2/ Deliberative alignment worked surprisingly well! We saw 30x reduction in rate of covert action from 13% to 0.4% in our test environments.
2/5 Traditionally, AI models are just trained with (input, good response, bad response) data, but they are not taught to reason *why* these responses are good or bad. This teaches good "system 1" instincts, but these can fail in new situations. "System 2" allows model to adapt, e.g. when input is encoded.

https://twitter.com/minilek/status/15113585251794534403) He tweets Boaler's public contract with a public school district, which is available on their website.
https://twitter.com/tomgoldsteincs/status/1484609273162309634They were proposed in 80s and 90s, considered wildly impractical, and only recently began to be implemented and used.

2/20 Private key encryption often boils down to simply combining many non-local and non-linear operations. Making it efficient is challenging but if willing to lose some, a Monkey with a typewriter could probably construct a block cipher. See cambridge.org/core/journals/… by @wtgowers
https://twitter.com/boazbaraktcs/status/1380149213066096646?s=20