Across a range of experimental settings, we find that LLMs have some ability to evaluate programs on given inputs when trained on their source code alone (no I/O examples). In this thread: my 3 favourite findings, and where I think this line of work can go.
Ever since LLMs entered the stage, one hypothesis has been prevalent:
