How to get URL link on X (Twitter) App
To learn something new, we shouldnโt need to finetune all the parameters of a large model.
Currently, we train models by doing a single pass over the data.
@dan_fried @ancadianadragan Weโd like AI agents that not only follow our instructions (โbook this flightโ), but learn to generalize to what to do in new contexts (know what flights I prefer from our past interactions and book on my behalf) โ i.e., learn *rewards* from language. [2/n]