How to get URL link on X (Twitter) App
https://twitter.com/deliprao/status/1810660727646265852The unboxing experience is unlike anything else. The reader itself is in this pillow-like case. Dosa, the hoarder of pillows, wanted it for herself! 😆🐶

An important point not to be missed is that mixed-use students don't necessarily have a gain over no-LLM students on a sufficiently challenging task with reasonably competitive humans.
It seems to follow the PyTorch API closely and provides many useful primitives right out of the box. For example, implementing a decoder-only transformer is as simple as this!
https://twitter.com/_aidan_clark_/status/1681784631295893505Before you think this is some influencer^ thread sh*t, I worked explicitly on social media disinformation campaigns during the 2016 elections and have worked on countermeasures.
2. what makes it “turbo” or fast is still TBA but my bet is it’s some kind of mixture-of-experts setup, besides other systems optimizations.
https://twitter.com/deliprao/status/1630955050263773184But I am all for capitalism-driven innovation/progress because it reduces certain types of suffering, even if happiness is an elusive and unrelated goal.
I remember seeing @srush_nlp (Sasha even wrote an explainer) and @yoavgo tweeting about it some time ago when I couldn’t pay much attention to it. Surprisingly, the larger Transformer/LLM Twitter crowd has not given this as much attention as they should (read more for why).
Go tell a cis-white low-income class heterosexual male from the Appalachia that he’s well represented in ML research. OTOH I check many of these “URM” boxes, but it will be silly for me to claim that.
https://twitter.com/ilex_ulmus/status/1601987263411404801Western Buddhism lacks the devotional aspect primarily because of the extreme individualism that’s rooted in western society. This also explains why Zen movements became more palatable to westerners than any early flavors of Buddhism.
https://twitter.com/jheitzeb/status/15981549435125964801. Google has more LLMs deployed internally than any place I know. If private communication is to be believed that number is in the order of “few dozens”. Not talking of BERT/T5 sized models here.
https://twitter.com/gabrielpeyre/status/1393793331516350468The Laplace-Beltrami operator (LBO) on a Riemannian manifold is approximated by the graph laplacian (L = D - A). The normalized graph laplacian has connections with random walks, diffusion processes (Fokker-plank equations), Brownian motion, and heat equations.
Going to live tweet somethings because of the Shannon fanboy I am 😄
https://twitter.com/lousylinguist/status/1068285983483822085First, let’s understand why someone might want to eliminate stopwords? The origins of this method lies in information retrieval where it became a common practice to eliminate high frequency words (stopwords) while building inverted indices.