Large Language Models (LLMs) have emerged as the cornerstone of the current Generative AI revolution. The big problem with LLMs is that they are, well, large. Really, really, large.
1/7
They require an enormous amount of high-quality data to train and even more unfathomably large amount of computational power.
2/7
For a while now there has been a hope that fine-tuning a *smaller* language model on the output of some of these large ones would be a computationally efficient way of bringing the promises of GenAI to a wider audience.
3/7
However, in a recent in-depth research project, that promise has been seriously questioned. It seems that indeed for a genuinely general-purpose GenAI, the really large LLMs can’t be that easily replaced.
4/7
“Overall, we conclude that model imitation is a false promise: there exists a substantial capabilities gap between open and closed LMs that, with current methods, can only be bridged using an unwieldy amount of imitation data or by using more capable base LMs.
5/7
In turn, we argue that the highest leverage action for improving open-source models is to tackle the difficult challenge of developing better base LMs, rather than taking the shortcut of imitating proprietary systems.”
6/7
3. Post a screenshot of you in a session as a response below
1/7
A few points:
1. I am working with the NVIDIA marketing team to promote one giveaway; there are other influencers who are giving away more GPUs in their own giveaways.
2. GTC registration is completely free and open to the general public. All sessions are online.
2/7
3. You will need to take a screenshot of yourself in a session that you signed up for *while it's going on* and post it here as a response to the first tweet in this thread.
Below you will also find a few great sessions that I would recommend for this GTC.
3/7
Things seem to be moving at a breakneck speed in the world of generative AI and large language models. In a surprise press event yesterday, @Microsoft announced a wide integration of @OpenAI tools into a couple of their major products,
1/4
Bing search engine and Edge web browser. In particular, this seems to be the first time that we'll see anywhere a public use of OpenAI's next generation LLM, GPT4. Most of the new features are still relatively limited, and you'll need to join the waitlist for the full access. 2/4
This announcement is bringing a whole another level of interest and enthusiasm for Bing and Edge. I have used them only occasionally over the years, but these new capabilities might make me use them on a regular basis.
3/4
In a highly anticipated move, @Google yesterday announced that they are launching Bard, a conversational AI app that is based on their LaMDA model.
1/5
LaMDA - Language Model for Dialogue Applications - has been around for at least a year, but due to variety of considerations it has never been accessible to to the public.
2/5
Bard will be using a lighter, more computationally efficient, version of LaMDA. Bard’s rollout will proceed gradually, and in a month or so Google will be releasing an API for it to some trusted partners and developers.
3/5
Deep Learning and Neural Networks have become the default approaches to Machine Learning in recent years. However, despite their spectacular success in certain domains (vision and NLP in particular),
1/5
their use across the board for all ML problems and with all datasets is problematic, to say the least. Oftentimes better and more robust results can be obtained with simpler, easier to train and deploy, classical ML algorithms.
2/5
One such “traditional” approach was recently used to reevaluate sleep scoring on a few publicly available datasets. The results were published in the journal of Biomedical Signal Processing and Control.
3/5
There was nothing that shocked me more when I entered the industry from academia than this kind of attitude. I came from an environment where teaching and learning were the norm, to the one where giving help to “underperformers” was viewed with disdain as a liability.
1/5
Fortunately not all organizations and managers are this cutthroat, but this kind of mindset is pervasive, especially at startups. There is a widespread attitude that *it’s someone else’s responsibility to do the educating*: yours, your previous job’s, your college’s etc.
2/5
And in some way this is a *rational* attitude to have: there are hardly *any* incentives to help others get better, as this is almost never a peer of your performance evaluation.
3/5
Last week @DeepMind’s research on AlphaCode - a competative programming system - has been published in Science. AlphaCode has been able to beat 54% of humans on a competative coding challenges, putting it on par with many junior-level developers.
1/4
The original announcement from DeepMind came out in February, which in the fast-paced world of AI is already ancient history.
2/4
The explosive rise of generative AI over the past few months will most certainly have a major impact, if it already hasn’t, on the future versions of AlphaCode and similar AI-enabled coding resources.
3/4