1/ @huggingface: The #ArtificialIntelligence community building the future

Behind the hugging face emoji 🤗 is the fastest growing open source community in history

If you have not heard of @huggingface, it may be one of most important platforms of the next decade
2/ Founders @ClementDelangue , @Thom_Wolf, and @julien_c originally set out to build open domain AI.

Quickly, they realized the technology and platform they were building was bringing value to companies.

They decided to open source it
3/ Two and a half years later, the platform has exploded to over 10,000 models, 1,000 datasets, and 45,000 GitHub stars

4/ Why did that happen?

Artificial Intelligence and Machine Learning felt distant to most, but @OpenAI's release of #GPT3 shocked many

AI and ML is growing rapidly, and Natural Language Processing (NLP) like #GPT3 is a good example of innovations that can change industries
5/ What is NLP?

@IBM defines NLP as "the branch of computer science - and more specifically... AI - concerned with giving computers the ability to understand text and spoken words in much the same way human beings can"
6/ Basically, it is a computer model that can understand us.

This includes sentiment (e.g., angry or sad), idioms (e.g., break a leg), confusing spelling (e.g., your vs. you're), etc.

Examples include Google Search auto-complete, iMessage suggested words, Siri voice recognition
7/ How?

In a crude simplification, scientists / engineers build a predictive model on massive datasets

The model:
1. Tokenizes - Segments the data into digestable parts
2. Trains - Identifies patterns
3. Applies - predicts future patterns
8/ Consider this example below with "Dogs are better than cats"

While this may seem simple, consider this: GPT-3 was trained on 175 BILLION parameters
9/ So why does this matter?

First, these models are powerful and have many applications. Companies like @copy_ai are building on these.

Even so, NLP models are just one version. There are many more.

Second, these models are difficult to develop, understand, and operationalize
10/ Applications

@huggingface currently has 18 different ML and AI tasks
11/ I further simplified into three categories:
12/ The applications are already wide, but they are expanding rapidly.

Recently, @huggingface was leveraged for protein language modeling:
13/ Challenges

There are, however, challenges. Scientists develop the models. Engineers often operationalize and commercialize them.

Currently, the industry is siloed, creating many challenges for both parties
14/ This is why @huggingface is critical to the industry.

Their open source platforms has connected science and engineers, centralizing a fragmented industry into a collaborative, innovative community
15/ Scientists can now quickly publish models to thousands of engineers, receiving feedback and usage critical to further development

Engineers can quickly find, understand, deploy, and train models now. They are no longer wasting hours searching for fragmented resources
17/ The result has been a flywheel, accelerating both the development and adoption (or embrace… get it? Hugging?) of AI and ML models
18/ Conclusion

@huggingface is strategically positioned at the center of one of the most important fields of development of the next ten years.

It will truly become the GitHub of Machine Learning
19/ The full writeup can be found on my Substack. I send ~1 a month
jeffburke.substack.com/p/hugging-face…
20/ Business Model

@huggingface is built on a tiered subscription (SaaS) model.

Subscribers get access to increasingly helpful features for model operationalization

huggingface.co/pricing
21/ Market sizing

Based on this, I have done a bottom-up analysis of market sizing based on regressive adoption of the tiered SaaS offering
I forgot to add Lee Fixel to the list! I will tag the fundraise announcement here to avoid anymore!

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Jeff Burke

Jeff Burke Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @Jeff_Burke14

1 Apr
1/@pipe: The traditional venture capital disruptor

One month ago, @pipe raised a $50M series A from some of the biggest names in tech.

Here is my full breakdown of @pipe:
2/ The problem

High-growth companies need capital to scale rapidly.

Their options include:
A) Debt
B) Venture capital

Debt is not ideal. The structure is rigid, and interest expense compromises long-term cash

Most companies choose VC
3/ In the traditional VC model, founders raise capital in exchange for equity.

The model has been quite effective and is a contributor to the pace of innovation we see today.

But is that the only way?
Read 19 tweets
3 Mar
1/ @Anchorage: The First Federally Chartered Digital Bank

In January, the US government gave there largest stamp of approval for crypto currencies like #Bitcoin

Here is my full analysis of the company poised to disrupt the banking system
2/ Cryptocurrency has exploded over the past 5-10 years.

Most people are familiar with #Bitcoin, but there are many other digital assets such as #Ether, #ZCash, #FacebookDiem, etc.

The space changes rapidly, and it requires significant levels of agility and expertise to keep up
3/ Founders @nathanmccauley and @diogomonica have over 20 years of security expertise.

Both led security within payments at @Square in the nascent stages

They recognized the upcoming challenges of securing digital assets.

In 2017, @Anchorage was born.
Read 19 tweets
10 Feb
1/ A huge hobby of mine is learning about innovative startups.

Recently, I came across @replit. I have been hooked since.

@amasad & @HayaOdeh are building the next computing platform to empower developers across the globe with almost no barrier to entry. Startup Battlecard for Repl.it
2/ Don't believe me? They already have 5M+ users. You probably just have not heard about it.

As of December, @paulg (Replit investor) announced that @replit has already passed 5M registered users

3/ Two months prior, @paulg highlights the real beauty of @replit long-term. A 20-year old programmer can learn, experiment, host, and deploy within @replit, creating a monthly income!!

Read 16 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!

Follow Us on Twitter!

:(