Tweet

Brett Winton

20 Aug, 15 tweets, 4 min read

some thoughts on tesla AI day

starting with Tesla Vision

🧵

@karpathy

The Tesla Vision work is extraordinary.

And vision is the right word for it.

(Massive kudos to @karpathy and team.)

/1

The black magic of the system: its ability to iteratively predict future driving surfaces, intersection shapes, exhibit object persistence, and generally provide an intuited truth-state of the world onto which prediction, policy and planning can be built.
/2

That Tesla has managed to execute this vision system on end-point devices an ~order of magnitude less expensive than competing autonomous systems is both testament to the brutally parsimonious approach to the underlying problem and indicative of Tesla's strategic advantage.
/3

The hydranet architecture, which provides a pretrained trunk out of which split end-feature "heads" solves the problem of running the net latency-efficient on the end-device hardware and allowing disparate teams to iterate asyncronously on their particular problem-area.

/4

Periodically the entire system including the trunk gets re-trained and (I think) improvements in any individual head (say traffic light detection), will also have the advantage of improving the performance of the trunk (and so improve other outputs as well.)

/5

Of course a nice deep neural architecture is useless without the data to feed it. Since at least the introduction of its first autopilot board Tesla has had data option-value that outpaces its competitors. A frequent counterargument has been "well most of that data is boring"

/6

2 things are apparent (in this presentation and prior)

1: Tesla exploits the option value of its data resource to extract not-boring data for the particular corner-case they are trying to solve.

/7

(In this presentation the example used was vision-only depth estimation of a vehicle in front with debris obscuring view; with a neural net the task becomes trivial when you can scrape 10k truth-labeled examples of a similar situation in a few weeks.)

/8

2: Boring data is not as boring as you might think,

(IFF the data can be very inexpensively labeled and ingested, and if the root training task is to perceive and forecast the road surface and the dynamic and static objects using and surrounding it. )

/9

The amount of work that has gone into de-frictioning the labeling process

-Moving from 3rd party, to in-house;
-2d to 3d/4d;
-Hand-labeled to massively auto-labeled

has produced remarkable results.

/10

Using much more computationally intense neural nets to create the autolabeling tools that then clean the data to train the parsimonious net that can operate on the car-computer

is building a data machine that builds a data machine that runs the data machines

/11

Inexpensive marginal labeling capability matched against a massive queryable dataset allows Tesla to build a generalized model of the road-world (extensible to all geographies that the company has penetrated).

/12

As described in the presentation, the system presents spookily-good predictions of the road and objects ahead (early in the FSD beta release some accused the company of using HD-mapping b/c the vehicles assessment of the surroundings seemed so preternatural.)

/13

This is both the framework on which a generalized driving system can be built, and the way in which they built it provides the likely direction of travel for the other components in the system as they advance.

(More on this later when I have time).

/14

• • •

Missing some Tweet in this thread? You can try to force a refresh

This Thread may be Removed Anytime!

Twitter may remove this content at anytime! Save it as PDF for later use!

More from @wintonARK

Brett Winton

@wintonARK

26 Aug

https://twitter.com/wintonARK/status/1430730151331991553

The hosting platforms (marketplaces, game engines and virtual world sandboxes) become massive centralization vectors in the NFTivere
🧵

https://twitter.com/wintonARK/status/1430730151331991553

People want to buy .jpgs in order to social status signal.
They will naturally congregate on platforms that protect their status.
If the non-true owners of an avatar can similarly wear it on-platform, why buy an avatar at all (and what’s the fun in wearing it)?

/2

Fortnite et al will seek to attract these whales, and so will disallow the use of non-owned assets.
The dominant platforms will enact this curation function which, at first blush, seems trivial (just verify ownership) but proves massively nuanced

/3

Read 9 tweets

Brett Winton

@wintonARK

23 Aug

https://twitter.com/wintonARK/status/1429904355931541521

Continued thoughts

Why should Tesla’s rate of improvement in planning and control should exceed its competitors

🧵

https://twitter.com/wintonARK/status/1429904355931541521

One reason why Tesla's planning module should improve at a higher velocity than that of its competitors is due to an undeniable advantage in error-finding.

Tesla can measure simulated intervention rates on its human-driven fleet by running its software in shadow mode.

/1

Further, upon updating its planning module it can more easily verify/refute the efficacy of the change, again by running in shadow mode.

Put simply, due to its fleet Tesla has a much quicker driving-policy error-detection-->solution-verification loop than its competitors.

/2

Read 14 tweets

Brett Winton

@wintonARK

20 Aug

https://twitter.com/wintonark/status/1428828394150502403

Perhaps useful (at this point) to compare to Tesla Vision to HD-mapping strategies where an understanding of the road layout and intersection shapes, stop-signs and traffic flow lane-mappings are held in-memory.

🧵

https://twitter.com/wintonark/status/1428828394150502403

One can simultaneously understand why a Waymo or Cruise would pursue the HD-mapping strategy, and the vulnerabilities of such a rigid approach.

/1

Intuitively, you and I drive more easily on roads that we already know;

holding HD-maps in memory narrows the problem-space—the dynamic objects are things that don't appear on this map--in a way that appears more tractable.

/2

Read 11 tweets

Brett Winton

@wintonARK

21 Apr

I have solar.

Installed by Tesla since January.

Hasn't generated a single watt-hour.

Waiting for utility to approve the connection.

Attach bitcoin mining and sell into fiat, and the solar could have (probabilistically) generated $1,500 over this 3 months but instead has lain idle.

Compare to $17k for the full cost of the system (and $20k accommodating a right-sized miner.)

The system hasn't yet generated a watt-hour *and* I haven't yet paid a penny;

Tesla won't collect until the system is allowed to begin generating power.

(Put another way, I paid for this friction on the higher upfront cost required to pay back Tesla for its cost of capital.)

Read 5 tweets

Brett Winton

@wintonARK

21 Apr

https://twitter.com/sqcrypto/status/1384901045134974978

A world with bitcoin is a world with an electric grid that can accommodate a higher share of renewable energy.

(Why do we think so? Read here: wintonark.medium.com/bitcoin-mining…)

https://twitter.com/sqcrypto/status/1384901045134974978

A solar-battery system alone experiences decaying profitability as it attempts to accommodate a larger share of grid demand.
Paired with a bitcoin miner, however, the system can be effectively overpowered to hit baseload quality dispatch rates without hurting system profitability

Since the bitcoin-battery-solar system overbuilds solar (relative to battery) in order to provide baseload-competitive electricity, there is a likely second-order impact in furthering solar along its cost decline curve.

Read 4 tweets