The black magic of the system: its ability to iteratively predict future driving surfaces, intersection shapes, exhibit object persistence, and generally provide an intuited truth-state of the world onto which prediction, policy and planning can be built.
/2
That Tesla has managed to execute this vision system on end-point devices an ~order of magnitude less expensive than competing autonomous systems is both testament to the brutally parsimonious approach to the underlying problem and indicative of Tesla's strategic advantage.
/3
The hydranet architecture, which provides a pretrained trunk out of which split end-feature "heads" solves the problem of running the net latency-efficient on the end-device hardware and allowing disparate teams to iterate asyncronously on their particular problem-area.
/4
Periodically the entire system including the trunk gets re-trained and (I think) improvements in any individual head (say traffic light detection), will also have the advantage of improving the performance of the trunk (and so improve other outputs as well.)
/5
Of course a nice deep neural architecture is useless without the data to feed it. Since at least the introduction of its first autopilot board Tesla has had data option-value that outpaces its competitors. A frequent counterargument has been "well most of that data is boring"
/6
2 things are apparent (in this presentation and prior)
1: Tesla exploits the option value of its data resource to extract not-boring data for the particular corner-case they are trying to solve.
/7
(In this presentation the example used was vision-only depth estimation of a vehicle in front with debris obscuring view; with a neural net the task becomes trivial when you can scrape 10k truth-labeled examples of a similar situation in a few weeks.)
/8
2: Boring data is not as boring as you might think,
(IFF the data can be very inexpensively labeled and ingested, and if the root training task is to perceive and forecast the road surface and the dynamic and static objects using and surrounding it. )
/9
The amount of work that has gone into de-frictioning the labeling process
-Moving from 3rd party, to in-house;
-2d to 3d/4d;
-Hand-labeled to massively auto-labeled
has produced remarkable results.
/10
Using much more computationally intense neural nets to create the autolabeling tools that then clean the data to train the parsimonious net that can operate on the car-computer
is building a data machine that builds a data machine that runs the data machines
/11
Inexpensive marginal labeling capability matched against a massive queryable dataset allows Tesla to build a generalized model of the road-world (extensible to all geographies that the company has penetrated).
/12
As described in the presentation, the system presents spookily-good predictions of the road and objects ahead (early in the FSD beta release some accused the company of using HD-mapping b/c the vehicles assessment of the surroundings seemed so preternatural.)
/13
This is both the framework on which a generalized driving system can be built, and the way in which they built it provides the likely direction of travel for the other components in the system as they advance.
(More on this later when I have time).
/14
• • •
Missing some Tweet in this thread? You can try to
force a refresh
People want to buy .jpgs in order to social status signal.
They will naturally congregate on platforms that protect their status.
If the non-true owners of an avatar can similarly wear it on-platform, why buy an avatar at all (and what’s the fun in wearing it)?
/2
Fortnite et al will seek to attract these whales, and so will disallow the use of non-owned assets.
The dominant platforms will enact this curation function which, at first blush, seems trivial (just verify ownership) but proves massively nuanced
/3
One reason why Tesla's planning module should improve at a higher velocity than that of its competitors is due to an undeniable advantage in error-finding.
Tesla can measure simulated intervention rates on its human-driven fleet by running its software in shadow mode.
/1
Further, upon updating its planning module it can more easily verify/refute the efficacy of the change, again by running in shadow mode.
Put simply, due to its fleet Tesla has a much quicker driving-policy error-detection-->solution-verification loop than its competitors.
/2
Perhaps useful (at this point) to compare to Tesla Vision to HD-mapping strategies where an understanding of the road layout and intersection shapes, stop-signs and traffic flow lane-mappings are held in-memory.
One can simultaneously understand why a Waymo or Cruise would pursue the HD-mapping strategy, and the vulnerabilities of such a rigid approach.
/1
Intuitively, you and I drive more easily on roads that we already know;
holding HD-maps in memory narrows the problem-space—the dynamic objects are things that don't appear on this map--in a way that appears more tractable.
/2
A solar-battery system alone experiences decaying profitability as it attempts to accommodate a larger share of grid demand.
Paired with a bitcoin miner, however, the system can be effectively overpowered to hit baseload quality dispatch rates without hurting system profitability
Since the bitcoin-battery-solar system overbuilds solar (relative to battery) in order to provide baseload-competitive electricity, there is a likely second-order impact in furthering solar along its cost decline curve.