1/5 Finding a do-operator in a @DeepMind article is a tectonic progress that deserves welcoming blessing. The "delusions" treated in this article are endemic of "Evidential Decision Theory" which Causality (ch 4.1.1 ucla.in/38bmhnO)
summarizes in a mnemonic limerick:
2/5
- Whatever evidence an act might provide
- On what could have caused the act,
- Should never be used to help one decide
- On whether to choose that same act.
Typical real life ramifications of these delusions are:
(1) patients should avoid going to the doctor “to reduce the
3/5
probability that one is seriously ill”
(2) workers should never hurry to work, to reduce the probability of having overslept, and more.
The deployment of the do-operator eliminates these "delusions" and has led to the "sequential backdoor criterion" of Sec. 4.4.3.
4/5
Ortega etal are justified to title the paper "Shaking the Foundations", since even seasoned decision analysts often use the wrong probabilities on their decision trees (see Sec.11.6
ucla.in/2NnfGPQ#page=50)
not to mention ML researchers, many of whom still equate "knowledge"
5/5
with "training".
I haven't read the last part but would recommend that the authors take a good look at the "sequential backdoor criterion" to ensure that we do not introduce new "delusions" in trying to eliminate old ones.
#DataScience #MachineLearning #DeepLearning #AI

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Judea Pearl

Judea Pearl Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @yudapearl

1 Nov
1/ To appreciate what I mean by "assumptions whose plausibility you cannot judge" I often ask readers to examine how Imbens and Rubin (2015) define "unconfoundedness", the key concept needed for all causal inference. Quoting from their page 479, we find (fasten your seat belts):
2/: First,"the conditional distribution of the outcome under the control treatment, Y i (0), given receipt of the active treatment and given covariates, is identical to its distribution conditional on receipt of the control treatment and conditional on covariates, and second,
3/ that, analogously, the conditional distribution of the outcome under the active treatment, Y i (1), given receipt of the control treatment and conditional
on covariates, is identical to its distribution given receipt of the active treatment and conditional on covariates." No!
Read 4 tweets
19 Oct
1/ I'm compelled to retweet this thread because so often I see well-intentioned people assuming that a "happy ever after" 1-state solution is inevitable because Palestinian's rejection of Jewish self determination (in ANY borders) is so total and deeply entrenched that any other
2/ arrangement amounts to endless blood shed. As one born in Israel and tuned daily to the country's pulse, let me mention another factor which is often ignored in conversations about the 1-state fantasy. Israelis resistance to a 1-state solution is at least as total and deeply
3/ entrenched as the Palestinians' rejection of Zionism. For Israelis, the idea of relinquishing Statehood (i.e., right to self defense and border control) towers as a collective suicide, against which people are willing to fight to the last soul (barring a few ultra orthodox
Read 5 tweets
23 Sep
It's now 34-countries boycotting the Durban Conference, but my eyes are still on Norway. Oh Norway, Norway! How could your Gv't face it's people: Sorry, we fell asleep, and found ourselves swimming in the cesspool of civilization. To honor readers whose Governments remained 1/2
morally sober, I am listing the 34 countries that decided to boycott the Zionophobic Durban Conference:
Albania, Australia, Austria, Bulgaria, Canada, Colombia, Croatia, Cyprus, Czech Republic, Dominican Republic, Estonia, France, Georgia, Germany, Greece, Honduras, Hungary, 2/
Read 4 tweets
21 Sep
1/ Summarizing our discussion of "demand" via "ceteris paribus" (CP), we've seen that, once formalized, CP amounts to comparing Y under two settings of X, say X=x and X=x', while leaving other variables in the structural equation for Y unchanged. The beauty of formal definitions
2/ is that they hold for all models and are independent on the meanings of X, Y,Z, etc,
or the procedure by which we estimate things. Leveraging these beauties, we come to realize that the resultant CP definition of "demand" is none other but the counterfactual definition of
3/ Causal Effect, namely {Y(x, Z),Y(x', Z)}, where Z is the set of other variables in the eq. of Y, both observed and unobserved. Thus, the analysis of "demand" can benefit directly from the literature on causal effects, presenting no peculiarities that demand special treatments.
Read 4 tweets
5 Aug
1/ This just in. A new successful paradigm for building AI systems has emerged, called "Foundation Model". According to its inventors
crfm-stanford.github.io, it works as follows: "Train one model on a huge amount of data and adapt it to many applications." Not only is it
2/ seen "as the beginnings of a sweeping paradigm shift in AI", but a whole Center has been erected in its honor, dozens of prominent researchers, post-docs and PhD students has joined its staff, and an interdisciplinary symposium has been announced. We, foot-soldiers in the
3/ trenches of AI research are asking, of course: "What is it?" or "What is the scientific principle by which 'Foundation models' can circumvent the theoretical limitations of data-centric methods as we know them, especially those that hinder generalization across environments?"
Read 4 tweets
31 Jul
1/ Can "traditional statistics" handle "effect sizes?" If we include Neyman-Rubin in "traditional statistics" and interpret "Can" to mean "Can, in principle", the answer is Yes. However, if we take "traditional statistics" to be represented by: Pearson, Fisher, Chochran, Tuckey,.
2/ Breiman, Friedman,...+deceased presidents of ASA, RSS...+authors of stat texts+..., and if we interpret "Can" to mean "Capable of handling a simple problem in 2 weeks time," I would bet 100:1 on "NO!". Reason: They lacked a language to articulate the assumptions needed for
3/ estimating effect sizes, and it takes about 2 weeks to learn such a language, be it "potential outcomes", DAGs, SCM, or equivalent. Why haven't the giants bothered to learn any? My 1993-9 email is full of reasoned excuses, but the most common one has been: "It takes us out of
Read 4 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Thank you for your support!

Follow Us on Twitter!

:(