1/ Robust Beauty of Improper Linear Models in Decision-Making (Dawes)
"Even improper linear models are superior to clinical intuition when predicting a numerical criterion from numerical predictors. In fact, unit (i.e., equal) weighting is quite robust."
2/ "In proper linear models, predictor variables are weighted such that the resulting linear composite optimally predicts some criterion of interest; examples of proper linear models are standard regression analysis, discriminant function analysis, and ridge regression analysis.
3/ "Research summarized in Paul Meehl's book on clinical vs. statistical prediction—and a plethora of research—indicate that when a numerical criterion (e.g., graduate GPA) is to be predicted from numerical predictor variables, proper linear models outperform clinical intuition.
4/ "Improper linear models are those in which the weights of the predictor variables are obtained by some nonoptimal method; for example, they may be obtained on the basis of intuition, derived from simulating a clinical judge's predictions, or set to be equal."
5/ "People are important. The model may integrate the information in an optimal manner, but it is always the individual (judge, clinician) who chooses variables. Moreover, the human judge knows the directional relationship between the predictors and the criterion of interest."
6/ "In multiple regression, b weights are notoriously unstable; the ratio of observations to predictors should be as high as 15 or 20 to 1 before b weights, which are the optimal weights, do better on cross validation than do simple unit weights."
7/ "Bootstrapping models use weights derived from judges; because they are not derived from the relationship between the predictor and criterion variables themselves, the resulting models are improper. Yet they consistently do better than the judges from which they were derived."
8/ "Random linear models perform about as well as the paramorphic models of the judges; these averages are presented in the third column of the table. Equal-weighting models [after standardizing the variables], presented in the fourth column, do even better."
9/ "Linear models are robust over deviations from optimal weighting. In other words, the bootstrapping finding, at least in these studies, has simply been a reaffirmation of the earlier finding that proper linear models are superior to human judgments."
10/ "People who argue that different criteria/judges/variables/time frames would produce different results have had 25 years in which to produce examples and have failed to do so.
"The illusion of good judgment may be reinforced when predictions have self-fulfilling effects."
11/ "Statistical prediction, because it includes the specification of exactly how poorly we can predict, bluntly strikes us with the fact that life is not all that predictable. Unsystematic clinical prediction, in contrast, allows us to hold to a comforting illusion."
12/ "The GRE is a carefully devised test. Do we really believe we can do a better job by a 10-minute folder evaluation or half-hour interview? Such cognitive conceit is unethical, especially given the complete lack of evidence that we do better than the linear equation does."
13/ Related reading:
Anomalies and News
"Analysts are normally wrong stocks' expected returns. Over pessimism/over-optimism could contribute to anomaly returns."
1/ Moneyball: The Art of Winning an Unfair Game (Michael Lewis)
"Baseball was at the center of a story about the possibilities—and limits—of reason. It showed how an unscientific culture responds (or fails to respond) to the scientific method." (p. xiv)
2/ "A small group of undervalued professional players & executives, many of whom had been rejected as unfit for the big leagues, turned themselves into one of the most successful franchises.
"How did one of the poorest teams, the Oakland Athletics, win so many games?" (p. xi)
3/ "Hitting statistics were abundant & had, for James, the powers of language. They were, in his Teutonic coinage, 'imagenumbers.' Literary material. When you read them, they called to mind pictures. He wrote... 'To get 191 hits in a season demands (or seems to) a consistency...
3/ "Value, momentum & defensive/quality applied to US individual stocks has a t-stat of 10.8. Data mining would take nearly a trillion random trials to find this.
"Applying those factors (+carry) across markets and asset classes gets a t-stat of >14."
2/ "The model's four terms describe different life stages for an individual who marries during the sample period. The intercept reflects the average life satisfaction of individuals in the baseline period [all noncohabiting years that are at least one year before marriage]."
3/ " 'How satisfied are you with your life, all things considered?' Responses are ranked on a scale from 0 (completely dissatisfied) to 10 (completely satisfied).
"We center life satisfaction scores around the annual mean of each population subsample in the original population."
1/ Short-sightedness, rates moves and a potential boost for value (Hanauer, Baltussen, Blitz, Schneider)
…
* Value spread remains wide
* Relationship between value and rates is not structural
* Extrapolative growth forecasts drive the value premium
… robeco.com/en-int/insight…
2/ "The valuation gap between cheap and expensive stocks remains extremely wide. This signals the potential for attractive returns going forward."
3/ "We observe a robust negative relationship between value returns and changes in the value spread.
"The intercept of ≈10% can be interpreted as a cleaner estimate of the value premium, given that it is purged of the time-varying effects of multiple expansions & compressions."