For the last 10 yrs, hedge funds swarmed at "alternative data" like pigeons at bread crumbs. Why? What exactly constitutes alt data & does it actually generate investment alpha?
👇
1/ What is it?
"Alternative data" is just a fancy term for data that doesn't appear on a 10-K/Q or earnings transcript.
Things like foot traffic at a retail store or credit card orders at a restaurant. Such data helps hedge funds predict earnings better (supposedly, at least).
2/ What are some common data sources?
App usage-- # of downloads over time is used to predict MAUs/DAUs & forward adoption rate; reviews also used to gauge product quality
Supply chain & logistics-- used to predict inventory & sales bottlenecks, pricing/bargaining power, etc.
Consumer transactions-- (e.g. credit card & email receipts) used to predict store-level & product-level revenue growth before quarterly earnings releases
Parking lot data-- to predict revenue growth at hotel, food, & entertainment chains
ESG-- used as early indicators of risks
Sentiment & NLP on wallstreetbets-- they wanna know the next 🦍🔥🚀 meme stonks
Satellite & drone imagery-- used to track industrial production (esp. in data-scarce 3rd world), oil inventories, agriculture harvests ...
Web crawlers-- used to track pricing & product distribution
3/ What data is underrated & under-tracked that hedge funds absolutely (should) use?
Cyber.
Especially short funds.
All these intrusion detection systems (Crowdstrike, Darktrace, Dragos) & automated pentesting tools (Metasploit, AttackIQ) & scanners (Qualys, Rapid7) etc... 🤔🤔
4/ Who provides alt data?
See diagram👇 for market map.
Data flow looks like this:
You+me/weather/some cornfield ➡️ aggregator➡️ cleanser/modeler➡️trader
Data providers are always aggregators & maybe cleansers/modelers.
Most hedge funds do their own (re)-cleansing/modeling.
5/ How much do hedge funds spend on alt data?
2020: Hedge funds spent $1.7B (in aggregate) on alt data (i.e. NOT THAT MUCH)
2019: 1/3 of funds spent >$1M/yr
Annual subscriptions range from $25K-$500K+
30% of transaction datasets cost >$150K/yr
40% of ESG datasets cost <$25K/yr
6/ Does it work?
So does alt data actually help generate investment alpha?
I wish I had stats to compare <returns of funds using alternative data> vs. <returns of funds NOT using>.
Sadly I don't. (Does anyone?)
The quick answer is that **some** datasets are truly useful.
Which?
1. A good wand needs to meet the right wizard. Data is only useful when an investor knows how to interpret it & knows its major sources of bias. 2. Alpha-rich data is not buyable by literally every bloke on the Street! 3. ESG is low demand, i.e. some low-signal charity donation.
4. Reread 1 & 2. It means do ur own data collection. Build ur own scrapers. That way u understand all sources of bias, and when things go wrong u can identify those hindsight-obvious factors you omitted.
5. If you can't build sh*t, at the very least sign an exclusive contract.
• • •
Missing some Tweet in this thread? You can try to
force a refresh
Each week I write a 🧵about investing, startups, or a cool market segment. All of it is knowledge my mentors & friends have taught me over the years.
🙏💗Now I'm passing it forward.
Here's a megathread of all past & future 🧵s. I'll be updating as we go. If you <3, plz share!
👇
1/ 💸 Jim Simons’ Playbook: King of Quant 💸
90% of active managers fail to beat the market, but Medallion boasts >40% annualized returns.
In 1926 McKinsey was a small tribe of bean counters led by a nobody accounting professor James. 95 yrs later that small tribe grew to a $10B ARR behemoth w/ 90 of the F100 as clients.
How?
What was McKinsey's secret sauce to world domination?
👇
1/ Language.
She who masters language wields the ultimate power of category creation. With this, all else falls into place.
McK’s biggest secret is that it category-created "management consulting.”
Projects aren't jobs; they’re “engagements.”
McK isn’t a bizniz; it's “The Firm”
Firing isn’t firing; it’s “corporate downsizing” and “increasing the bottom line.”
These nuances may seem trivial.
But we humans are storytelling animals. Nuances drive our narratives & narratives shape our aggregate spending. Today management consulting is a $255B industry. 🙀
Ryuk is the biggest Saas unicorn u've never heard of.
$150M ARR.
3 yrs old.
Maybe it’s taboo to learn business strategy from a cybergang. But the ransomware industry-- from supply chain operations to market microstructures-- is truly genius.
👇
1/ Some Highlights
$20B is the annual cost of global ransomware
$5M was the total payout to hackers @ Colonial Pipeline
$170K is the avg payout
2020 saw a 900% growth in fileless malware
67.3M attacks detected so far can be traced back to Ryuk
Every 11 seconds is a new attack
So how much $ do tier-1 ransomware gangs actually make?
The pie chart below shows total ransom paid to the top 15 groups in 2019. By 2020, outflows increased 311% YoY to $350M. Financial success in the cyber underground is clearly top-heavy, tracing out a power law distribution.
🤫Semi-Controversial Career Advice U Don't Want to Admit But Should🤫
How many cute fortune cookie aphorisms do u tell urself to feel better: "no pain, no gain" "it's the journey that counts!" And by now, how much progress have u sacrificed just to feel better?
Let's dig in👇
1/ Close doors early.
Stop putting life on the back burner just to "preserve optionality." Some ppl go into banking/consulting b/c it keeps doors open. It's a 2yr deferred ticket to HBS where u can then continue to preserve optionality! Hooray! Ur gonna be a generalist forever!
2/ Stop being a generalist.
"I'm good @ everything!"
Well for everything ur good at Billy/Sally is AMAZING at. Don't drink the liberal arts marketing kool-aid. They need admission money so they feed you b*shit. Been there done that. I wasted too many years of life. Save urself.
🏦Fed Jargon 101: A Primer on Hawks, Dots & Monetary Policy🏦
No matter what type of trader u are —discretionary, macro, crypto, chad, virgin— everyone obsesses abt the Fed.
But why? Here's a 🧵 on how Fed moves markets, controls ur PnL & the odd lingo we use to describe it.
👇
1/ Beta
#1 reason any trader cares about the Fed is to predict beta.
What's that?
Say u made 120% returns in 2020. Are u a genius or did u just own a buncha tech stocks as #jpow cranked the moneyprinter? Beta is the overall equity market risk (aka rising tide floats all boats).
To be "long beta" means ur net long stocks. "Short beta" means ur net short.
Beta on a single asset refers to correlation to an equity benchmark (eg SPY).
β>1: more volatile than SPY
0<β<1: less volatile
-1<β<0: less volatile & anticorrelated
β<-1: more volatile & anticorrelated
🎶Trader Jargon 101: How to Talk Like a Real Trader🎶
Learning the lingo was the #1 biggest hurdle that I experienced on the trading floor.
When u read fintwit posts from the pros, does it all sound like voodoo? 😫🙈Don't worry.
Here's 20 top terms to get you sounding smart.
👇
1/ Market maker vs Market taker
Markets are made up of makers & takers.
Makers: "producers" of buy/sell orders (they create quotes based on the max price at which they'd buy & min at which they'd sell)
Takers: "consumers" (they buy or sell instantly at the maker's price quote)
2/ Passive vs Aggressive Order
(Limit vs Market Order)
Passive/limit order: a trader sets a new price, different from the going rate (e.g. buy $TSLA at $800, while current price is $616)
Aggressive/market order: a taker executes a buy/sell immediately @ whatever the going rate