Robert Nishihara
Co-founder and CEO @anyscalecompute. Co-creator of @raydistributed. Previously PhD ML at Berkeley.
Jan 25, 2023 • 10 tweets • 5 min read
I remember in 2016 when @ApacheSpark set the record for sorting 100TB in the most cost-efficient way ($144 in 2016, $115 in today's prices).

Today, @raydistributed broke the $1 / TB barrier and set the world record at $97! 🔥📈🥳🎂🎗️🥂

anyscale.com/blog/ray-break…

I was at Berkeley at the time: @ucbrise and @berkeley_ai.

Though Spark had been used by many companies in production for ages, for many people, this marked Spark's transition from being a research project to a production-grade system.

And the rest is history. 😏
Dec 31, 2022 • 9 tweets • 4 min read
Exciting to see Quokka at the top of Hacker News (written by Ziheng Wang).

In ~1000 lines of Python, Quokka is a high performance fault-tolerant query engine built on

1⃣ Ray (@raydistributed) - distributed execution
2⃣ Polars - fast dataframes
3⃣ Arrow (@ApacheArrow) - fast I/O

We tried to design #Ray to be as flexible as possible, and this makes it possible to build not only scalable applications with Ray, but also to build entire scalable systems and products on top.