Kyunghyun Cho Profile picture
Nov 11, 2019 3 tweets 2 min read Read on X
digesting what I learned about dropout in the morning in the mercado Puerto #khipu2019 ImageImageImage
and of course digestion ends at...? Image
it's weirdly mesmerizing to look at this wheel circling over and over Image

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Kyunghyun Cho

Kyunghyun Cho Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @kchonyc

Aug 31, 2024
do we want to know which variables are direct causes of a target outcome, or the full dependencies among all variables?

gradually i started to think that it's probably neither, since the utility of each cause is not a function of the distance to the target outcome variable but is more a function of whether we can design an effective and efficient intervention strategy.

@JangHyun_k and i thus started to think of targeted cause discovery.

(1/4)Image
of course, a natural follow-up task is to design an algorithm, which is where most of the challenges lie. instead of designing an ingenious algorithm out of thin air, we decided to let a neural net design an algorithm for us, as has been found to be effectively for causal discovery in the recent years (e.g. work by and others)

in doing so, we realized that "cause discovery" rather than full "causal discovery" has a distinct advantage in scaling up these learning-based approaches.

(2/4)Image
it turned out a neural net can indeed discover an algorithm for targeted cause discovery that is robust to the distance between a cause and outcome and scales well up to thousands if not tens of thousands of variables.

we tested it on a variety of synthetic tasks as well as semi-synthetic gene-gene cause discovery.

(3/4)Image
Image
Read 5 tweets
Jul 23, 2024
enjoying #ICML2024 ? already finished with llama-3.1 tech report? if so, you must be concerned about the emptiness you'll feel on your flight back home in a couple of days.

do not worry! Wanmo and i have a new textbook on linear algebra for you to read, enjoy and cry on your long flight.

(1/5)Image
have you ever wondered why SVD comes so late in your linear algebra course?

both wanmo (math prof) and i (cs prof) began to question this a couple of years ago. after all, svd is one of the most widely used concepts from linear algebra in engineering, data science and AI. why wait until the end of the course?

(2/5)Image
Image
Image
Image
we began to wonder further whether SVD can be introduced as early as possible. i mean ... even before introducing positive definite matrices, matrix determinants and even ... eigenvalues (gasp!) without compromising on mathematical rigors.

(3/5)
Read 5 tweets
Jul 23, 2024
very cool to see a pretty exhaustive and extensive technical report on llama-3.1!

a few fun snippets 🧵
PLEASE release this custom html parse PLEASE 🙏 Image
lesson 1: AGI won’t happen due to the degrading QC of NVIDIA.

lesson 2: even Meta couldn’t figure out NCCL watchdog timeout error 😂 Image
Read 9 tweets
Jul 10, 2024
we all want to and need to be prepared to train our own large-scale language models from scratch.

why?

1. transparency or lack thereof
2. maintainability or lack thereof
3. compliance or lack thereof

and because we can, thanks to amazing open-source and open-platform ecosystem.

(1/12)
we have essentially lost any transparency into pretraining data.

(2/12)
Image
Image
we are being force-fed so-called values of silicon valley tech co's, ignoring the diversity in values across multiple geographies, multiple sectors and multiple groups.

(3/12)
Image
Image
Read 13 tweets
May 15, 2024
this semester (spring 2024), i created and taught a new introductory course on causal inference in machine learning, aimed at msc and phd students in cs and ds. the whole material was created from scratch, including the lecture note and lab materials;

1/4docs.google.com/document/d/1qN…
now that the course is finally over, i've put all the lab materials, prepared by amazing @taromakino, @Daniel_J_Im and @dmadaan_, into one @LightningAI studio, so that you can try them out yourselves without any hassle;

2/4lightning.ai/kc119/studios/…
i'm also making the lecture note i used to teach lectures throughout the semester publicly as well at .

3/4arxiv.org/abs/2405.08793
Read 4 tweets
Aug 23, 2021
good morning!

as i tweeted last week, Prescient Design Team at gRED within @genentech is hiring awesome people. in particular, we have the following positions already open and ready:
[Engineering Lead] we want you to work with us to build a team for creating an ML infrastructure that seamlessly integrate between ML and bio: gene.com/careers/detail…
[Machine Learning Scientist] we have a ton of challenging problems inspired & motivated by biology, chemistry & medicine that are waiting for your creativity, knowledge and ingenuity in ML/AI: gene.com/careers/detail…

cc: @stephenrra
Read 6 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(