Ming "Tommy" Tang

Jul 26 • 4 tweets • 2 min read

1/ Using random forest to calculate feature importance?
The importance scores might be biased. #machinelearning #featureimportance Thanks to @Matthew_N_B for pointing this out.
A thread 👇

2/ explained.ai/rf-importance/

The takeaway from this article is that the default importance strategies of the most popular RF implementation in Python (scikit-learn) and of R's RF do not give reliable feature importances

3/
when “... potential predictor variables vary in their scale of measurement or their number of categories.” (Strobl et al). Rather than figuring out whether your data set conforms to one that gets accurate results, simply use permutation importance.

4/
You can either use the article authors' Python implementation (rfpimp, installable via pip) or, if using R, make sure to set importance=TRUE in the random forest constructor and then use type=1 in R's importance() function. mljar.com/blog/feature-i…
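
To make the idea of permutation importance concrete, here is a minimal sketch in plain Python. It is not the rfpimp API or the article's code. The `permutation_importance` and `r2_score` helpers, the toy data, and the hand-written `predict` function (a stand-in for a fitted random forest) are all assumptions made for illustration. The technique: score the model once, shuffle one feature column at a time, and record how much the score drops.

```python
import random


def r2_score(y_true, y_pred):
    """Coefficient of determination (R^2) for two equal-length lists."""
    mean_y = sum(y_true) / len(y_true)
    ss_tot = sum((y - mean_y) ** 2 for y in y_true)
    ss_res = sum((y - p) ** 2 for y, p in zip(y_true, y_pred))
    return 1.0 - ss_res / ss_tot


def permutation_importance(predict, X, y, n_repeats=5, seed=0):
    """Mean drop in R^2 when each feature column of X is shuffled."""
    rng = random.Random(seed)
    baseline = r2_score(y, [predict(row) for row in X])
    importances = []
    for j in range(len(X[0])):
        drops = []
        for _ in range(n_repeats):
            # Shuffle only column j, breaking its link to the target
            column = [row[j] for row in X]
            rng.shuffle(column)
            X_perm = [row[:j] + [v] + row[j + 1:] for row, v in zip(X, column)]
            score = r2_score(y, [predict(row) for row in X_perm])
            drops.append(baseline - score)
        importances.append(sum(drops) / n_repeats)
    return importances


# Toy data (hypothetical): y depends strongly on x0, weakly on x1, not on x2
data_rng = random.Random(42)
X = [[data_rng.random() for _ in range(3)] for _ in range(200)]
y = [3.0 * x0 + 0.5 * x1 for x0, x1, _ in X]


def predict(row):
    # Stand-in for a fitted model's prediction; a real use would call the
    # trained random forest here, ideally on held-out data.
    return 3.0 * row[0] + 0.5 * row[1]


importances = permutation_importance(predict, X, y)
for name, value in zip(["x0", "x1", "x2"], importances):
    print(f"{name}: {value:.3f}")
```

Because the score is measured on the model's predictions rather than on how the trees were built, this approach does not depend on the scale or number of categories of each feature. In practice you would compute it on a validation set rather than the training data.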

More from @tangming2005

Ming "Tommy" Tang

@tangming2005

Jul 27

1/ "What's the most important factor for your success?"

2/ I have heard many answers.

The most common one I got is "luck".

3/ A little story:

Qi Lu was earning $27 a month when he was 27 years old. At 47, he was the president of Microsoft.

Ming "Tommy" Tang

@tangming2005

Feb 25

1/ Collecting scRNA-seq datasets in the context of immunotherapy. I will share what I know here; contributions are welcome. nature.com/articles/s4159…

2/ pubmed.ncbi.nlm.nih.gov/30388456/

3/ sciencedirect.com/science/articl…

Ming "Tommy" Tang

@tangming2005

Dec 14, 2021

1/ Different ways to read all files into R. A thread:

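2/ files <- as.list(dir(".", pattern = "\\.tsv$"))  # match only names ending in .tsv
datlist <- lapply(files, function(f) {
  # read one tab-separated file and tag its rows with the sample name
  dat <- read.table(f, header = TRUE, sep = "\t", quote = "\"")
  dat$sample <- sub("\\.tsv$", "", f)
  return(dat)
})
# stack the per-file data frames into one
data <- do.call(rbind, datlist)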

3/ github.com/vsbuffalo/devn…

