Latest Twitter Threads by @c_gebhard on Thread Reader App

Sep 14, 2021 • 10 tweets • 4 min read

@ParkerKMathias I haven't had the chance to look at the data yet, so these ideas might not be directly applicable, but I'd say there are at least three ways to approach this, depending on the scope and "error tolerance" of the analysis:

(1) Use tidyr::separate() to split a list of authors into separate columns, then pivot_longer() to put each author on its own row so they can be counted separately.

You could split
|Song 1 | Mariah Carey and So-and-so |

into
|Song 1 | Mariah Carey | So-and-so |

then pivot_longer into

|Song 1 | Mariah Carey |
|Song 1 | So-and-so |
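
A rough sketch of the split-then-pivot idea in R, in case it helps. I haven't seen the real data, so the data frame, the column names (`song`, `artists`, `artist_1`, `artist_2`, `position`) and the " and " separator are all assumptions made up for illustration. It also assumes at most two authors per song; more would need extra `into` columns.

```r
library(tidyr)

# Hypothetical toy data; column names and values are assumptions, not the real dataset.
songs <- data.frame(
  song = c("Song 1", "Song 2"),
  artists = c("Mariah Carey and So-and-so", "Someone Else"),
  stringsAsFactors = FALSE
)

# Step 1: split the author string into one column per author.
wide <- separate(
  songs,
  col = artists,
  into = c("artist_1", "artist_2"),
  sep = " and ",
  fill = "right"  # pad with NA on the right when a song has only one author
)

# Step 2: stack the author columns so each row is one song-author pair.
long <- pivot_longer(
  wide,
  cols = c(artist_1, artist_2),
  names_to = "position",
  values_to = "artist",
  values_drop_na = TRUE  # drop the NA padding added in step 1
)

# Count how often each artist appears across all songs.
artist_counts <- table(long$artist)
```

Real author strings tend to mix separators ("and", "&", "feat.", commas), so the `sep` pattern would probably need to be a broader regular expression, which is where the "error tolerance" comes in.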
