Christian Gebhard Profile picture
fascinated by small scales 🧬, large scales 🪐 and all things data 📊 in between. mostly casual inferences. I publish what I learn on my blog.
Sep 14, 2021 10 tweets 4 min read
@ParkerKMathias I haven't had the chance to look at the data yet, so these ideas might not be directly applicable, but I'd say that there are at least three ways to approach this, depending on the scope and "error tolerance" of the analysis: @ParkerKMathias (1) using tidyr::separate() to split a list of authors and then pivot_longer() to count them separately

You could split
|Song 1 | Mariah Carey and So-and-so |

into
|Song 1 | Mariah Carey | So-and-so |

then pivot_longer into

|Song 1 | Mariah Carey |
|Song 1 | So-and-so |