Orr et al. look at three RCTs in education (one on 36 charter schools, one on ed tech in 132 schools, and one on 84 Head Start centers).
Answer: not very well.
multisite evaluations to accurately predict the likely consequences of adopting an intervention or policy."
First, it would likely be even more difficult to extrapolate to other sites that didn't join the RCT in the first place.
* On temporal external validity, see nber.org/papers/w22449.