The root challenge of all modern frequentist statistics is "does the selection criteria for the sample actually allow me to generalize the results". Experiments and post-hoc analysis are really sensitive to selection biases and researchers (and much worse the media) tend to overgeneralize the results of a study. Its particularly bad in medicine. My personal bugbear is that you can only participate in a study for an autoimmune disorder if you only have the 1 disorder of focus, but real people with autoimmune disorders tend to have more than one (diagnosed or not). So studies on treatment efficacy basically can't actually generalize to the populations they intend to treat because they select against them.
All those things are either frequentist or baysian statistical models. Baysian models are much harder (often impossible) to use but at least pay attention to intersectionality.