P values?

Jey :crab: · 7 months ago

P values?

Jey :crab: · 7 months ago

@MarcusMuench @rstats @phdstudents @datascience @socialscience @org_studies
According to the second article:
“…A p value should be interpreted in terms of what would happen if you repeated the measurement multiple times with different samples…”
If I have a census, I would expect zero difference for repeating measurements due to random sampling. Therefore p values are irrelevant for census data.

Thanks for the references!

@arandomthought@sh.itjust.works · 7 months ago

Careful there. If you had a census of ALL the people in your population then you would not expect any variation, as you wrote. But not because of random sampling, but because next time you sample, you would just sample the exact same people (your whole population). And since the sample stays the same, so do the numbers.

If you however truly took a random sample of the population, then the next time you take a new random sample you ask different people and would therefore also get slightly different numbers. And there p-values are useful, because they are based on exactly this question of “well what if I took another random sample, and then another, and then another and so on”.