Relation between Cross Validation and Confidence Intervals
I read somewhere (I can't recall the source) that "in cross validation, the model with the best scores at a 95% confidence interval is picked".
But according to my statistics knowledge, for a CI (confidence interval) to be valid, you need a normality assumption about the sampling distribution of the statistic.
Yet that source seems to simply take the score from each fold, compute the sample mean, and build the confidence interval from it, without checking whether the central limit theorem applies at all. It seems to me this is what people generally do (see the sketch after the list below):
i) automatically assume normality of the sample MEAN of the fold scores (rather than examining the sampling distribution itself);
ii) assume the CLT is automatically satisfied.
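For concreteness, here is a minimal sketch of the practice I am describing (the dataset, model, and k = 10 are just placeholders I chose, not from the source): the reported "95% CI" is a t-interval around the mean of the k fold scores, which implicitly treats those scores as an i.i.d. normal sample.

```python
import numpy as np
from scipy import stats
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

# Placeholder data and model, only to illustrate the practice in question.
X, y = load_iris(return_X_y=True)
model = LogisticRegression(max_iter=1000)

# k-fold CV gives one score per fold (here k = 10).
scores = cross_val_score(model, X, y, cv=10)

# The "95% CI" that gets reported: a t-interval around the mean fold score.
# This implicitly assumes the fold scores behave like an i.i.d. normal sample.
mean = scores.mean()
sem = stats.sem(scores)  # standard error of the mean of the fold scores
ci = stats.t.interval(0.95, df=len(scores) - 1, loc=mean, scale=sem)

print(f"mean accuracy = {mean:.3f}, 95% CI = ({ci[0]:.3f}, {ci[1]:.3f})")
```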
May I know whether this is a misunderstanding on my part, or whether the industry has adopted a norm that is too loose? Thanks.
Topic: confidence, cross-validation
Category: Data Science