Glossary

Title Description
Bootstrap Distribution The distribution of a statistic across repeated resamples drawn with replacement from the empirical distribution, used to approximate the statistic's sampling variability.
Central Limit Theorem (CLT) A fundamental result stating that the sampling distribution of the mean of independent, identically distributed observations becomes approximately normal as the sample size increases, whatever the shape of the original distribution, provided it has finite variance.
Data Generating Process (DGP) The probabilistic mechanism that generates the observed data.
Empirical Distribution The distribution that assigns each value a probability equal to its observed relative frequency in the data.
Generalization The ability of a model or statistical estimate to perform well on new, unseen data drawn from the same underlying process.
Random Sample A collection of independent and identically distributed random variables drawn from the same data-generating process.
Random Vector A collection of random variables representing a single observation with multiple features.
Sampling Distribution The probability distribution of a statistic computed from repeated samples drawn from the same data-generating process.
Stratified Sampling A sampling technique that divides the population into subgroups (strata) and samples from each stratum to ensure representative coverage.
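
Several of the terms above (empirical distribution, bootstrap distribution, sampling distribution, CLT) fit together in one small computation. The following is a minimal sketch using only the Python standard library. The helper names `empirical_distribution` and `bootstrap_means`, the exponential data-generating process, and the sample and resample sizes are all illustrative assumptions, not part of the glossary.

```python
import random
import statistics
from collections import Counter


def empirical_distribution(data):
    """Map each observed value to its relative frequency in the data."""
    counts = Counter(data)
    n = len(data)
    return {value: count / n for value, count in counts.items()}


def bootstrap_means(data, n_resamples=2000, seed=0):
    """Resample the data with replacement; return the mean of each resample."""
    rng = random.Random(seed)
    n = len(data)
    return [statistics.fmean(rng.choices(data, k=n)) for _ in range(n_resamples)]


# Hypothetical random sample: 200 i.i.d. draws from a skewed (exponential) DGP.
rng = random.Random(42)
sample = [rng.expovariate(1.0) for _ in range(200)]

# Bootstrap distribution of the sample mean.
boot = bootstrap_means(sample)

# Its spread approximates the standard error of the mean, which the CLT-based
# formula estimates as s / sqrt(n).
boot_se = statistics.stdev(boot)
clt_se = statistics.stdev(sample) / len(sample) ** 0.5

print(empirical_distribution([1, 1, 2, 4]))  # {1: 0.5, 2: 0.25, 4: 0.25}
print(f"bootstrap SE: {boot_se:.3f}, CLT SE: {clt_se:.3f}")
```

The two standard errors should be close for a sample of this size, even though the underlying distribution is skewed. This is the sense in which the bootstrap distribution stands in for the sampling distribution when only one sample is available.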