Random Variables and Expectations

Review

Correlation
Covariance

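$$ρ_{xy} = \frac{σ_{xy}}{σ_{x} σ_{y}}$$

where $σ_{xy}$ is the covariance of $X$ and $Y$, and $σ_{x}$, $σ_{y}$ are their standard deviations.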

Correlation and causation
- Per capita cheese consumption is highly correlated with the “number of people who died by becoming tangled in their bedsheets”
- The popularity of the first name Annabelle is correlated with “UFO sightings in Maryland”

Correlation doesn’t imply causation

Anscombe’s quartet
- Anscombe created 4 data sets with nearly identical descriptive statistics, including the linear correlation coefficient
- $ρ$ measures the degree of linear association; $r_{s}$ measures the degree of monotone (rank) association; Chatterjee's coefficient is a further alternative.
  - The Pearson correlation coefficient has limits
- The data sets turn out to be very different, which we can only see because we visualized the data

It illustrates that data sets with the same correlation coefficient can look very different in reality.
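The gap between linear and monotone association can be sketched in a few lines. This is an illustration only, not Anscombe's data: the helper names `pearson`, `ranks` and `spearman` and the exponential test data are assumptions made for this sketch. The relationship $y = e^{x}$ is perfectly monotone but far from linear, so the rank-based coefficient should be essentially 1 while Pearson's coefficient is clearly lower.

```python
import math

def pearson(xs, ys):
    """Pearson correlation: covariance divided by the product of standard deviations."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

def ranks(values):
    """Ranks 1..n of the values; assumes no ties (true for the data below)."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, i in enumerate(order, start=1):
        result[i] = rank
    return result

def spearman(xs, ys):
    """Spearman rank correlation: Pearson correlation computed on the ranks."""
    return pearson(ranks(xs), ranks(ys))

# Hypothetical data: a strictly increasing but strongly non-linear relationship.
xs = [i / 2 for i in range(1, 21)]   # 0.5, 1.0, ..., 10.0
ys = [math.exp(x) for x in xs]

r = pearson(xs, ys)
r_s = spearman(xs, ys)
print(f"Pearson  r   = {r:.3f}")
print(f"Spearman r_s = {r_s:.3f}")
```

A single number such as $ρ$ cannot tell these situations apart, which is why plotting the data remains necessary.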

Capital $X$ is a random variable; $x_{1}, \dots, x_{n}$ are realizations, i.e. actual numerical values of $X$.

Transclude of Random-Variables-and-Expectations-2024-12-12-05.12.39.excalidraw

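The expected value of the sample mean $\overline{X}$ equals the population mean $μ$, so $\overline{X}$ is an unbiased estimator of $μ$.

$$E(\overline{X}) = E\left[\frac{1}{n} \sum_{i=1}^{n} X_{i}\right] = \frac{1}{n} \sum_{i=1}^{n} E(X_{i}) = μ$$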

The same reasoning applies to the variance, assuming the $X_{i}$ are independent with common variance $σ_{X}^{2}$:

$$σ_{\overline{X}}^{2} = \operatorname{Var}\left[\frac{1}{n} \sum_{i=1}^{n} X_{i}\right] = \sum_{i=1}^{n} \left(\frac{1}{n}\right)^{2} \operatorname{Var}(X_{i}) = \frac{σ_{X}^{2}}{n}$$


The larger the sample size, the smaller the variance of $\overline{X}$, meaning that the value of the estimator will be closer to the true population mean (the value it is trying to estimate):

$σ_{\overline{X}}^{2} = \frac{σ^{2}}{n} ⟹ σ_{\overline{X}} = \frac{σ}{\sqrt{n}}$

As the sample size grows, in the extreme until the sample covers the whole population, the variance of $\overline{X}$ shrinks to zero, and the estimate $\overline{X}$ is then the population mean itself.

As the sample size increases ($n = 5, 10, 20, 50, 100$), the distribution of the sample mean concentrates around the population mean.
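A small simulation can make this concentration visible. The sketch below is only an illustration under stated assumptions: the population is taken to be standard normal ($μ = 0$, $σ = 1$), and the function name `sample_mean_variance`, the number of trials and the seed are choices made here, not part of the notes. For each $n$ it draws many samples, computes their means, and compares the variance of those means with the theoretical $σ^{2}/n$.

```python
import random
import statistics

def sample_mean_variance(n, trials=5000, mu=0.0, sigma=1.0, seed=0):
    """Variance of the sample mean across `trials` simulated samples of size n."""
    rng = random.Random(seed)
    means = [
        statistics.fmean(rng.gauss(mu, sigma) for _ in range(n))
        for _ in range(trials)
    ]
    return statistics.pvariance(means)

# Same sample sizes as in the notes; sigma = 1, so the theoretical value is 1/n.
results = {n: sample_mean_variance(n) for n in (5, 10, 20, 50, 100)}

for n, simulated in results.items():
    print(f"n={n:3d}  simulated Var(mean)={simulated:.4f}  theory sigma^2/n={1.0 / n:.4f}")
```

The simulated variances should track $1/n$ closely, shrinking by roughly a factor of 20 between $n = 5$ and $n = 100$.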

Keywords:

Sampling
Estimation
Estimator
Estimate

What are the desirable properties of these estimators?

Properties

Consistency

Sufficient condition:

The estimator is unbiased and its variance tends to $0$ when $n$ becomes large.
A biased estimator may still be consistent if its bias vanishes as the sample size increases, so unbiasedness is not a necessary condition.¹

The two conditions can also be stated as

- $E_{θ}(T_{n}) \to γ(θ)$ as $n \to \infty$
- $\operatorname{Var}_{θ}(T_{n}) \to 0$ as $n \to \infty$

Then $T_{n}$ is a consistent estimator of $γ(θ)$.
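As a rough illustration of a biased but consistent estimator, the sketch below uses the variance estimator with divisor $n$ instead of $n - 1$. This example is chosen here and does not come from the notes; the names `biased_var` and `simulate`, the normal population with $σ = 2$, the trial count and the seed are all assumptions. Its theoretical bias is $-σ^{2}/n$, which vanishes as $n$ grows, and its variance shrinks as well.

```python
import random
import statistics

def biased_var(xs):
    """Variance estimator with divisor n (biased downward by sigma^2 / n)."""
    m = statistics.fmean(xs)
    return sum((x - m) ** 2 for x in xs) / len(xs)

def simulate(n, trials=2000, sigma=2.0, seed=1):
    """Return (empirical bias, empirical variance) of biased_var for sample size n."""
    rng = random.Random(seed)
    estimates = [
        biased_var([rng.gauss(0.0, sigma) for _ in range(n)])
        for _ in range(trials)
    ]
    bias = statistics.fmean(estimates) - sigma ** 2
    return bias, statistics.pvariance(estimates)

summary = {n: simulate(n) for n in (5, 20, 100, 500)}

for n, (bias, var) in summary.items():
    print(f"n={n:3d}  bias={bias:+.4f}  (theory {-4.0 / n:+.4f})  variance={var:.4f}")
```

Both the bias and the variance should move toward $0$ as $n$ increases, which matches the two conditions stated above.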

Plim definition:

$$\lim_{n \to \infty} P\left[|Z_{n} - α| \geq ϵ\right] = 0 \text{ for every } ϵ > 0 ⟹ \operatorname{plim} Z_{n} = α$$
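One way to see why the two conditions stated earlier (expectation converging to $γ(θ)$, variance converging to $0$) are sufficient for consistency is Chebyshev's inequality applied to the squared deviation. This is a sketch of the standard argument rather than something taken from the notes; it uses the plim definition with $Z_{n} = T_{n}$ and $α = γ(θ)$. For any $ϵ > 0$:

$$
P\left[|T_{n} - γ(θ)| \geq ϵ\right] \leq \frac{E_{θ}\left[(T_{n} - γ(θ))^{2}\right]}{ϵ^{2}} = \frac{\operatorname{Var}_{θ}(T_{n}) + \left(E_{θ}(T_{n}) - γ(θ)\right)^{2}}{ϵ^{2}}
$$

Both terms in the numerator tend to $0$ as $n \to \infty$, so the probability on the left tends to $0$ and $\operatorname{plim} T_{n} = γ(θ)$.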

Plim properties

$$\operatorname{plim}(X + Y + Z) = \operatorname{plim}(X) + \operatorname{plim}(Y) + \operatorname{plim}(Z)$$

and 5 more…

If we have several unbiased estimators, how do we choose among them? We simply select the one with the smallest variance (highest precision).

If we have an unbiased estimator with a larger variance and a biased estimator with a smaller variance, how do we choose?

We can turn to a loss function, the mean squared error (MSE).

Transclude of Random-Variables-and-Expectations-2024-12-12-05.37.40.excalidraw

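$$
\begin{aligned}
MSE(Z) &= E[(Z - θ)^{2}] = E[(Z - μ_{Z} + μ_{Z} - θ)^{2}] \\
&= E[(Z - μ_{Z})^{2}] + 2 (μ_{Z} - θ) E[Z - μ_{Z}] + (μ_{Z} - θ)^{2} \\
&= σ_{Z}^{2} + (μ_{Z} - θ)^{2}
\end{aligned}
$$

The middle step expands the square with the identity $(a + b)^{2} = a^{2} + 2ab + b^{2}$, taking $a = Z - μ_{Z}$ and $b = μ_{Z} - θ$; the cross term vanishes because $E[Z - μ_{Z}] = 0$.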

The MSE combines the variance and the squared bias into a single measure. Comparing MSEs shows whether accepting some bias is worth it, that is, whether the bias-variance tradeoff leads to a smaller MSE. But again, “best” is subjective…
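The tradeoff can be checked numerically. The sketch below is an illustration under assumptions made here, not the example from the notes: it estimates the variance of a standard normal population ($θ = σ^{2} = 1$) from samples of size $n = 10$, comparing the unbiased estimator (divisor $n - 1$) with the biased one (divisor $n$). The names `summarize` and `sum_sq_dev`, the trial count and the seed are hypothetical choices.

```python
import random
import statistics

def summarize(estimates, theta):
    """Empirical (bias, variance, MSE) of a list of estimates of theta."""
    bias = statistics.fmean(estimates) - theta
    var = statistics.pvariance(estimates)
    mse = statistics.fmean((z - theta) ** 2 for z in estimates)
    return bias, var, mse

def sum_sq_dev(xs):
    """Sum of squared deviations from the sample mean."""
    m = statistics.fmean(xs)
    return sum((x - m) ** 2 for x in xs)

rng = random.Random(42)
n, trials, sigma2 = 10, 20000, 1.0
ss = [sum_sq_dev([rng.gauss(0.0, 1.0) for _ in range(n)]) for _ in range(trials)]

unbiased = summarize([s / (n - 1) for s in ss], sigma2)  # divisor n - 1
biased = summarize([s / n for s in ss], sigma2)          # divisor n

for name, (bias, var, mse) in (("unbiased", unbiased), ("biased", biased)):
    print(f"{name:8s} bias={bias:+.4f}  var={var:.4f}  "
          f"var+bias^2={var + bias ** 2:.4f}  MSE={mse:.4f}")
```

In this setup the biased estimator should show a smaller MSE despite its bias, because its variance is lower; in both rows the MSE matches variance plus squared bias.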

It helps to read variance as a measure of precision: the smaller the variance, the more precise the estimator.

¹ Some theoretical results apply only to large samples.
