Inferences about a Mean Vector

Definition:

Given a set of vectors as dataset, what can we infer about the mean

Simple Mean Vector Test:

Consider the test in $R^{p}$ with $H_{0} : μ = μ_{0}$ and $H_{1} : μ \neq = μ_{0}$
Define $T^{2} = n (\overset{ˉ}{X} - μ_{0})^{⊺} S^{- 1} (\overset{ˉ}{X} - μ_{0})$ , T-squared Statistic
- Where $\overset{ˉ}{X} = \frac{1}{n} \sum_{j = 1}^{n} X_{j}$ and $S = \frac{1}{n - 1} \sum_{j = 1}^{n} (x_{j} - \overset{x}{ˉ}) (x_{j} - \overset{x}{ˉ})^{⊺}$
$T^{2} \sim \frac{( n - 1 ) p}{n - p} F_{p, n - p}$
- whe re $F_{p, n - p}$ denotes F-distribution with $p$ and $n - p$ degrees of freedom
At $α$ level of significant, we reject $H_{0}$ if $T^{2} > \frac{( n - 1 ) p}{n - p} F_{p, n - p} (α)$
- where $F_{p, n - p} (α)$ is the upper (100 $α$ )-th percentile of the $F_{p, n - p}$ distribution
- If $T^{2}$ is too large, meaning $\overset{x}{ˉ}$ is too far from $μ_{0}$

Confidence Regions of Component Means:

A $100 (1 - α) %$ confidence region for the mean of a $p$ -dimension normal distribution is the ellipsoid determined by all $μ$ such that $n (\overset{x}{ˉ} - μ)^{⊺} S^{- 1} (\overset{x}{ˉ} - μ) \leq \frac{( n - 1 ) p}{n - p} F_{p, n - p} (α)$
- where $\overset{x}{ˉ} = \frac{1}{n} \sum_{j = 1}^{n} x_{j}$ and $S = \frac{1}{n - 1} \sum_{j = 1}^{n} (x_{j} - \overset{x}{ˉ}) (x_{j} - \overset{x}{ˉ})^{⊺}$
The confidence ellipsoid is $\overset{x}{ˉ} \pm λ_{i} \frac{( n - 1 ) p}{n - p} F_{p, n - p} (α) e_{i}$
- where $S e_{i} = λ e_{i}$ for $i = 1, ... p$
- Eigenvalue-Eigenvector pairs
Let $X_{1}, .., X_{2}$ be a normal random sample from an $N_{p} (μ, λ)$ population with $\sum$ positive definite.
- Then, simultaneously for all $a$ , the interval $a^{⊺} \overset{ˉ}{X} \mp \frac{( n - 1 ) p}{n ( n - p )} F_{p, n - p} (α) a^{⊺} S a$ will contain $a^{⊺} μ$ with probability $(1 - α)$
- these simultaneous intervals are also referred as $T^{2}$ -invervals
- ex: We can make statements about the differences $μ_{i} - μ_{k}$ by choosing $a^{⊺} = [0, 0, ..., a_{i}, ..., 0, 0, ...., a_{k}, ...., 0, 0]$ where $a_{i} = 1$ and $a_{k} = - 1$ .
  - In this case $a^{⊺} A a = s_{ii} - 2 s_{ik} + s_{kk}$ and the interval $\overset{x}{ˉ}_{i} - \overset{x}{ˉ}_{k} \pm \frac{( n - 1 ) p}{n ( n - p )} F_{p, n - p} (α) (s_{ii} - 2 s_{ik} + s_{kk})$ contains $μ_{i} - μ_{k}$ with probability $1 - α$
By choosing $a^{⊺} = [0, 0, ..., a_{i}, ..., 0]$ where $a_{i} = 1$ , $1 \leq i \leq p$ , the interval $\overset{x}{ˉ}_{i} \mp \frac{( n - 1 ) p}{n ( n - p )} F_{p, n - p} (α) S_{ii}$ contains $μ_{i}$ with probability $1 - α$

Large sample inferences about a population mean:

When the sample size is large, tests of hypotheses and confidence regions for $μ$ can be constructed without the assumption of a normal distribution
Let $X_{1}, ..., X_{n}$ be a random sample from a population with mean $μ$ and positive definite covariance matrix $\sum$ and $n - p$ is large:
1. $n (\overset{x}{ˉ} - μ_{0})^{⊺} S^{- 1} (\overset{x}{ˉ} - μ_{0}) \geq χ_{p}^{2} (α)$ , reject $H_{0} : μ = μ_{0}$
  - for level of significant $α$
2. $a^{⊺} \overset{ˉ}{X} \pm χ_{p}^{2} (α) \frac{a ^{⊺} S a}{n}$ will contain $a^{⊺} μ$ for every $a$ , with probability approximately $1 - α$
  - Consequently, we can make the $100% (1 - α)$ simultaneous confidence statements $\overset{x}{ˉ}_{i} \pm χ_{p}^{2} (α) \frac{S _{ii}}{n}$ contains $μ_{i}$ for $i = 1, .., p$

Multivariate Quality Control Chart, $\overset{ˉ}{X}$ chart:

Control charts make the variation visible
Allow one to distinguish common from special causes of variation.
One useful control chart is the $\overset{ˉ}{X}$ chart:
1. Plot the individual observations or sample means in time order.
2. Create and plot the centerline $\overset{ˉ}{\overset{x}{ˉ}}$ , the sample mean of all of the observations.
3. Calculate and plot the control limits given by
  - Upper control limit $(U C L) = \overset{ˉ}{\overset{x}{ˉ}}$ + 3(standard deviation)
  - Lower control limit $(L C L) = \overset{ˉ}{\overset{x}{ˉ}}$ - 3(standard deviation)
4. Plot with data points in order from 1; draw 3 sets of identical points for UCL, mean and LCL then connect them

Ellipsoid Format Chart:

only extract 2 dimensions, therefore $p = 2$
- the 95% quality ellipse consists of all x that satisfy $(x - \overset{x}{ˉ})^{T} S^{- 1} (x - \overset{x}{ˉ}) \leq χ_{2}^{2} (0.05)$
Find Cov, Cov $^{- 1}$
Find mean
Determinant = $∣∣ \sum ∣∣ = \prod_{i} λ_{i}$
Trace $= t r (\sum) = \sum λ_{i}$
Find eigenvalues from det and trace (use quadratic)
Find $a, b$
- $= λ_{i} * \frac{( n - 1 ) p}{n ( n - p )} F_{p, n - p} (α)$ for small sample
- $= λ_{i} * c hi - s q (α, 2)$ . Chi-sq $= C H I . I N V (α, 2)$
Theta(rad) $= θ = a t an 2 (σ_{12}, λ_{1} - σ_{11})$
Calculate rotation matrix $R = [cos (θ) sin (θ) - sin (θ) cos (θ)]$
Sample for points on esclipse:
- Generate n points from $0$ to $2 π$ : $i = \frac{k * 2 π}{n}$
- x coordinate of that point $= R_{11} * a * cos (i) + R_{12} * b * sin (i) + \overset{x}{ˉ}_{1}$
- y coordinate of that point $= R_{21} * a * cos (i) + R_{22} * b * sin (i) + \overset{x}{ˉ}_{2}$
- plot points and connect them
Plot data:
- For each data point: minus mean to normalize it and plot on same graph with ellipse
- then find if that point is inside $α$ confidence interval
  - $(C o v_{11}^{- 1} * (x_{i} - \overset{x}{ˉ})_{1} + C o v_{12}^{- 1} * (x_{i} - \overset{x}{ˉ})_{2}) * (x_{i} - \overset{x}{ˉ})_{1} + (C o v_{21}^{- 1} * (x_{i} - \overset{x}{ˉ})_{1} + C o v_{22}^{- 1} * (x_{i} - \overset{x}{ˉ})_{2}) * (x_{i} - \overset{x}{ˉ})_{2}$
  - If $> c hi - s q$ : outside, otherwise inside

$T^{2}$ chart:

For more than 2 dimensions
When a point is out of the control region, individual $\overset{ˉ}{X}$ charts are constructed.
When the lower control limit is less than zero for data that must be nonnegative, LCL is generally set to zero.
Points are displayed in time order rather than as a scatter plot, and this makes patterns and trends visible.
For the $j$ th points, we calculate the T-squared Statistic: $T_{j}^{2} = (x - \overset{x}{ˉ})^{⊺} S^{- 1} (x - \overset{x}{ˉ})$
Then plot the $T^{2}$ -values on a time axis, the lower limit is 0, and upper limt is $U C L = χ_{p}^{2} (α)$ , there is no centerline in $T^{2}$ -chart
When the multivariable $T^{2}$ -chart signals that the $j$ -th unit is out of order, it should be determined which variables are responsible
A region based on Bonferoni Interval is frequently chosen for this purpose. The k-th variable is out of control if $x_{jk}$ does not lie in the interval $\overset{x}{ˉ}_{k} \mp t_{n - 1} (0.005/ p) s_{kk}$ where $p$ is the total nb of measured variables

Inference when some observations are missing:

Often, some components of a vector observation are unavailable. We treat situations where data are missing at random.
To estimate the incomplete data, we use the EM algorithm.
1. Prediction step. Given some estimate $\tilde{θ}$ of the unknown parameters, predict the contribution of any missing observation to the (complete-data) sufficient statistics.
2. Estimation step. Use the predicted sufficient statistics to compute a revised estimate of the parameters.
When the observations $X_{1}, ..., X_{n}$ are a random sample from a p-variate normal population, the prediction–estimation algorithm is based on the complete data sufficient statistics
- $T_{1} = \sum_{j = 1}^{n} X_{j} = n \overset{ˉ}{X}$
- and $T_{2} = j - 1 \sum 2 X_{j} X_{j}^{⊺} = (n - 1) S + n \overset{ˉ}{X} \overset{ˉ}{X}^{⊺}$
We assume that the population mean $μ$ and variance $\sum$ are unknown and estimated with $\tilde{μ}$ and $\tilde{Σ}$
Estimation:
- $\tilde{μ} = \frac{T ~ _{1}}{n}$ and $\tilde{Σ} = \frac{1}{n} \tilde{T}_{2} - \tilde{μ} \tilde{μ}^{⊺}$
Prediction step:
- for each vector $x_{j}$ with missing values, let $x_{j}^{(1)}$ denotes the vector of missing components and $x_{j}^{(2)}$ denotes vector of available components
- Contribution estimation of $x_{j}^{(1)}$ to $T_{1} : x_{j}^{(1)} = E (X_{j}^{(1)} ∣ x_{j}^{(2)}; μ, Σ) = μ^{(1)} + Σ_{12} Σ_{22}^{- 1} (x_{j}^{(2)} - μ^{(2)})$
- Predicted contribution of $x_{j}^{(1)}$ to $T_{2}$ is
  - $x_{j}^{(1)} x_{j}^{(1) ⊺} = E (X_{j}^{(1)} X_{j}^{(1) ⊺} ∣ x_{j}^{(2)}; μ, Σ) = Σ_{11} - Σ_{12} Σ_{22}^{- 1} Σ_{21} + x_{j}^{(1)} x_{j}^{(1) ⊺}$
  - $x_{j}^{(1)} x_{j}^{(2) ⊺} = E (X_{j}^{(1)} X_{j}^{(2) ⊺} ∣ x_{j}^{(2)}; μ, Σ) = x_{j}^{(1)} x_{j}^{(2) ⊺}$

StrixTheKiet Notes

Explorer

Inferences about a Mean Vector

Definition:

Simple Mean Vector Test:

Confidence Regions of Component Means:

Large sample inferences about a population mean:

Multivariate Quality Control Chart, $\overset{ˉ}{X}$ chart:

Ellipsoid Format Chart:

$T^{2}$ chart:

Inference when some observations are missing:

Graph View

Table of Contents

Backlinks

StrixTheKiet Notes

Explorer

Inferences about a Mean Vector

Definition:

Simple Mean Vector Test:

Confidence Regions of Component Means:

Large sample inferences about a population mean:

Multivariate Quality Control Chart, Xˉ chart:

Ellipsoid Format Chart:

T2 chart:

Inference when some observations are missing:

Graph View

Table of Contents

Backlinks

Multivariate Quality Control Chart, $\overset{ˉ}{X}$ chart:

$T^{2}$ chart: