StrixTheKiet Notes

Constrained Least Square

Mar 20, 2025 · 3 min read

Linearly Constrained least square:

Minimize $\| A x - b \|^{2}$ subject to the equality constraints $C x = d$.
$\hat{x}$ is a solution of CLS if $C \hat{x} = d$ and $\| A \hat{x} - b \|^{2} \leq \| A x - b \|^{2}$ holds for any $n$-vector $x$ that satisfies $C x = d$
- the number of rows of $C$ (the number of constraints) is the dimension of the Lagrange multiplier vector $λ$ (written $z$ in the derivation below)
Piecewise-polynomial fitting:
- Split the $x$-axis at a point $a$, then use $p (x)$ to fit the points on the left and $q (x)$ for those on the right
- Piecewise-polynomial $\hat{f}$ has the form:
  - $\hat{f} (x) = \begin{cases} p (x) = θ_{1} + θ_{2} x + θ_{3} x^{2} + θ_{4} x^{3} & x \leq a \\ q (x) = θ_{5} + θ_{6} x + θ_{7} x^{2} + θ_{8} x^{3} & x > a \end{cases}$
- To join the two pieces smoothly, we need $p (a) = q (a)$ and $p^{'} (a) = q^{'} (a)$ as constraints
  - $θ_{1} + θ_{2} a + θ_{3} a^{2} + θ_{4} a^{3} - θ_{5} - θ_{6} a - θ_{7} a^{2} - θ_{8} a^{3} = 0$
  - $θ_{2} + 2 θ_{3} a + 3 θ_{4} a^{2} - θ_{6} - 2 θ_{7} a - 3 θ_{8} a^{2} = 0$
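- Then fitting means minimizing $\sum_{i = 1}^{N} (\hat{f} (x_{i}) - y_{i})^{2}$ subject to these constraints
- Prediction error on $(x_{i}, y_{i})$ is $a_{i}^{⊺} θ - y_{i}$, where $a_{i} = (1, x_{i}, x_{i}^{2}, x_{i}^{3}, 0, 0, 0, 0)$ if $x_{i} \leq a$ and $a_{i} = (0, 0, 0, 0, 1, x_{i}, x_{i}^{2}, x_{i}^{3})$ if $x_{i} > a$
- Sum of squared errors is $\| A θ - y \|^{2}$ where $a_{i}^{⊺}$ are the rows of $A$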
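
As a minimal sketch of how the piecewise-cubic fit becomes a constrained least squares problem, the NumPy code below builds the matrices $A$ and $C$ for a single split point. The function name `piecewise_cubic_matrices` and the demo data are hypothetical, chosen only for illustration.

```python
import numpy as np

def piecewise_cubic_matrices(x, a):
    """Return (A, C) for a two-piece cubic fit split at a.

    Row i of A is (1, x_i, x_i^2, x_i^3, 0, 0, 0, 0) when x_i <= a and
    (0, 0, 0, 0, 1, x_i, x_i^2, x_i^3) when x_i > a, so A @ theta gives
    the fitted values. C @ theta = 0 encodes p(a) = q(a) and p'(a) = q'(a).
    """
    x = np.asarray(x, dtype=float)
    powers = np.vander(x, 4, increasing=True)    # columns 1, x, x^2, x^3
    left = (x <= a)[:, None]                      # True where p(x) applies
    A = np.hstack([np.where(left, powers, 0.0),
                   np.where(left, 0.0, powers)])
    C = np.array([
        [1.0, a, a**2, a**3, -1.0, -a, -a**2, -a**3],        # p(a) - q(a)
        [0.0, 1.0, 2*a, 3*a**2, 0.0, -1.0, -2*a, -3*a**2],   # p'(a) - q'(a)
    ])
    return A, C

# Hypothetical demo: nine sample points on [-1, 1], split at a = 0.
x_demo = np.linspace(-1.0, 1.0, 9)
A_demo, C_demo = piecewise_cubic_matrices(x_demo, a=0.0)
```

With these matrices, the fit is the problem of minimizing $\| A θ - y \|^{2}$ subject to $C θ = 0$, so here $d = 0$ and $p = 2$.
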
Solving CLS problem:
- The constraints are $c_{i}^{⊺} x = d_{i}, i = 1, ..., p$, where $c_{i}^{⊺}$ are the rows of $C$
- Use Lagrange multipliers $z_{1}, ..., z_{p}$
  1. $L (x, z) = f (x) + z_{1} (c_{1}^{⊺} x - d_{1}) + ... + z_{p} (c_{p}^{⊺} x - d_{p})$ where $f (x) = \| A x - b \|^{2}$ and $z$ is the $p$-vector of Lagrange multipliers
  2. Optimality conditions are: $\frac{\partial L}{\partial x_{i}} (\hat{x}, z) = 0, i = 1, ..., n$ and $\frac{\partial L}{\partial z_{i}} (\hat{x}, z) = 0, i = 1, ..., p$
- $\frac{\partial L}{\partial z_{i}} (\hat{x}, z) = c_{i}^{⊺} \hat{x} - d_{i} = 0$, which we already knew
- first $n$ equations have the form: $\frac{\partial L}{\partial x_{i}} (\hat{x}, z) = 2 \sum_{j = 1}^{n} (A^{⊺} A)_{ij} \hat{x}_{j} - 2 (A^{⊺} b)_{i} + \sum_{j = 1}^{p} z_{j} (c_{j})_{i} = 0$
  - in matrix form: $2 (A^{⊺} A) \hat{x} - 2 A^{⊺} b + C^{⊺} z = 0$
- Together with $C \hat{x} = d$, this gives the KKT conditions: $\begin{bmatrix} 2 A^{⊺} A & C^{⊺} \\ C & 0 \end{bmatrix} \begin{bmatrix} \hat{x} \\ z \end{bmatrix} = \begin{bmatrix} 2 A^{⊺} b \\ d \end{bmatrix}$
  - a square set of $n + p$ linear equations in the variables $\hat{x}, z$
- Assuming the KKT matrix is invertible: $\begin{bmatrix} \hat{x} \\ z \end{bmatrix} = \begin{bmatrix} 2 A^{⊺} A & C^{⊺} \\ C & 0 \end{bmatrix}^{- 1} \begin{bmatrix} 2 A^{⊺} b \\ d \end{bmatrix}$
  - KKT matrix is invertible if and only if $C$ has linearly independent rows and $\begin{bmatrix} A \\ C \end{bmatrix}$ has linearly independent columns
  - this implies $m + p \geq n$ and $p \leq n$, where $A$ is $m \times n$
- Compute $\hat{x}$ in $2 m n^{2} + 2 (n + p)^{3}$ flops, order is $n^{3}$ flops
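
The KKT system above can be assembled and solved directly. The following is a minimal NumPy sketch, not a tuned implementation: the function name `solve_cls` and the small example data are hypothetical, and the code assumes the KKT matrix is invertible without checking it.

```python
import numpy as np

def solve_cls(A, b, C, d):
    """Minimize ||A x - b||^2 subject to C x = d via the KKT system.

    Assumes the KKT matrix is invertible; this sketch does not verify
    the rank conditions on C and on the stacked matrix [A; C].
    """
    n = A.shape[1]
    p = C.shape[0]
    kkt = np.block([[2 * A.T @ A, C.T],
                    [C, np.zeros((p, p))]])
    rhs = np.concatenate([2 * A.T @ b, d])
    sol = np.linalg.solve(kkt, rhs)
    return sol[:n], sol[n:]            # x_hat and the multipliers z

# Hypothetical example data with m = 3, n = 2, p = 1.
A = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
b = np.array([1.0, 2.0, 0.0])
C = np.array([[1.0, 1.0]])
d = np.array([1.0])
x_hat, z = solve_cls(A, b, C, d)
```

Solving the linear system with `np.linalg.solve` avoids forming the inverse of the KKT matrix explicitly, which is usually the safer numerical choice.
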
- Verification of solution:
  - For every $x$ that satisfies $C x = d$:
  - $\| A x - b \|^{2} = \| (A x - A \hat{x}) + (A \hat{x} - b) \|^{2}$
    - $= \| A (x - \hat{x}) \|^{2} + \| A \hat{x} - b \|^{2} + 2 (A x - A \hat{x})^{⊺} (A \hat{x} - b)$
  - expand the last term, using $2 A^{⊺} (A \hat{x} - b) = - C^{⊺} z$ and $C x = C \hat{x} = d$:
    - $2 (A x - A \hat{x})^{⊺} (A \hat{x} - b) = 2 (x - \hat{x})^{⊺} A^{⊺} (A \hat{x} - b) = - (x - \hat{x})^{⊺} C^{⊺} z = - (C (x - \hat{x}))^{⊺} z = 0$
  - so $\| A x - b \|^{2} = \| A (x - \hat{x}) \|^{2} + \| A \hat{x} - b \|^{2} \geq \| A \hat{x} - b \|^{2}$, so $\hat{x}$ is a solution
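
The verification argument can also be checked numerically. The sketch below uses random data (an assumption made purely for illustration), solves the KKT system, builds a second feasible point by adding a null-space direction of $C$, and evaluates the cross term and both sides of the identity.

```python
import numpy as np

rng = np.random.default_rng(0)
m, n, p = 6, 4, 2                      # hypothetical problem sizes
A = rng.standard_normal((m, n))
b = rng.standard_normal(m)
C = rng.standard_normal((p, n))
d = rng.standard_normal(p)

# Solve the KKT system for x_hat and z.
kkt = np.block([[2 * A.T @ A, C.T], [C, np.zeros((p, p))]])
sol = np.linalg.solve(kkt, np.concatenate([2 * A.T @ b, d]))
x_hat, z = sol[:n], sol[n:]

# Another feasible point: x_hat plus a vector in the null space of C.
# With C of full row rank, rows p.. of Vt from the full SVD span null(C).
_, _, Vt = np.linalg.svd(C)
x = x_hat + Vt[p:].T @ rng.standard_normal(n - p)

cross_term = 2 * (A @ (x - x_hat)) @ (A @ x_hat - b)
lhs = np.sum((A @ x - b) ** 2)
rhs = np.sum((A @ (x - x_hat)) ** 2) + np.sum((A @ x_hat - b) ** 2)
```

Up to rounding error, `cross_term` should be zero and `lhs` should equal `rhs`, which is the identity used in the argument above.
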

Least-norm Problem:

A simple case of CLS: minimize $\| x \|^{2}$
- with $A = I, b = 0$
- subject to $C x = d$
Solving Least norm problem:
- matrix $\begin{bmatrix} I \\ C \end{bmatrix}$ always has linearly independent columns
- Assume that $C$ has linearly independent rows
- Optimality conditions reduce to $\begin{bmatrix} 2 I & C^{⊺} \\ C & 0 \end{bmatrix} \begin{bmatrix} \hat{x} \\ z \end{bmatrix} = \begin{bmatrix} 0 \\ d \end{bmatrix}$
- then $\hat{x} = - (1/2) C^{⊺} z$ and $- (1/2) C C^{⊺} z = d$
- Plug $z = - 2 (C C^{⊺})^{- 1} d$ into the first equation to get $\hat{x} = C^{⊺} (C C^{⊺})^{- 1} d = C^{†} d$
- so when $C$ has linearly independent rows:
  - $C^{†}$ is a right inverse of $C$
  - so for any $d$, $\hat{x} = C^{†} d$ satisfies $C \hat{x} = d$
  - and we now know $\hat{x}$ is the smallest-norm solution of $C x = d$
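
A small worked example may help here. The sketch below, with hypothetical data, computes the least-norm solution from the formula $C^{⊺} (C C^{⊺})^{- 1} d$ and builds a second solution of $C x = d$ for comparison.

```python
import numpy as np

# Hypothetical data: C has linearly independent rows.
C = np.array([[1.0, 1.0, 0.0],
              [0.0, 1.0, 1.0]])
d = np.array([1.0, 2.0])

# x_hat = C^T (C C^T)^{-1} d, using a solve instead of an explicit inverse.
x_hat = C.T @ np.linalg.solve(C @ C.T, d)

# Any other solution is x_hat plus a null-space vector of C.
# Here (1, -1, 1) is in the null space, since C @ (1, -1, 1) = 0.
x_other = x_hat + 0.5 * np.array([1.0, -1.0, 1.0])

norm_hat = np.linalg.norm(x_hat)
norm_other = np.linalg.norm(x_other)
```

Both points satisfy $C x = d$, but `norm_hat` is smaller than `norm_other`. For a matrix with independent rows, `np.linalg.pinv(C) @ d` should give the same `x_hat`.
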

Constrained least square applications:

Portfolio allocation:

Allocate an investment across $n$ different assets, described by an $n$-vector of weights $w$
- $w_{j}$ is the fraction of the portfolio allocated to asset $j$
- $w_{j}$ can be negative, meaning a short position
- the weights are fractions of the whole portfolio, so $1^{⊺} w = 1$; one of the assets (here asset $n$) can be cash
- $w = e_{n} = (0, 0, \ldots, 1)$ means the portfolio is all cash
Leverage $L = | w_{1} | + ... + | w_{n} |$
- $L = 1$ means all positions are long
- $L > 1$ means at least one short position
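
To illustrate the leverage definition, here is a short plain-Python sketch. The function name `leverage` and both weight vectors are hypothetical examples.

```python
def leverage(w):
    """Leverage L = |w_1| + ... + |w_n| of a weight vector w."""
    return sum(abs(wj) for wj in w)

w_long = [0.5, 0.3, 0.2]      # all long positions, weights sum to 1
w_short = [0.8, 0.6, -0.4]    # short 40% in asset 3, weights still sum to 1

L_long = leverage(w_long)
L_short = leverage(w_short)
```

Both vectors satisfy $1^{⊺} w = 1$; only the one with a negative entry has leverage above 1.
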
Return over a period
- $\hat{r}_{j}$ is the return of asset $j$ over the period, as a fractional increase or decrease in value
- full portfolio return is $\frac{V^{+} - V}{V} = \hat{r}^{⊺} w$, where $V$ and $V^{+}$ are the portfolio values at the start and end of the period
- over $t$ periods with portfolio returns $r_{1}, ..., r_{t}$, the value becomes $V_{t + 1} = V_{1} (1 + r_{1}) (1 + r_{2}) ... (1 + r_{t})$
Return matrix:
- Hold a portfolio with weights $w$ over $T$ periods
- define the $T \times n$ return matrix $R$, where $R_{t j}$ is the return of asset $j$ in period $t$
  - row $t$ is the asset return vector for period $t$, $\tilde{r}_{t}^{⊺}$
  - column $j$ is the return time series of asset $j$
- if the last asset is risk-free, the last column of $R$ is $μ^{r f} 1$ where $μ^{r f}$ is the risk-free per-period interest rate
Portfolio return and risk:
- portfolio return vector is the $T$-vector $r = Rw$ (entry $t$ is the portfolio return in period $t$)
- average return is $avg (r)$ and risk is $std (r)$
- for small per-period returns, we have $V_{T + 1} = V_{1} (1 + r_{1}) ... (1 + r_{T}) \approx V_{1} + V_{1} (r_{1} + ... + r_{T}) = V_{1} + T \, avg (r) V_{1}$
  - so the average return approximates the average per-period increase in portfolio value, as a fraction of $V_{1}$
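
The small-return approximation can be illustrated with a few lines of plain Python. The function names and the return series below are hypothetical; the sketch compares the exact compounded value with the first-order approximation.

```python
def final_value(V1, returns):
    """Compound V1 through per-period returns: V1 (1 + r_1) ... (1 + r_T)."""
    value = V1
    for r in returns:
        value *= 1.0 + r
    return value

def approx_final_value(V1, returns):
    """First-order approximation V1 + T avg(r) V1 = V1 (1 + r_1 + ... + r_T)."""
    return V1 * (1.0 + sum(returns))

returns = [0.001, -0.002, 0.0015, 0.0005]   # hypothetical small returns
exact = final_value(10_000.0, returns)
approx = approx_final_value(10_000.0, returns)
```

For returns of this size the two values differ only by products of returns, which the approximation drops. The gap grows as the per-period returns get larger.
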
Annualized return and risk:
- Mean return and risk are often expressed in annualized form
- If there are $P$ trading periods per year:
  - annualized return $= P avg (r)$
  - annualized risk $= \sqrt{P} std (r)$
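
As a rough sketch of annualization, the code below scales the average return by $P$ and the risk by $\sqrt{P}$, which is the usual convention. It takes $std$ to be the population standard deviation (dividing by $T$). The function name `annualize` and the daily return series are hypothetical.

```python
import math
import statistics

def annualize(r, P):
    """Return (annualized return, annualized risk) for per-period returns r.

    Uses P * avg(r) for the return and sqrt(P) * std(r) for the risk,
    with std computed as the population standard deviation.
    """
    return P * statistics.fmean(r), math.sqrt(P) * statistics.pstdev(r)

daily = [0.01, -0.01, 0.01, -0.01]    # hypothetical daily returns
ann_return, ann_risk = annualize(daily, P=250)
```

With $P = 250$ trading days, the risk is multiplied by about 15.8 rather than 250.
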
Portfolio optimization:
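- minimize the risk: $std (Rw)^{2} = (1/ T) \| Rw - ρ 1 \|^{2}$
  - with constraints $1^{⊺} w = 1$ and $avg (Rw) = ρ$, meaning the mean past return is $ρ$
- solutions $w$ are Pareto optimal for the trade-off between risk and return
- Convert to constrained least squares:
  - minimize $\| Rw - ρ 1 \|^{2}$
  - subject to $\begin{bmatrix} 1^{⊺} \\ μ^{⊺} \end{bmatrix} w = \begin{bmatrix} 1 \\ ρ \end{bmatrix}$
- $μ = R^{⊺} 1 / T$ is the $n$-vector of average past asset returns
- solution: $\begin{bmatrix} w \\ z_{1} \\ z_{2} \end{bmatrix} = \begin{bmatrix} 2 R^{⊺} R & 1 & μ \\ 1^{⊺} & 0 & 0 \\ μ^{⊺} & 0 & 0 \end{bmatrix}^{- 1} \begin{bmatrix} 2 ρ T μ \\ 1 \\ ρ \end{bmatrix}$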
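
The portfolio problem can be solved with the same KKT approach. The following NumPy sketch is illustrative only: the function name `min_risk_weights` is hypothetical, the return matrix is synthetic random data rather than real market data, and the code assumes the KKT matrix is invertible.

```python
import numpy as np

def min_risk_weights(R, rho):
    """Weights minimizing ||R w - rho 1||^2 s.t. 1^T w = 1 and mu^T w = rho."""
    T, n = R.shape
    mu = R.T @ np.ones(T) / T                 # average past asset returns
    ones = np.ones(n)
    kkt = np.zeros((n + 2, n + 2))
    kkt[:n, :n] = 2 * R.T @ R
    kkt[:n, n] = kkt[n, :n] = ones            # constraint 1^T w = 1
    kkt[:n, n + 1] = kkt[n + 1, :n] = mu      # constraint mu^T w = rho
    rhs = np.concatenate([2 * rho * T * mu, [1.0, rho]])
    return np.linalg.solve(kkt, rhs)[:n]      # drop the multipliers z_1, z_2

# Synthetic returns for illustration only: T = 200 periods, n = 4 assets.
rng = np.random.default_rng(1)
R = 0.001 + 0.01 * rng.standard_normal((200, 4))
rho = 0.001
w = min_risk_weights(R, rho)
portfolio_returns = R @ w
```

The resulting weights sum to 1 and give an average past return of `rho`. The risk on the same data is then `portfolio_returns.std()`, which divides by $T$ by default.
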
