Definition:

  • A scalar and an -vector are related by model, which is
    • is independent variable, the feature vector
    • is the outcome variable, sth we want to predict
  • , is the th data pair
    • isthe jth component of ith data point
  • Choose model are basis functions that we choose
    • are model parameters we can choose
  • is the model prediction of
  • Define:
    • as vectors of outcomes
    • as vectors of predictions
    • as vectors of residuals
      • for
  • Define a matrix mapping with so
  • Then we choose to minize
  • Similar to ordinary least square, (if columns of are linearly independent)
  • is the minimum mean-square error
  • For the weights of , we define the first weight as other weight are relative to it
  • The function but we can add another last element in to incorporate the value of , so we have

For :

  • Then so the model is a constant

For :

  • Model has form
  • Matrix has form:
  • Then can be found with

For polynomial:

  • Model is degree , here is powered, not the element
  • is then Vandermonde Matrix

Least Square Classification

Multi-objective Least Square