Definition:

  • Calculating some useful quantity from a joint probability distribution
  • We have a joint distribution of a joint distribution and wish to find
  • Evidence variables are variables in that help to query
    • this case:
  • Query variables are variables we wish to learn out
    • this case
  • Hiddent variables are variables that are not in question
  • time complexity to find probability
  • Ex:
    • Posterior probability: , query with evidences
    • Most likely explaination:

Basic Inference:

  • ex:
    • , therefore proportional, but the denominator is easy to calculate so we ignore for now
  • which is very long

Factors:

  1. Joint Distribution factor
    • Joint distributions: entries sum to 1
      • varied and
    • Selected Joint: only selected entries of joint distribution for some fixed value of some variables
      • assigned , varied
      • nb of dimenionality of table is the number of varying variable
  2. Conditional Distribution
    • Single conditional: entries , sums to 1
      • fixed and
    • Family of conditionals: : sums to domain of
      • varied and
      • has multiple conditionals
  3. Specified Family:
    • fixed but for all

Inference by Enumeration

  • Procedure: Join all factors, eliminate all hidden variables, normalize
    • ex:
      • R (rain) T (traffic) L (late)
      • we have table for R, T|R and L|T
      • and we wish to query
    1. Join all factors: similar to JOIN clause
      • Build a new table from conditional table and independent
        • then
    2. Eliminate all hidden variables:
      • Marginalize the joint table,
    3. Normalize to sum of 1
  • , from right to left: joint r, joint t, eliminate r, eliminate t
  • This way, it is join up the whole joint distribution before summing out the hidden variables, huge space complexity
  • We can normalize early, which is Variable Elimination

Variable Elimination:

Procedure:
  • Similar to inference by enumeration but we marginalize early to save space
  • Join new variable from joint distribution, marginalize it and join new, marginalize,…
  • With the same example:
    1. Join
    2. Join
  • While there are still hidden variables, pick the hidden variable and join it and sum it out (eliminate)
  • If there is evidence in the query, start with factors that join with the evidence first and only select the positive when joining
    • ex: query
  • If the final table is not but , use Bayes’ theorem
  • Normalize then we have the answer
    • ex: normalize between such
Variable Elimination Ordering:
  • Ordering which variable to eliminate can reduce much of complicity
  • ex:
    • With query , we join and marginalize right away
    • first way: top down, then times each to sum out
      • at , we have all and , therefore,
    • second way: bottom up, join all pairs first, then one by one join to have
      • at steps, we only have entries, then times and normalize to
  • Complexity is determined by the largest factor
  • There is no order that always result to small factors