Constraint Propagation: Inference in CSPs
A CSP algorithm has choices. It can generate successors by choosing a new variable assignment, or it can do a specific type of inference called constraint propagation: using the constraints to reduce the number of legal values for a variable, which in turn can reduce the legal values for another variable, and so on.
Constraint propagation may be intertwined with search, or it may be done as a preprocessing step, before search starts. Sometimes this preprocessing can solve the whole problem, so no search is required at all.
Node consistency
A single variable (corresponding to a node in the CSP graph) is node-consistent if all the values in the variable’s domain satisfy the variable’s unary constraints.
It is easy to eliminate all the unary constraints in a CSP by reducing the domain of variables with unary constraints at the start of the solving process. It is also possible to transform all n-ary constraints into binary ones.
Arc consistency
A variable in a CSP is arc-consistent if every value in its domain satisfies the variable’s binary constraints. More formally, $X_i$ is arc-consistent with respect to another variable $X_j$ if for every value in the current domain $D_i$ there is some value in the domain $D_j$ that satisfies the binary constraint on the arc $(X_i, X_j)$. A graph is arc-consistent if every variable is arc-consistent with every other variable.
The most popular algorithm for enforcing arc consistency is called AC-3.
To make every variable arc-consistent, the AC-3 algorithm maintains a queue of arcs to consider. Initially, the queue contains all the arcs in the CSP. (Each binary constraint becomes two arcs, one in each direction.) AC-3 then pops off an arbitrary arc $(X_i, X_j)$ from the queue and makes $X_i$ arc-consistent with respect to $X_j$.
If this leaves $D_i$ unchanged, the algorithm just moves on to the next arc. But if this revises $D_i$(makes the domain smaller), then we add to the queue all arcs $(X_k, X_i)$ where $X_k$ is a neighbor of $X_i$. We need to do that because the change in $D_i$ might enable further reductions in $D_j$, even if we have previously considered $X_k$. If is revised down to nothing, then we know the whole CSP has no consistent solution, and AC-3 can immediately return failure. Otherwise, we keep checking, trying to remove values from the domains of variables until no more arcs are in the queue.
At that point, we are left with a CSP that is equivalent to the original CSP—they both have the same solutions—but the arc-consistent CSP will be faster to search because its variables have smaller domains.
The complexity of AC-3 can be analyzed as follows. Assume a CSP with $n$ variables, each with domain size at most $d$, and with $d$ binary constraints (arcs). Each arc $(X_k, X_i)$ can be inserted in the queue only $d$ times because $X_i$ has at most $d$ values to delete. Checking consistency of an arc can be done in $O(d^2)$ time, so we get total $O(cd^3)$ worst-case time.
Path consistency
Arc consistency tightens down the domains (unary constraints) using the arcs (binary constraints). Path consistency tightens the binary constraints by using implicit constraints that are inferred by looking at triples of variables.
K-consistency
Stronger forms of propagation can be defined with the notion of k-consistency.