For some problems, the state description contains all of the information relevant to a solution; the path taken to reach the solution is unimportant.
Examples: the 8-queens problem, VLSI layout, and the traveling salesman problem, where only the final configuration matters, not the sequence of moves that produced it.
Start with a state configuration that violates some of the constraints for being a solution, and make gradual modifications to eliminate the violations.
One way to visualize iterative improvement algorithms is to imagine every possible state laid out on a landscape with the height of each state corresponding to its goodness. Optimal solutions will appear as the highest points. Iterative improvement works by moving around on the landscape seeking out the peaks by looking only at the local vicinity.
Algorithm:
No search tree is maintained, only the current state.
Similar to greedy best-first search, but only the immediate successors of the current state are considered; the search always moves to the best of those neighbors.
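The loop above can be sketched as follows (the `successors` and `evaluate` functions, and the toy objective, are illustrative stand-ins for a problem-specific implementation):

```python
def hill_climb(start, successors, evaluate):
    """Greedy local search: keep only the current state and repeatedly
    move to the best neighbor until no neighbor improves on it."""
    current = start
    while True:
        best = max(successors(current), key=evaluate)
        if evaluate(best) <= evaluate(current):
            return current          # local maximum (or plateau edge)
        current = best

# Toy example: maximize -(x - 3)^2 over integer states.
result = hill_climb(0,
                    successors=lambda x: [x - 1, x + 1],
                    evaluate=lambda x: -(x - 3) ** 2)
# result == 3
```

Note that no search tree is kept: memory use is constant regardless of how many steps the climb takes.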
Problems:
The shape of the state-space landscape strongly influences the success of the search. A very spiky surface that is flat in between the spikes will be very difficult to search: the flat regions (plateaus) offer no gradient to follow, and the narrow spikes are easy to miss.
Can be combined with nondeterministic search to recover from local maxima.
Random-restart hill-climbing is a variant in which reaching a local maximum causes the current state to be saved and the search restarted from a random point. After several restarts, the best state found is returned. Given enough restarts, this method finds the optimal solution with probability approaching 1.
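A minimal sketch of the restart loop (the bumpy objective, the state range, and the restart count are all illustrative choices, not part of the original notes):

```python
import random

def hill_climb(state, successors, evaluate):
    # Climb until no neighbor improves on the current state.
    while True:
        best = max(successors(state), key=evaluate)
        if evaluate(best) <= evaluate(state):
            return state
        state = best

def random_restart(random_state, successors, evaluate, restarts=20):
    """Run hill-climbing from several random start states and
    keep the best local maximum found."""
    results = [hill_climb(random_state(), successors, evaluate)
               for _ in range(restarts)]
    return max(results, key=evaluate)

# Toy objective with misleading local bumps at multiples of 7.
def score(x):
    return -abs(x - 50) + 8 * (x % 7 == 0)

result = random_restart(lambda: random.randint(0, 100),
                        lambda x: [max(x - 1, 0), min(x + 1, 100)],
                        score)
```

Each individual climb can still get stuck, but the maximum over many independent climbs is far more likely to land on the global optimum.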
Gradient descent is an inverted version of hill-climbing in which better states are represented by lower cost values. Local minima cause problems instead of local maxima.
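The same local-search loop works for minimization by flipping the comparison (the cost function and successor set here are illustrative):

```python
def descend(state, successors, cost):
    # Inverted hill-climbing: move to the cheapest neighbor
    # until no neighbor has lower cost (a local minimum).
    while True:
        best = min(successors(state), key=cost)
        if cost(best) >= cost(state):
            return state
        state = best

# Minimize the cost (x - 3)^2 starting from x = 10.
result = descend(10,
                 successors=lambda x: [x - 1, x + 1],
                 cost=lambda x: (x - 3) ** 2)
# result == 3
```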
Instead of restarting from a random point, we can allow the search to take some downhill steps to try to escape local maxima.
Probability of downward steps is controlled by temperature parameter.
High temperature implies high chance of trying locally "bad" moves, allowing nondeterministic exploration.
Low temperature makes search more deterministic (like hill-climbing).
Temperature begins high and gradually decreases according to a predetermined annealing schedule.
Initially we are willing to try out lots of possible paths, but over time we gradually settle in on the most promising path.
If the temperature is lowered slowly enough, a global optimum will be found with probability approaching 1.
In practice, this schedule is often too slow and we have to accept suboptimal solutions.
Algorithm:
set current to start state
for time = 1 to infinity {
    set Temperature to annealing_schedule[time]
    if Temperature = 0 { return current }
    randomly pick a next state from successors of current
    set ΔE to eval(next) - eval(current)
    if ΔE > 0 { set current to next }
    else { set current to next with probability e^(ΔE/Temperature) }
}
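The pseudocode above translates directly into Python (the toy objective and the linear cooling schedule are illustrative assumptions; real problems need a problem-specific `successors`, `evaluate`, and schedule):

```python
import itertools
import math
import random

def simulated_annealing(start, successors, evaluate, schedule):
    """Always accept uphill moves; accept downhill moves with
    probability e^(dE / Temperature)."""
    current = start
    for time in itertools.count(1):
        T = schedule(time)
        if T <= 0:
            return current
        nxt = random.choice(successors(current))
        dE = evaluate(nxt) - evaluate(current)
        if dE > 0 or random.random() < math.exp(dE / T):
            current = nxt

# Toy run: maximize -(x - 3)^2, cooling linearly to zero over 100 steps.
random.seed(0)
best = simulated_annealing(
    0,
    successors=lambda x: [x - 1, x + 1],
    evaluate=lambda x: -(x - 3) ** 2,
    schedule=lambda t: max(0.0, 1.0 - t / 100))
```

Because `dE > 0` is checked first, `math.exp` is only ever evaluated for downhill moves, where it yields a probability in (0, 1].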
Probability of moving downhill for negative ΔE values at different temperature ranges:
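For instance, taking a fixed downhill move of ΔE = -1 (an illustrative value), the acceptance probability e^(ΔE/T) at a few temperatures can be computed directly:

```python
import math

# Acceptance probability e^(dE/T) for a downhill move of dE = -1:
for T in (10.0, 1.0, 0.1):
    print(f"T = {T:>4}: P(accept) = {math.exp(-1.0 / T):.5f}")
# High T: the move is accepted about 90% of the time;
# T = 1: about 37% of the time;
# low T: almost never (under 0.005%).
```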