A Star Search

A* Search
A* Search
UCS Greedy
A*
Combining UCS and Greedy
o Uniform-cost orders by path cost, or backward cost g(n)
o Greedy orders by goal proximity, or forward cost h(n)
8 g=0
S h=6
h=1 g=1
e
1 h=5 a
1 3 2 g=2 g=9
S a d G
h=6 b d g=4 e h=1
h=6 h=5 h=2
1 h=2 h=0
1 g=3 g=6
c b g = 10
h=7 c G h=0 d
h=2
h=7 h=6
g = 12
G h=0
o A* Search orders by the sum: f(n) = g(n) + h(n)
Example: Teg
When should A* terminate?
o Should we stop when we enqueue a goal?

h=2
gh+
2 A 2
S 033
S h=3 h=0 G S->A 224

S->B 213
2 B 3
S->B->G 5 0 5
h=1
S->A->G 4 0 4
o No: only stop when we dequeue a goal
Is A* Optimal?
h=6
1 A 3
gh+
S 077
S h=7
G h=0 S->A 1 6 7
S->G 505
5
o What went wrong?

o Actual bad goal cost < estimated good goal cost
o We need estimates to be less than actual costs!
Admissible Heuristics
Idea: Admissibility
Inadmissible (pessimistic) heuristics Admissible (optimistic) heuristics

break optimality by trapping slow down bad plans but
good plans on the fringe never outweigh true costs
Admissible Heuristics
o A heuristic h is admissible (optimistic) if:
where is the true cost to a nearest goal
o Examples:
0.0
15 11.5
o Coming up with admissible heuristics is most of what’s

involved in using A* in practice.
Optimality of A* Tree Search
Optimality of A* Tree Search
Assume:
o A is an optimal goal node …
o B is a suboptimal goal node
o h is admissible
Claim:
o A will exit the fringe before B
Optimality of A* Tree Search: Blocking
Proof:
…
o Imagine B is on the fringe
o Some ancestor n of A is on the
fringe, too (maybe A!)
o Claim: n will be expanded
before B
1. f(n) is less or equal to f(A)
Definition of f-cost
Admissibility of h
h = 0 at a goal
Proof:
…
o Claim: n will be expanded
before B
2. f(A) is less than f(B)
B is suboptimal
h = 0 at a goal
Proof:
…
o Claim: n will be expanded before B
2. f(A) is less than f(B)
3. n expands before B
o All ancestors of A expand before B
o A expands before B
o A* search is optimal
Properties of A*
Properties of A*
Uniform-Cost A*
b b
… …
UCS vs A* Contours
o Uniform-cost expands equally in

all “directions”
Start Goal
o A* expands mainly toward the

goal, but does hedge its bets to
ensure optimality Start Goal
[Demo: contours UCS / greedy / A* empty (L3D1)]

[Demo: contours A* pacman small maze (L3D5)]
Video of Demo Contours (Empty) -- UCS
Video of Demo Contours (Empty) -- Greedy
Video of Demo Contours (Empty) – A*
A* Applications
o Video games
o Pathing / routing problems
o Resource planning problems
o Robot motion planning
o Language analysis
o Machine translation
o Speech recognition
o…
Creating Heuristics
Creating Admissible Heuristics
o Most of the work in solving hard search problems optimally is in
coming up with admissible heuristics
o Often, admissible heuristics are solutions to relaxed problems,

where new actions are available
366
15
o Inadmissible heuristics are often useful too

Example: 8 Puzzle
Start State Actions Goal State
o What are the states?

o How many states? Admissibleh
o What are the actions?
euristics?
o How many successors from the start state?
o What should the costs be?
8 Puzzle I
o Heuristic: Number of tiles misplaced
o Why is it admissible?
o h(start) =8
o This is a relaxed-problem heuristic
Start State Goal State
Average nodes expanded

when the optimal path has…
…4 steps …8 steps …12 steps
UCS 112 6,300 3.6 x 106
TILES 13 39 227
Statistics from Andrew Moore

8 Puzzle II
o What if we had an easier 8-puzzle

where any tile could slide any direction
at any time, ignoring other tiles?
o Total Manhattan distance

Start State Goal State
o Why is it admissible?
Average nodes expanded
o h(start) = 3 + 1 + 2 + … = 18 when the optimal path has…
…4 steps …8 steps …12 steps
TILES 13 39 227
MANHATTAN 12 25 73
8 Puzzle III
o How about using the actual cost as a heuristic?
o Would it be admissible?
o Would we save on nodes expanded?
o What’s wrong with it?
o With A*: a trade-off between quality of estimate and work per node
o As heuristics get closer to the true cost, you will expand fewer nodes but
usually do more work per node to compute the heuristic itself
Graph Search
Tree Search: Extra Work!
o Failure to detect repeated states can cause exponentially more
work.
State Graph Search Tree
Graph Search
o In BFS, for example, we shouldn’t bother expanding the circled nodes
(why?)
S
d e p
b c e h r q
a a h r p q f
p q f q c G
q c G a
a
Graph Search
o Idea: never expand a state twice
o How to implement:
o Tree search + set of expanded states (“closed set”)
o Expand the search tree node-by-node, but…
o Before expanding a node, check to make sure its state has
never been expanded before
o If not new, skip it, if new add to closed set
o Important: store the closed set as a set, not a list
o Can graph search wreck completeness? Why/why not?
o How about optimality?

A* Graph Search Gone Wrong?
State space graph Search tree
S (0+2)
A
1
1
S h=4 A (1+4) B (1+1)
C
h=1
h=2 1
2 C (2+1) C (3+1)
3
B
G (5+0) G (6+0)
h=1
G
h=0
Closed Set:S B C A
Consistency of Heuristics
o Main idea: estimated heuristic costs ≤ actual costs
A o Admissibility: heuristic cost ≤ actual cost to goal
1 h(A) ≤ actual cost from A to G

h=4 C h=1
o Consistency: heuristic “arc” cost ≤ actual cost for each
h=2
arc
3 h(A) – h(C) ≤ cost(A to C)
o Consequences of consistency:
o The f value along a path never decreases
G
h(A) ≤ cost(A to C) + h(C)
o A* graph search is optimal

Optimality of A* Search
o With a admissible heuristic, Tree A* is optimal.
o With a consistent heuristic, Graph A* is optimal.
o With h=0, the same proofs shows that UCS is optimal.
A*: Summary
A*: Summary
o A* uses both backward costs and (estimates of) forward
costs
o A* is optimal with admissible / consistent heuristics
o Heuristic design is key: often use relaxed problems

Tree Search Pseudo-Code
Graph Search Pseudo-Code
The One Queue
o All these search algorithms are
the same except for fringe
strategies
o Conceptually, all fringes are priority
queues (i.e. collections of nodes
with attached priorities)
o Practically, for DFS and BFS, you
can avoid the log(n) overhead from
an actual priority queue, by using
stacks and queues
o Can even code one implementation
that takes a variable queuing object
Search and Models
o Search operates over

models of the world
o The agent doesn’t
actually try all the plans
out in the real world!
o Planning is all “in
simulation”
o Your search is only as
good as your models…
Search Gone Wrong?

A Star Search

Enviado por

Dados do documento

Direitos autorais

Formatos disponíveis

Compartilhar este documento

Compartilhar ou incorporar documento

Opções de compartilhamento

Você considera este documento útil?

Este conteúdo é inapropriado?

Direitos autorais:

Formatos disponíveis

A Star Search

Enviado por

Direitos autorais:

Formatos disponíveis

A* Search

o Should we stop when we enqueue a goal?

S h=3 h=0 G S->A 224

o What went wrong?

Inadmissible (pessimistic) heuristics Admissible (optimistic) heuristics

where is the true cost to a nearest goal

o Coming up with admissible heuristics is most of what’s

o Uniform-cost expands equally in

o A* expands mainly toward the

[Demo: contours UCS / greedy / A* empty (L3D1)]

o Often, admissible heuristics are solutions to relaxed problems,

o Inadmissible heuristics are often useful too

Start State Actions Goal State

o What are the states?

Average nodes expanded

Statistics from Andrew Moore

o What if we had an easier 8-puzzle

o Total Manhattan distance

o Important: store the closed set as a set, not a list

o Can graph search wreck completeness? Why/why not?

o How about optimality?

A o Admissibility: heuristic cost ≤ actual cost to goal

1 h(A) ≤ actual cost from A to G

o A* graph search is optimal

o A* is optimal with admissible / consistent heuristics

o Heuristic design is key: often use relaxed problems

o Search operates over

Você também pode gostar