Escolar Documentos
Profissional Documentos
Cultura Documentos
net/publication/3810360
CITATIONS READS
910 367
2 authors:
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Gianni A. Di Caro on 19 February 2014.
use their memory to evaluate the built solution and to re- where Nik ⊆ Ni is the feasible neighborhood of node i
trace the same tour backward and increase the intensity of for ant k (that is, the set of cities ant k has not yet visited) as
the pheromone trails τij of visited connections lij (Figure 1, defined by using the ant private memory Mk and the problem
lines 13 → 16). This has the effect of making the visited con- constraints.
nections become more desirable for future ants. Then the ants The role of the parameters α and β is the following. If
die, freeing all the allocated resources. In AS all the ants de- α = 0, the closest cities are more likely to be selected: this
posit pheromone and no problem-specific daemon actions are corresponds to a classical stochastic greedy algorithm (with
performed. The triggering of pheromone evaporation hap- multiple starting points since ants are initially randomly dis-
pens after all ants have completed their tours. Of course, it tributed on the nodes). If on the contrary β = 0, only
would be easy to add a local optimization daemon action, like pheromone amplification is at work: this method will lead
a 3-opt procedure (Lin, 1965); this has been done in most of to the rapid emergence of a stagnation, that is, a situation
the ACO algorithms for the TSP that have been developed as in which all ants make the same tour which, in general, is
extensions of AS (see for example (Dorigo & Gambardella, strongly sub-optimal (Dorigo et al., 1996). An appropriate
1997; Stützle & Hoos, 1998a)). trade-off has to be set between heuristic value and trail inten-
The amount of pheromone trail τij (t) maintained on con- sity.
nection lij is intended to represent the learned desirability of After all ants have completed their tour, each ant k de-
choosing city j when in city i (which also corresponds to the posits a quantity of pheromone ∆τ k (t) = 1/Jψk (t) on each
desirability that arc lij belongs to the tour built by an ant).
connection lij that it has used, where Jψk (t) is the length of
The pheromone trail information is changed during prob-
lem solution to reflect the experience acquired by ants during tour ψ k (t) done by ant k at iteration t:
problem solving. Ants deposit an amount of pheromone pro-
τij (t) ← τij (t) + ∆τ k (t),
∀ lij ∈ ψ k (t), k = 1, . . . , m
portional to the quality of the solutions ψ they produced: the
(3)
shorter the tour generated by an ant, the greater the amount
where the number m of ants at each iteration is maintained
of pheromone it deposits on the arcs which it used to gener-
constant) and is set to m = NC . These parameters settings,
ate the tour. This choice helps to direct search towards good
as well as the settings α = 1, β = 5 and ρ = 0.5, were
solutions.
experimentally found to be good by Dorigo (Dorigo, 1992).
This way of setting the value ∆τ k (t) makes it a function of part of the whole problem solution, defined in terms of paths
the ant’s performance: the shorter the tour done, the greater between all the pairs (i, j) in the network, (ii) the costs asso-
the amount of pheromone deposited. ciated to the connections are not statically assigned: they de-
After pheromone updating has been performed by the pend on the connection’s physical properties and on the traf-
ants1 , pheromone evaporation is triggered: the following rule fic crossing the connection, that interacts recursively with the
is applied to all the arcs lij of the graph G routing decisions.
In the following subsection we present a simplified version
τij (t) ← (1 − ρ)τij (t) (4) of the AntNet algorithm. A full description of AntNet can be
found in (Di Caro & Dorigo, 1998).
where ρ ∈ (0, 1] is the pheromone trail decay coefficient (the
initial amount of pheromone τij (0) is set to a small positive
4.1 AntNet
constant value τ0 on all arcs).
In AntNet, each ant searches for a minimum cost path be-
4 ACO for routing in communications networks tween a pair of nodes of the network. Ants are launched from
each network node towards destination nodes randomly se-
The generic routing problem in communications networks lected to match the traffic patterns. Each ant has a source
can be informally stated as the problem of building and us- node s and a destination node d, and moves from s to d hop-
ing routing tables to direct data traffic so that some measure ping from one node to the next till node d is reached. When
of network performance, function of the network type and of ant k is in node i, it chooses the next node j to move to ac-
the services provided, is maximized. cording to a probabilistic decision rule which is a function of
We can use the terminology introduced in Section 2 to give the ant’s memory Mk and of the local ant-routing table Ai .
a formal definition of the routing problem. Let the sets C and Pheromone trails are still connected to arcs, but are mem-
L correspond respectively to the sets of processing nodes and orized in variables associated to arc-destination pairs. That is,
of communication links of the real network. Let G = (C, L) each directed arc (i, j) has NC − 1 trail values τijd ∈ [0, 1]
be a directed graph, where each node in the set C represents a associated, one for each possible destination node d an ant
network node with processing/queuing and forwarding capa- located in node i can have (therefore, in general, τijd 6= τjid ).
bilities, and each oriented arc in L is a directional transmis- Each arc has also associated an heuristic value ηij ∈ [0, 1]
sion system (link). Each link has associated a cost measure independent of the final destination. The heuristic values are
defined by its physical properties and crossing traffic flow. set to the following values:
Network applications generate data flows from source to des- qij
tination nodes. For each node in the network, the local rout- ηij = 1 − P (5)
l∈Ni qil
ing component uses the local routing table to choose the best
outgoing link to direct incoming data towards their destina- where qij is the length (in bits waiting to be sent) of the
tion nodes. The routing table Ri = [rijd ] of a generic node queue of the link connecting node i with its neighbor j.
i, where Ni is the set of neighbors of i, says to data pack- In AntNet, as well as in most other implementations of
ets entering node i and directed towards destination node d ACO algorithms for routing problems, the daemon compo-
which should be the next node j ∈ Ni to move to. Routing nent (line 6 of the ACO meta-heuristic of Figure 1) is not
tables are bi-dimensional because the choice of the neighbor present.
node to which a data packet entering a generic node i should The local ant-routing table Ai is obtained by a functional
be forwarded is a function of the packet destination node d. composition of the local pheromone trails τijd and heuristic
Ant-routing tables possess the same bi-dimensional structure: values ηij . While building the path to the destination, ants
pheromone trails associated to each connection are vectors of move using the same link queues as data. In this way ants
NC − 1 values. In fact, ant-routing tables, in all the ACO experience the same delays as data packets and the time Tsd
implementations for routing, are used to build the routing ta- elapsed while moving from the source node s to the destina-
bles by means of implementation-dependent transformations. tion node d can be used as a measure of the path quality. The
These vectorial pheromone trails are the natural extension of overall “goodness” of a path can be evaluated by an heuristic
the scalar trails used for the TSP. function of the trip time Tsd and of a local adaptive statisti-
Other important differences with the TSP implementation cal model maintained in each node. In fact, paths need to be
arise from the different nature of the two problems: (i) each evaluated relative to the network status because a trip time T
ant is assigned a defined pair (s, d) of start-destination nodes judged of low quality under low congestion conditions could
and, discovering a path between s and d, the ant builds only a be an excellent one under high traffic load. Once the generic
1 It should be noted that in the original ant system (Dorigo, 1992; Dorigo
ant k has completed a path, it deposits on the visited nodes an
amount of pheromone ∆τ k (t) proportional to the goodness
et al., 1991) pheromone evaporation was performed before pheromone up-
dating. The algorithm presented here and the original one are exactly of the path it built. To this purpose, after reaching its desti-
the same if the values ∆τ k (t) used in Equation 3 are set to ∆τ k (t) = nation node, the ant moves back to its source nodes along the
1/((1 − ρ) · Jψ k (t)).
same path but backward and using high priority queues, to 5 Other ACO meta-heuristic applications
allow a fast propagation of the collected information.
During this backward path from d to s the ant k increases There are now available numerous successful implementa-
the pheromone trail value τijd (t) of each connection lij previ- tions of the ACO meta-heuristic applied to a number of dif-
ously used while it was moving from s to d. The pheromone ferent combinatorial optimization problems.
trail intensity is increased by applying the following rule: The most widely studied problems using ACO algorithms
are the traveling salesman problem and the quadratic assign-
τijd (t) ← τijd (t) + ∆τ k (t) (6) ment problem (QAP).
Results obtained by the application of ACO algorithms to
The reason the ant updates the pheromone trails during its the TSP are very encouraging (see (Stützle & Dorigo, 1999b)
backward trip is that, before it can compute the amount of for an overview of applications of ACO algorithms to the
pheromone ∆τ k (t) to deposit on the visited arcs it needs to TSP): they are often better than those obtained using other
complete a path from source to destination to evaluate it. general purpose heuristics like evolutionary computation or
After the pheromone trail on the visited arc has been up- simulated annealing. Also, when adding to ACO algorithms
dated, the pheromone value of all the outgoing connections rather unsophisticated local search procedures based on 3-
of the same node i, relative to the destination d, evaporates:2 opt (Lin, 1965), the quality of the results obtained (Dorigo
& Gambardella, 1997) is close to that obtainable by more so-
τijd (t)
τijd (t) ← , ∀j ∈ Ni (7) phisticated methods. More research will be necessary to as-
(1 + ∆τ k (t)) sess whether ACO algorithms can reach the performance of
where Ni is the set of neighbors of node i. state-of-the-art algorithms like Iterated Lin-Kernighan (John-
son & McGeoch, 1997).
As we said, AntNet’s ant-routing table Ai = [aijd (t)] As in the TSP case, the first application of an ACO al-
of node i is obtained, as usual, by the composition of the gorithm to the QAP has been that of Ant System (Maniezzo
pheromone trail values with the local heuristic values. This is et al., 1994). In the last two years several ACO algorithms
done as follows: for the QAP have been presented by Maniezzo and Colorni
(Maniezzo & Colorni, 1999, in press), Maniezzo (Maniezzo,
wτijd (t) + (1 − w)ηij 1998), and Stützle (Stützle & Hoos, 1998a). Currently, ACO
aijd (t) = (8)
w + (1 − w)(|Ni | − 1) algorithms are among the best heuristics for attacking real-
where j ∈ Ni , d is the destination node, w ∈ [0, 1] is a life QAP instances3 . We refer to (Stützle & Dorigo, 1999a)
weighting factor and the denominator is a normalization term. for an overview of these approaches.
The ants decision rule is then defined as follows. Let, at The sequential ordering problem is closely related to the
time t, ant k be located on node i and be directed towards asymmetric TSP, but additional precedence constraints be-
node d. If Ni 6⊆ Mk , that is, if there is at least one city tween the nodes have to be satisfied. Gambardella and Dorigo
in the ant’s current location neighborhood that ant k has not have tackled this problem by an extension of ACS enhanced
visited yet, then the ant chooses the next node j ∈ Ni with by a local search algorithm (Gambardella & Dorigo, 1997).
probability They obtained excellent results and were able to improve the
½ best known solutions for many benchmark instances.
aijd (t) / Mk
if j ∈ A first application of AS to the job-shop scheduling prob-
pkijd (t) = (9)
0 if j ∈ Mk lem has been presented in (Colorni et al., 1994). Despite
showing the viability of the approach, the computational
otherwise, the ant chooses a city j ∈ Ni with uniform prob-
results are not competitive with state-of-the-art algorithms.
ability: pkijd (t) = 1/(|Ni |). (Note that in AntNet, differently
More recently, MMAS has been applied to the flow shop
from what happens in Ant System, the neighborhood and the
scheduling problem (FSP) in (Stützle, 1998). The computa-
feasible neighborhood are the same, that is, Nik ≡ Ni .)
tional results have shown that MMAS outperforms earlier
In other words, ants try to avoid cycles (Equation 9) but,
proposed Simulated Annealing algorithms, while it performs
in the case all the nodes in i’s neighborhood have already
comparably to Tabu Search. Another very recent applica-
been visited by the ant, the ant has no choice and it has to
tion is that to the single machine total tardiness problem, dis-
re-visit a node, generating in this way a cycle. In this case
cussed in a paper published in these same proceedings (Bauer
the generated cycle is deleted from the ant memory, that for-
et al., 1999). Again, the results obtained are very promising.
gets completely about it. Considering the stochasticity of the
Costa and Herz (Costa & Hertz, 1997) have proposed an
decision rule and the evolution in the traffic conditions, it is
extension of AS to assignment type problems and present
very unlikely that the ant repeats the same cycle over and over
an application of their approach to the graph coloring prob-
again.
3 Another very good algorithm inspired by the ant colony metaphor is
2 In this case the decay factor is chosen so that it operates a normalization HAS-QAP (Gambardella et al., 1997). It is not listed with the others above
of the pheromone values which continue therefore to be usable as probabili- because it doesn’t follow the ACO meta-heuristic.
ties.
lem obtaining results comparable to those obtained by other 7 Acknowledgments
heuristic approaches.
An application of ASrank to the capacitated vehicle Marco Dorigo is a Research Associate of the Belgian FNRS.
routing problem is presented by Bullnheimer, Hartl, and Gianni Di Caro was supported by a Madame Curie Fellow-
Strauss (Bullnheimer et al., 1997a). They obtained good ship (CEC-TMR Contract N. ERBFMBICT 961153).
computational results for standard benchmark instances,
slightly worse than the best performing Tabu Search algo- References
rithms. A recent application by Gambardella, Taillard, and
Agazzi to vehicle routing problems with time windows im- A. Bauer, B. Bullnheimer, R. F. Hartl, & C. Strauss. 1999.
proves the best known solutions for some benchmark in- An Ant Colony Optimization Approach for the Single
stances (Gambardella et al., 1999). Machine Total Tardiness Problem. In: Proc. of 1999
Michel and Middendorf (Michel & Middendorf, 1998; Congress on Evolutionary Computation. IEEE Press.
Michel & Middendorf, 1999) have proposed an application
D. Bertsekas. 1995. Dynamic Programming and Optimal
of AS to the shortest common supersequence. They introduce
Control. Athena Scientific.
the use of a lookahead function during the solution construc-
tion by the ants. They also present a parallel version of their B. Bullnheimer, R. F. Hartl, & C. Strauss. 1997a. An Im-
algorithm based on the island model typical of many parallel proved Ant System Algorithm for the Vehicle Routing
genetic algorithms implementations. Problem. Tech. rept. POM-10/97. Institute of Manage-
MMAS has recently been applied to the generalized ment Science, University of Vienna, Austria. Accepted
assignment problem by Ramalhinho Lourençou and Serra for publication in the Annals of Operations Research.
(Lourenço & Serra, 1998), obtaining very promising com-
putational results. In particular, their algorithm is shown to B. Bullnheimer, R. F. Hartl, & C. Strauss. 1997b. A New
find optimal and near optimal solutions faster than a GRASP Rank-Based Version of the Ant System: A Computa-
algorithm which was used for comparison. tional Study. Tech. rept. POM-03/97. Institute of Man-
Another very recent application of the ACO meta-heuristic agement Science, University of Vienna, Austria. Ac-
is the one to the multiple knapsack problem proposed in cepted for publication in the Central European Journal
one of the articles published in these same proceedings for Operations Research and Economics.
(Leguizamón & Michalewicz, 1999). Also in this case the
A. Colorni, M. Dorigo, V. Maniezzo, & M. Trubian. 1994.
obtained results are very promising.
Ant System for Job-Shop Scheduling. Belgian Journal
Perhaps the currently most active research area in ACO
of Operations Research, Statistics and Computer Sci-
algorithms is the study of their application to telecommu-
ence (JORBEL), 34, 39–53.
nication network problems, in particular to network routing
(Schoonderwoerd et al., 1996; Di Caro & Dorigo, 1998). D. Costa, & A. Hertz. 1997. Ants Can Colour Graphs. Jour-
The application of ACO algorithms to network optimization nal of the Operational Research Society, 48, 295–305.
problems is appealing because these problems have character-
istics like distributed information, non-stationary stochastic G. Di Caro, & M. Dorigo. 1998. AntNet: Distributed Stig-
dynamics, and asynchronous evolution of the network status mergetic Control for Communications Networks. Jour-
which well match some of the properties of ACO algorithms nal of Artificial Intelligence Research (JAIR), 9, 317–
like the use of local information to generate solutions, indi- 365.
rect communication via the pheromone trails and stochastic
G. Di Caro, & M. Dorigo. 1998. Two Ant Colony Algorithms
state transitions. A detailed overview of routing applications
for Best-Effort Routing in Datagram Networks. Pages
of ACO algorithms can be found in (Dorigo et al., 1998).
541–546 of: Proceedings of the Tenth IASTED Interna-
tional Conference on Parallel and Distributed Comput-
6 Conclusions ing and Systems (PDCS’98). IASTED/ACTA Press.
In this paper we have given a formal description of the Ant M. Dorigo. 1992. Optimization, Learning and Natural Algo-
Colony Optimization meta-heuristic, as well as of the class of rithms (in Italian). Ph.D. thesis, Dipartimento di Elet-
problems to which it can be applied. We have then described tronica, Politecnico di Milano, Italy.
two paradigmatic applications of ACO algorithms to the trav-
eling salesman problem and to routing in packet-switched M. Dorigo, & L. M. Gambardella. 1997. Ant Colony Sys-
networks, and concluded with a short overview on the cur- tem: A Cooperative Learning Approach to the Traveling
rently available applications. Ongoing work follows three Salesman Problem. IEEE Transactions on Evolutionary
main directions: the study of the formal properties of a sim- Computation, 1(1), 53–66.
plified version of ant system, the development of AntNet for
Quality of Service applications, and the development of fur- M. Dorigo, G. Di Caro, & L. M. Gambardella. 1998.
ther applications to combinatorial optimization problems. Ant Algorithms for Discrete Optimization. Tech. rept.
View publication stats
IRIDIA/98-10. Université Libre de Bruxelles, Belgium. V. Maniezzo, A. Colorni, & M. Dorigo. 1994. The Ant System
Accepted for publication in Artificial Life. Applied to the Quadratic Assignment Problem. Tech.
rept. IRIDIA/94-28. Université Libre de Bruxelles, Bel-
M. Dorigo, V. Maniezzo, & A. Colorni. 1991. Positive feed- gium.
back as a search strategy. Tech. rept. 91-016. Diparti-
mento di Elettronica, Politecnico di Milano, Italy. R. Michel, & M. Middendorf. 1998. An Island Model based
Ant System with Lookahead for the Shortest Superse-
M. Dorigo, V. Maniezzo, & A. Colorni. 1996. The Ant Sys- quence Problem. Pages 692–701 of: A. E. Eiben, T.
tem: Optimization by a colony of cooperating agents. Back, M. Schoenauer, & H.-P. Schwefel (eds), Proceed-
IEEE Transactions on Systems, Man, and Cybernetics– ings of PPSN-V, Fifth International Conference on Par-
Part B, 26(1), 29–41. allel Problem Solving from Nature. Springer-Verlag.
L. M. Gambardella, & M. Dorigo. 1997. HAS-SOP: An Hy- R. Michel, & M. Middendorf. 1999. An ACO Algorithm for
brid Ant System for the Sequential Ordering Problem. the Shortest Common Supersequence Problem. In: D.
Tech. rept. IDSIA-11-97. IDSIA, Lugano, Switzerland. Corne, M. Dorigo, & F. Glover (eds), New Ideas in Op-
L. M. Gambardella, E. D. Taillard, & M. Dorigo. 1997. Ant timization. McGraw-Hill.
Colonies for the QAP. Tech. rept. IDSIA-4-97. IDSIA, G. Reinelt. 1994. The Traveling Salesman Problem: Com-
Lugano, Switzerland. Accepted for publication in the putational Solutions for TSP Applications. Berlin:
Journal of the Operational Research Society (JORS). Springer-Verlag.
L. M. Gambardella, E. D. Taillard, & G. Agazzi. 1999. R. Schoonderwoerd, O. Holland, J. Bruten, & L. Rothkrantz.
MACS-VRPTW: A Multiple Ant Colony System for Ve- 1996. Ant-based Load Balancing in Telecommunica-
hicle Routing Problems with Time Windows. In: D. tions Networks. Adaptive Behavior, 5(2), 169–207.
Corne, M. Dorigo, & F. Glover (eds), New Ideas in Op-
timization. McGraw-Hill. T. Stützle. 1998. Parallelization Strategies for Ant Colony
Optimization. Pages 722–731 of: A. E. Eiben, T. Back,
M. R. Garey, & D. S. Johnson. 1979. Computers and In- M. Schoenauer, & H.-P. Schwefel (eds), Proceedings
tractability. W.H. Freeman and Company. of PPSN-V, Fifth International Conference on Parallel
D. S. Johnson, & L. A. McGeoch. 1997. The Traveling Sales- Problem Solving from Nature. Springer-Verlag.
man Problem: A Case Study. Pages 215–310 of: E.H. T. Stützle, & M. Dorigo. 1999a. ACO Algorithms for the
Aarts, & J. K. Lenstra (eds), In Local Search in Combi- Quadratic Assignment Problem. In: D. Corne, M.
natorial Optimization. Chichester: John Wiley & Sons. Dorigo, & F. Glover (eds), New Ideas in Optimization.
E. L. Lawler, J. K. Lenstra, A. H. G. Rinnooy-Kan, & D. B. McGraw-Hill.
Shmoys (eds). 1985. The Travelling Salesman Problem. T. Stützle, & M. Dorigo. 1999b. ACO Algorithms for the
Wiley. Traveling Salesman Problem. In: K. Miettinen, M.M.
G. Leguizamón, & Z. Michalewicz. 1999. A New Version Mäkelä, P. Neittaanmäki, & J. Periaux (eds), Evolu-
of Ant System for Subset Problems. In: Proc. of 1999 tionary Algorithms in Engineering and Computer Sci-
Congress on Evolutionary Computation. IEEE Press. ence: Recent Advances in Genetic Algorithms, Evo-
lution Strategies, Evolutionary Programming, Genetic
S. Lin. 1965. Computer Solutions of the Traveling Salesman Programming and Industrial Applications. John Wiley
Problem. Bell Systems Journal, 44, 2245–2269. & Sons.
H. Ramalhinho Lourenço, & D. Serra. 1998 (May). Adaptive T. Stützle, & H. Hoos. 1998a. MAX –MIN Ant System
Approach Heuristics for the Generalized Assignment and Local Search for Combinatorial Optimization Prob-
Problem. Economic Working Papers Series 304. Uni- lems. Pages 313–329 of: S. Voß, S. Martello, I.H. Os-
versitat Pompeu Fabra, Dept. of Economics and Man- man, & C. Roucairol (eds), Meta-Heuristics: Advances
agement, Barcelona, Spain. and Trends in Local Search Paradigms for Optimization.
Kluwer Academics, Boston.
V. Maniezzo. 1998. Exact and Approximate Nondetermin-
istic Tree-Search Procedures for the Quadratic Assign- T. Stützle, & Holger Hoos. 1998b. Improvements on the
ment Problem. Tech. rept. CSR 98-1. C. L. in Scienze Ant System: Introducing the MAX –MIN Ant Sys-
dell’Informazione, Universitá di Bologna, Italy. tem. Pages 245–249 of: R.F. Albrecht G.D. Smith,
N.C. Steele (ed), Artificial Neural Networks and Genetic
V. Maniezzo, & A. Colorni. 1999, in press. The Ant System Algorithms. Springer Verlag, Wien New York.
Applied to the Quadratic Assignment Problem. IEEE
Trans. Knowledge and Data Engineering.