06 Chapter1

Chapter 1
Queueing Theory : An
Introduction
5
6
1.1 Introduction
Queue or waiting line is a social phenomena common in any modern society, when the
available facilities are not sufficient to meet the immediate needs of the customers.
That is, a queue is a particular kind of collection in which the entities in the collection
are kept in order and the principal operation on the collection are the addition of the
entities to the rear terminal position and the removal of the entities from the front
terminal position. This is equivalent to the requirement that once an element is added,
all elements that were added before have to be removed before the new element can
be invoked. Queues provide services in computer science, transport and operations
research, where various entities such as data,object,persons or events are stored and
held to be processed later. In these contexts, the queue performs the function of
a buffer. The units in the queue, who demand service at the service center and
wait if the service is not prompt is called customers. The mechanism arranged to
provide service to the customers at the service center is called server. That is, queue
represents a certain number of customers waiting for service. If a queue is finite,
it can only hold a limited number of customers. Most queueing models assume an
infinite queue, even though this is almost certainly not strictly true in the majority
of applications of queueing theory. This assumption is made because it makes the
modeling process simpler. Also, if the maximum queue size is significantly larger than
the likely number of customers at any one time, then to all intents and purposes it is
infinite in size. The amount of time which is a customer waits in the queue for service
is called queueing time.
There are three main concepts in queueing theory, which are queue (waiting line),
customer and server. The queueing theory is the probabilistic study of waiting lines.
7
Although it does not solve all types of waiting line problems,it provides useful and
vital informations by forcasting or predicting various characteristics and parameters
of the particular waiting line under study. Since the predictions about the waiting
times, the no. of waiting at any time, the time for which the server/servers remain
busy etc. rely heavily on the basic concept of stochastic processes, it can very well
be taken as an application of stochastic processes. Also, queueing theory is generally
considered as a branch of operational research became the resources needed to provide
service. Then queueing theory is an application of stochastic processes in operations
research.
Erlang, Agner Krarup (1909,1917), an engineer at Copenhagen telephone exchange
started to work on applying the theory of probabilities to problems of telephone traf-
fic. It is interesting to note here that only after the world war II, it was realized
that the queueing theory has widespread applications in various fields, such as indus-
trial problems (production, scheduling and maintenance), congestions in road traffic,
scheduling of air traffic at the airport, waiting in a hospital outpatient department,
inventory control, machines waiting to be serviced by repairmen, physical processes
(operation of a set of dams), epidemic process in biology, intelligent transportation
systems, call centers, networks, telecommunications terminals, advanced telecommu-
nication system etc. Since queueing theory is applied in different fields, also the
terms jobs and tasks are often used instead customers and service center is often
named processor or machine.

8
1.2 Queueing System
Queueing theory deals with the analysis of a queueing system.Gross and Harris (1974)
described the queueing system as customers arriving for service,waiting for service if
it is not immediate and if having waited for service, leaving the system after be-
ing served. There are several everyday examples that can be described as queueing
system, such as bank teller service, computer systems, manufacturing systems, main-
tenance systems, communication systems and so on. An accurate representation of a
queueing system require a detailed characterization of the underlying processes. The
following are the six basis characteristics of a queueing system.
1. Arrival pattern of customers : The arrival process consists of describing how
customers arrive to the system. The system may have either a limited or an unlimited
capacity for holding units. The source from which the units come may be finite or
infinite. A unit may arrive either singly or in a group (bulk arrival). The interval
between two consecutive arrival is called the inter arrival time or interval. The arrival
pattern of a system is measured in terms of the average number of arrivals per unit
time (mean arrival rate) or by the average time between successive arrivals (mean
inter-arrival time). If the arrival pattern is deterministic, then it is fully determined
by either the mean arrival rate or the mean inter-arrival time. On the other hand, if it
is probabilistic, then their mean values provide only measures of central tendency for
the arrival pattern, and further characterization is required in the form of the prob-
ability distribution associated with the random process. If the arrival pattern does
not change with time, then it is called a stationary arrival pattern, otherwise, it is
called non-stationary (transient). The commonly used input distributions are Poisson
distribution, Degenerate distribution (deterministic), Erlangian type with parameter

9
k(k=1,2,3...), Hyper exponential distribution and Phase type distribution.
Another factor to be considered regarding the arrival pattern is the reaction of the
customers in the queue. If the queue, is too long, a customer may decide not to enter
it upon arrival and in this situation he is said to have balked. On the other hand, a
customer may enter the queue, but after some time lose patience and decide to leave.
In this case, he is said to have reneged. In the event that there are more than one
queue, customers may switch from one to another, that is jockey for position. These
three situations are all examples of queues with impatient customers. If the arrival
pattern does not change with time, then it is called a stationary arrival pattern,
otherwise, it is called non-stationary.
2. Service pattern of servers : Much of the discussions concerning the arrival
pattern is appropriate in discussing service patterns. Service pattern can also be
described by the number of services per unit time (service rate) or by the time required
to service customer (service time). Service may also be in single or in batch, further
it can be stationary or non-stationary. One important difference between arrival
and service is that service rate or service time are conditioned on the fact that the
system is not empty, that is, if the system is empty, the server is idle. The servers
that become idle may leave the system for a random period of time called vacation.
These vacations may be utilized to perform additional work assigned to the servers.
The service rate may depend in the number of customers waiting for service. In this
situation, it is called state-dependent service. The problems of customer impatience
can be looked upon as ones of state-dependent arrivals.
3. Queue discipline : Queue discipline refers to the rule by which customers are
selected for service when a queue is formed. One of the most important and often
10
practiced queue discipline is first in first out (FIFO). Some others in common usages
are last in first out (LIFO). Another queue discipline is service in random order (RSS).
In some cases, customers are given priorities up on entering the system, the ones
with higher priorities to be selected for service ahead of those with lower priorities,
regardless of their time of arrival to the system. There are two general situations
in priority discipline. In the first, which is called preemptive, a customer with the
highest priority is allowed to enter service immediately after suspending even the
service in progress to a customer with lower priority. In the non-preemptive case, the
highest priority customer goes to the head of the queue but cannot get into service
until the customer presently in service is completed.
4. System capacity : In some queueing process, there is a finite upper bound to the
queue size. In this situation, a customer is forced to balk if he arrives at a time when
queue size is at its limit. This is a simple case of balking, since it is known exactly
under what circumstance arriving customers must balk.
5. Service channels : The number of service channels refers to the number of
parallel service stations which can provide identical service facilities to the customers
simultaneously.
6. Stages of service : A service station may have several stages. That is, there may
exist a series of service stages through which each customer must progress prior to
leaving the system. They are called tandem queues.
1.3 Queueing processes
The analysis of queueing system with fixed (deterministic) inter-arrival and service
times does not present much difficulty. We shall be concerned with models or systems
11
where one or both (inter-arrival and service times) are stochastic. Their analysis will
involve a stochastic description of the system and the related performance measures
as discussed below.
1. Distribution of the number of customers N(t) in the system or queue at time t.
2. Distribution of the waiting time in the queue or system,the time that an arrival
has to wait in the queue. If Wn denote the waiting time of the nth arrival, then of
interest is the distribution of Wn .
3. Distribution of the virtual waiting time W(t), the length of time an arrival has to
wait has he arrive at time t.
4. Distribution of the busy period.
It is clear that N (t), t ≥ 0, W (t), t ≥ 0 and Wn , n ≥ 0 are stochastic processes, the
first two being in continuous time and the third one in discrete time. It will be seen
that some of the queueing processes that can we would come across are Markovian
and some are Semi-Markovian. From some of the non-Markovian processes that arise,
Markov chains can extracted at suitable regeneration points and Semi-Markov pro-
cesses can be constructed there from. The theory of Markov chains and Semi-Markov
process thus plays an important role in the study of queueing processes. The problem
studied in queueing theory may be grouped as
1. Stochastic behavior of various r.v’s and evaluation of various performance mea-
sures.
2. Method of solution - exact, transform, algorithmic, asymptotic, numerical, approx-
imations etc.
3. Nature of solution - time dependent, limiting form etc.
4. Control and design of queues - comparison of behavior and performances under

12
various situations, as well as queue discipline, service rules strategies etc and
5. Optimization of specific objective functions involving performance measures, asso-
ciated cost functions etc.
1.4 Notations
The widely used notation to represent a queueing process was introduced by Kendall
(1953). According to him, a queueing process is described by the notation A/B/X/Y /Z,
where A is inter-arrival time distribution, B is the service time distribution, X is the
number of service channels, Y is the system capacity and Z is the queue discipline.
The commonly used inter-arrival time and service time distributions are
M - Poisson arrival and exponential service time distribution
D - Degenerate distribution (deterministic)
Ek - Erlangian type with parameter k (k=1,2,3...)
GI - General Independent
H - Hyper exponential distribution
P H - Phase type distribution.
The following are the definitions of certain terms connected with a queueing process
1. Queue length : The number of customers waiting in the queue.
2. System size : The number of customers waiting in the queue and the customers
in service.
3. Waiting time in the queue : The duration of time spent by a customer in the
queue before getting service.
4. Waiting time in the system : The time spent by a customer in the queue and
for service in the system before leaving.

13
5. Busy period : The time interval over which a server is continuously busy.
6. Idle period : The time interval during which the server is completely free.
1.5 Different Methods for the Analysis of a Queue-

ing System
The analysis of queueing models give light to the behavior and performance measures
of the queueing system such as expected queue length, average system size, average
waiting time, expected busy period etc. The transient and steady state probabil-
ity distributions have been used for finding the performance measures of a queueing
system. The solution of a queueing system depending on time is called transient
distribution. The steady state distribution is a balanced equilibrium solution, which
does not depend on time. An equilibrium state is the stable behavior of the system
attained after being in operation for sufficiently long time. That is, steady state distri-
bution is the limiting behavior of the queueing system. Transient results are, however,
more difficult to obtain, and not many such results are found in the literature. Morse
(1955) studied the M/M/1 queue and obtained the transient state probabilities of the
number in the system at time t. The problem was studied and a complete solution
was obtained in the 1950’s by several researchers using different methods. The steady
state behavior of the queueing system were primarily investigated by Erlang (1917)
and he obtain the steady state probabilities for M/M/1/∞. The main limitation
observed in the steady state distribution is that there may exist many stochastic pro-
cesses with the same stationary distribution. Whitt (1983) in the detailed study of
the queueing system GI/M/1, remarked the problem of using stationary distributions
alone for describing the stochastic behavior of a queue. Hence to see the stochastic
14
nature of a queue, some transient behavior of the queueing system should also be
considered along with the stationary distribution. The method of Laplace transform
can be applied to obtain the transient distribution and the steady state probability
distribution (if exists) can be obtained by using the final value theorem on Laplace
transforms. The following are some of the methods for measuring the performances
of a queueing system.
1. Embedded Markov Chain Method : This is the widely used method in queue-
ing theory introduced by Kendall (1953). Under this method, a discrete time Markov
chain is extracted from a non-Markov process by using regeneration points due to
Palm (1943). The points of regenerations may be the service completion time, the
departure time of a customer, the arrival time of a customer etc. The transition prob-
ability matrix of the extracted markov chain can be used for finding the performance
measures of a queueing system.
2. The Supplementary Variable Method : The use of supplementary variable, as
a general technique, was introduced by Cox (1955). In fact, Kendall (1953) proposed
this concept by using the term ’augmentation’ instead of supplementary variable.
Cox (1955) analysed the M/G/1 queueing system using this technique. The method
is based on the treatment of the couplet (N(t),X(t)). Henderson (1972) considers the
couplet (N(t),Y(t)), where Y(t) is the remaining or residual service time of the cus-
tomer in service, if any. These methods involve inclusion of a supplementary variable
X(t) or Y(t).
3. Lindley’s Integral Equation Method : This method was introduced by Lind-
ley in 1952. In this method the customer arrival times are taken as the regeneration
points and considered the waiting time of the nth customer as the object of study.
15
Other notable contributions and discussions on the integral equations can be seen in
Smith (1953), Prabhu (1965a), Gross and Harris (1974), Karlin and Tailor (1975),
Kleinrock (1975) etc.
4. The Method of Sample Path : The realization points of a stochastic pro-
cesses {X(t), t ∈ T } or the sequence of observations X(t) of a stochastic processes
{X(t), t ∈ T }, for each time point t ∈ T is generally known as sample path or trajec-
tory. The sample path may be defined for all t, by joining the path (t, xt ) by straight
line or the trajectory being so defined that it remains a constant except for jumps of
unit magnitude corresponding to those Xt which are equal to one. The graphical rep-
resentation of sample path shows movements of the outcome of the process or state
over time. By associating probability with the path, it is possible to estimate the
probability of events related to sample path. Thus the process related to the sample
path can be analyzed more effectively by using this technique. In queueing theory,
the sample path is a powerful method for the evaluation of the performance measures
and its relation ships. See sample path method by Stidham and Thaha (1995) for
more details.
5. The Matrix Analytic Method : Starting with the introduction of phase type
probability distributions, Marcel Neuts (1975) has developed an analysis technique
that extends and modifies the earlier transform method to multivariables and makes
it amenable for solution with the help of computers. See Neuts (1978,1981), Sengupta
(1989), and Ramaswami (1990,2001). The use of phase type distributions in the rep-
resentation of system elements and the matrix analytic method in their analysis has
significantly expanded the scope of queueing systems for which useable results can be
derived.
16
1.6 History and Development of Queueing Theory
Queueing theory as a part of probability theory has been developed largely in the con-
text of telephone traffic engineering. In 1909 A.K.Erlang a Danish teletraffic engineer
published a paper called ”the theory of probabilities and telephone conversations”. In
the early 1920’s he developed the famous Erlang model to evaluate loss probabilities of
multi-channel point-to-point conversations. The Erlang model was extended to allow
for calculation in finite source input situations by Engset several years later leading to
the Engset model. In 1951 G.K.Kendall published his work about embedded Markov
Chains, which is the base for the calculation of queueing systems under fairly general
input conditions. He also defined a naming convention for queueing systems which
is still used. Nearly at the same time D.V.Lindely developed an equation allowing
for results of a queueing system under fairly general input and service conditions. In
1957 J.R.Jackson started the investigation of network queues thus leading to so called
queueing network models. With the appearance of computer and computer networks,
queueing systems and queueing networks have been identified as a powerful analysis
and design tool for various application.
The steady state solutions for queueing system were primarily investigated by Er-
lang (1917), where he obtained the steady state probabilities for a Markovian delay
model and a Markovian loss model. The investigation in the study of time depen-
dent solution of a queueing system began in 1930’s, however the credit for the first
transient solution goes to Bailey (1954b). He obtained time dependent solution for
an M/M/1 queueing system by using generating function to the partial differential
equations. Morse (1955) studies M/M/1 queue and obtained the transient state prob-
abilities of the queue size at different time point. Champernowne (1956) obtained the
17
transient solution of M/M/1 queue by using combinatorial method. The transient
solution of M/M/1 queue using the method of difference equations have considered by
Conolly (1958) and Feller (1966). Pegden and Rosenshine (1982) and Hubbard et al
(1986) have studied M/M/1 queueing system with two dimensional state associated
with the number of arrivals and departures and obtained transient state solutions.
Cohen (1982) studied transient solution of the M/M/1 queue and obtained the re-
sult involving three integrals. Parthasarathi (1987) applied an alternative approach
and obtained transient solution of M/M/1 queue in terms of an integral and a fi-
nite series. Abate and Whitt (1987,1988) considered M/M/1 queue and studied its
transient behavior via Laplace transforms. The method of sample path is applied by
Bacceli and Messey (1989) to obtain the performance measure of an M/M/1 queueing
system. Mohanthy and Panni (1990) have used a discrete analogue of the M/M/1
queue and discussed its transient solutions. Conolly and Languris (1993) investigated
M/M/1 queue and obtained a formula for the transient state probabilities. Sharma
and Maneshwar (1993) considered the busy period of M/M/1 queueing systems With
infinite capacity and obtained performance measures. Sharma and Sharma (1993)
obtained an expression for the waiting time of a unit in an M/M/1 queue, where the
priority of a unit decrease with time. Sharma and Tarabia (2000) obtained transient
state probabilities of the queueing model M/M/1/N. Lalith A Kulkarni and San-gi
Li (1996) considered the transient behavior of queueing systems with correlated traf-
fic. Barbot and Sericola (2002) considered an infinite capacity buffer receiving fluid
at a rate depending on the states of an M/M/1 queue. Draief and Mairesse (2005)
analyzed the service times of customers in a stable M/M/1 queue in equilibrium de-
pending on their position in a busy period. Antunes et al. (2006) studied the impact
18
of busy period of a M/M/1 queue of a small perturbation in the service rate. Kherani
(2006) considered an M/M/1 queue and obtained the distribution of sojourn times,
waiting time, busy period etc. Indra and Ruchi (2009) considered transient solution
of two-dimensional state M/G/1 queueing models with working vacations.
Multichannel (multiserver) queues constitute an important class of queueing pro-
cesses and have broad practical meaning. Examples of such systems may be found in
banks, telephone exchanges, hospital admission systems, seaports etc. Queueing sys-
tems with multiple channels have been studied by many researchers through various
methods. Disney (1962) analyzed two channel models with ordered entry having no
space for queue with same service rates and obtained steady state probabilities of the
waiting time distribution. Yao (1987) discussed the nature of the multi channel queue
with ordered entry. Mastsui and Fukuta (1997) obtained system size probabilities for
multi channel queue with ordered entry and heterogeneous server. Krishnamoorthi.B
(1963) considered on Poisson queue with two heterogeneous servers. G.S.Mokaddis,
C.H.Matta and M.M.El Genaidy (1998) considered on Poisson queue with three het-
erogeneous servers. The transient behavior of the multi server M/M/c queue has been
analyzed by Saaty (1960) and obtained its transient probability distribution. Jackson
and Henderson (1966) considered M/M/c queue and obtained transient probability
distribution by using Laplace transform and generating function. Chaudhary and
Templeton (1973) studied the multi server model M/M/c and derived the busy pe-
riod distribution. Koenigsberg (1980) considered M/M/N/N system (a multi server
with finite input source N) as an application problem to electronic and obtained sys-
tem probability distribution. Borthakur and Chaudhary (1999) discussed the steady
state behavior of an M/M/c queueing system with a general start up time under an
19
N-policy. Margolious (2005) derived an integral equation for the transient probabil-
ities and expected number in the queue for the time dependent multi server Poisson
queue.
There are several remarkable contributions by various researchers in the field
of non-Markovian queueing systems. A queueing system in which the inter-arrival
time or service time or both does not posses the Markov property is known as non-
Markovian queueung system. Hence the Markov property cannot be applied directly
for the analysis of non-Markovian queueing models. The first paper on estimating
parameters in a non-Markovian system is by Goyal and Harris (1972), who used the
transition probabilities of the imbedded Markov chain to set up the likelihood func-
tion. Since then significant progress has occurred in adapting statistical procedures
to various systems. For instance, Basawa and Prabhu (1981,1988) considered the
problem of estimation of parameters in the queue GI/G/1; Rao et al.(1984) used a
sequential probability ratio technique for the control of M/Ek /1 and Ek /M/1; and
Armero (1994) used Bayesian techniques for inference in Markovian queues, to iden-
tify only a few. Medhi (2002) studied more general M/G/1 queue with a second
optional channel and obtained transient solution, busy period distribution and the
steady state Pollaczek-Khinchin formula. Kella et al.(2005) studied some time de-
pendent properties of symmetric M/G/1 queue.
1.6.1 Bulk Queues
A queueing system in which the arrivals or service or both occur in groups is known as
a bulk queueing system. In ordinary queueing problems, it is assumed that customers
arrive singly at a service facility. However, this assumption is violated in many real-
world queueing situations. Letters arriving at a post office, ships arriving at a port in
20
convoy, people going to a theater, restaurant, and so on, are some of the examples of
queueing situations in which customers do not arrive singly but in bulk or in groups.
Also, the size of an arriving group may be a random variable or a fixed number.
Mathematically and also from practical point of view, the cases when the size of an
arriving group is a random variable, are more common, and also more difficult to
handle.
The study of bulk arrival queue may be said to have begun with Erlang’s solution of
the M/Ek /1 queue (Brockmeyer et al.1948). The Major contributions on bulk arrival
queues are made by many researchers. Sharada (1973) obtained the solution for a
queueing problem with batch arrivals and correlated departures. Indra and Sharada
(2004) obtained explicitly the probabilities of system with latest arrival run having
maximum effective length one. Madan et al.(2004) studied a single server queue with
batch arrivals and two types of heterogeneous service with different general service
time distributions. Schleyer and Furmans (2007) presented an analytical method to
calculate the waiting time distribution for the G/G/1 queueing system with batch
arrivals. Ahmed (2007) considered a multi-channel bi-level heterogeneous servers bulk
arrivals queueing system with Erlangian service time. Indra and Vijay (2010) analyzed
a two dimensional bulk arrival queueing model with exhaustive and non-exhaustive
service policy.
We now turn to queueing situations in which arrivals occur singly, but service is
in bulk. Bulk service queues have potential applications in loading and unloading
of cargoes at a sea port, in traffic signal systems, in computer networks where jobs
are processed in batches, manufacturing/production systems, in restaurants, cinema
halls and so on. Bailey (1954) introduced the concept of bulk service and the same
21
was later studied by a number of researchers.
Sharada (1981) obtained results for a limited space correlated queueing problem
with departures in batches of variable size. Prem Chand (1988) obtained the explicit
time dependent probabilities of given number of arrivals and departures by a given
time of a FCFS single server queueing model in which arrivals are occurring singly
and are served in batches of variable sizes. Chaudhry and Templeton (1983) gave a
comprehensive treatment of single and multiserver systems with group arrivals and
individual service or individual arrivals and batch service. In 1964 Bhat analyzed a
single server bulk queues by the technique of imbedded Markov chain. After that,
bulk arrival and bulk departure queues were studied by Gupta and Goyal (1966).
Delbrouck (1970), Gaur (1973), Boathakur and Medhi (1974), Prabhu (1987) etc.
Suzuki (2007) studied approximate analysis for bulk arrival queueing system with
bulk service and the steady state probabilities of the system by using successive
approximation are evaluated.
1.6.2 Queueing Systems with Vacation
Queueing models with server vacation are useful for the systems in which a server
wants to utilize the idle time for different purposes. Application of server vacation
models can be found in manufacturing systems, designing of local area networks and
data communication system etc. Introducing vacation in the queueing models make
the model more realistic and flexible in studying real world queueing situation. Be-
cause of the wide application of vacation queueing models, considerable attention has
paid to analyze queueing system with vacation policies. Many researchers have ana-
lyzed vacation queueing problems for both Markovian and non-Markovian queueing
models. The Markovian models may be more appropriate in practice and may have
22
certain advantages. Kella.O (1989) considered the threshold policy in the M/G/1
queue with server vacations. Chaudhary (1995) considered M/G/1 queueing system
with vacation and setup time. Boxma and Yechaiali (1997) introduced multiple types
of feedback gated vacations for M/G/1 queue. Lee et al.(2001) gave an analysis of
multiple class vacation queues with individual threshold.
The inclusion of vacation into bulk queueing models makes the model more feasible
to the practical situations. Some researchers have also paid their attention towards
this direction. Using matrix geometric method, Nadarajan and Subramaniyan (1984)
analyzed M/M (a,b) /1 queueing system with server’s vacation. Li and Zhu (1996) pro-
vided an explicit formula for the Laplace transform of the additional delay for M/G/1
queues with delayed vacations and exhaustive service discipline. Reddy and Anitha
(1998) determined the stationary distribution of the number of customers in queue
and waiting time distribution of an arriving customer for M/M (a,b) /1 queueing system
with multiple vacations and change over time.Reddy et al.(1998) presented analysis of
a bulk queue having the assumptions of N-policy, multiple vacations and setup times.
Chaudhary (2001) obtained the expected delay in services due to vacation period for
M/M/1 queueing system under N-policy and exponential setup time. Madhu Jain
and Poonam Singh (2005) considered the state dependent bulk service queue with
delayed vacation. P.Vijaya Laxmi, V Goswami and O.M.Yesuf (2010) considered a
finite buffer single vacation queue with accessible and non-accessible batch service.
1.6.3 Network Queueing System
A network of queues is a collection of service centers, which represent system re-
sources, and customers, which represent users or transactions. The term network of
queues describe the situation where the input for one queue is the output from one or
23
more other queues. Queueing network models are widely used in diverse areas, such
as production and assembly lines, maintenance and repair operations, airport termi-
nals, communication networks, computer-sharing and multiprogramming systems and
health-care centers. A network queueing system constitute a central tool in modeling
and performance analysis of telecommunication and computer systems. A queueing
network is said to be open if customers enter from outside the network, circulate
among the service centers (or queues) for service, and depart from the network after
the service. That is, an open queueing network can receive customers from outside
the network. In the case of closed network a fixed number of customers circulated
indefinitely among the queues and no arrivals from outside of it. A network queue
is said to be mixed if some customers enter from outside the network and eventually
leave and if some customers always remain in the network.
The first article on queueing networks is by J.Jackson (1957). Mathematical
formulations for the analysis of queueing networks are due to Whittle (1967,1968)
and Kingman (1969), who treated them in the terminology of population processes.
Complex queueing network problems have been investigated extensively since the be-
ginning of 1970’s. Several survey papers and books summarize the major contribution
made in this area. These includes Basket (1973), Kelly (1979), Whitt (1983a,b) Dis-
ney and Kiessler (1987), Harrison and Nguyen (1990,1993) Ananthram et al.(1993)
and Dai (1998).
1.6.4 Discrete Time Queues
In discrete time queueing systems both the inter-arrival and service time distributions
are discrete. During the last two decades, discrete time queueing models have been ap-
plied to model digital communication systems and packet switches. Recently interests
24
in the discrete time queues have increased due to their numerous applications in the
analysis of computer and communication systems such as asynchronous transfer mode
(ATM), multiplexers in the broad band integrated services digital net work (B-ISDN),
circuit- switched time-division multiple access (TDMA) systems, slotted carrier-sense
multiple access (CSMA), protocols and traffic concentrators in which the time axis is
divided in to slots. One of the reasons for this is that in discrete time queues t, the
slotted nature of telecommunication systems better than the continuous time counter-
parts, and hence they give more accurate performance measures of these systems. In
discrete time queues, the time axis is divided into fixed length time intervals referred
to as slots. Arrivals and services of customers can start or end at slot boundaries only,
so that the service time is an integer multiple of slots. These assumptions are suitable
for modeling ATM (Asynchronous Transfer Mode) networks. Discrete time queue-
ing process are better suited than continuous time models to evaluate performance
measures such as packet loss and packet delay in computer and telecommunication
networks, because of the clock driven operation of those systems. In real life, usually
we observe limited waiting space. So the study of finite buffer queueing models is
important. Bruneel.H. (1993) considered the performance of discrete time queueing
systems. Bruneel.H. and Kim.B. considered the discrete time models for communi-
cation systems including ATM. Guptha.U.C.and Goswami.V.(2002) considered the
performance analysis of finite buffer discrete time queue with bulk service. Lee.Y
(2001) considered the discrete time GeoX/G/1 queue with preemptive resume prior-
ity. Lee.Y and Lee.K.S (2003) considered the discrete time GeoX/G/1 queue with
preemptive repeat different priority. Lee.Y, Kim.H and Huh.J.D (2003) considered
GeoX/G/1 queue with non preemptive priority. Goswami.V and Samanta.S.K (2009)
25
considered a discrete time single and batch service queue with accessibility to the
batches. C.Baburaj (2010) considered the performance measures of a discrete time
(a, c, d) policy bulk service queue.
1.6.5 Priority Queues
In many cases, it is desirable to give preferential treatment to certain classes of cus-
tomers for which the queue is ordered and the higher priority customers are served
first. Under this rule, customers are grouped in priority classes on the basis of some
attributes such as service time or urgency or according to some identifiable charac-
teristics, and FCFS rule is used with in each class to provide service. Treatment of
VIP’s in preference to other patients in a hospital is an example of priority service.
Priority queueing systems have been considered by many researchers in different con-
text. Jaiswal (1968) has given a detailed discussion on priority queues. Scharge and
Miller (1966) considered an M/G/1 queue with priority is assigned on the basis of
shortest processing time. Miller (1981) considered M/M/1 priority queues and ob-
tained steady state probabilities. Deng and Tan (2001) considered a single server
two-queue priority system with change over times and switching threshold. Harison
(2002) considered response time in M/M/1 queue with ageing priority and obtained
response time probability distribution via Laplace transform.
1.6.6 Retrial Queues
Queueing systems in which arriving customers who find all servers and waiting posi-
tions, if any, occupied, may retry for service after a period of time, are called retrial
queues or queues with repeated attempts. The most obvious example is provided by a
person who desires to make a phone call. If the line is busy, then he cannot queue up
26
but tries again sometime later. Thus retrial queues are characterized by the following
feature: a customer arriving when all servers accessible for him are busy, leaves the
service area but after some random time repeats his demand. Retrial queues are a
type of network with re-servicing after blocking. Thus, this network contains two
nodes : the main node, where blocking is possible and a delay node for repeated
attempts. As for other networks with blocking, the investigation of such systems
presents great analytical difficulties. Neverthless, the main feature of the theory of
retrial queueing systems as an independent part of queueing theory are quite clearly
drawn. In particular, the nature of results obtained, methods of analysis and areas
of applications allow us to divide retrial queues into three large groups in a natural
way as single channel systems, multi channel fully available systems and structurally
complex systems. The standard queueing models do not take in to account the phe-
nomenon of retrials and therefore cannot be applied in solving a number of practically
important problems. Retrial queues have been introduced to solve these deficiency.
One of the earliest papers in retrial queues was On the Influence of Repeated Calls
in the Theory of Probabilities of Blocking by Kosten. In 1957, J.W.Kohan published
basic problems of telephone traffic theory and the influence of repeated calls in which
he considered the more general M/M/c retrial queue with impatient customers. In
1968, Keilson, Cozzolino and Young published the first result on M/G/1 retrial queue
using the method of supplementary variable. In 1990, Neuts and Rao suggested an
approximation with the help of the model where the retrial rate stays constant when
the number of customers in orbit exceeds some level. In 2004, Janos Roszic, Janos
Sztrik and Che-Soong Kim considered retrial queues in the performance modeling of
cellular mobile networks using mosel.

27
1.6.7 Design and Control of Queues
Queueing systems with control limit policy has many concrete applications and also
has the capability to narrate certain real life situations. There is increasing interest
in the subject of control of queueing systems. This is due to in large part to the wide
applicability of results in this area to the control of computers and communication
systems. There are many real life queueing situations in which jobs are served with
a control limit policy. For example, in some manufacturing systems it is possible to
process jobs manually or by a machine. Up to some control limit it may be profitable
to process the jobs manually one by one, and when the number of jobs exceeds that
control limit, processing in a batch by a machine may be cheaper. Here we assume
that some set up cost is needed to initiate a batch service and when a batch service is
started, no setup cost is required for the service of the successive batches and only the
service cost is required. So when a batch service is started the server may continue
to serve in batches, even when the queue size is less than the specified level but not
less than a secondary limit.
In a queueing system it is not possible to control the arrivals of customers to the
system, however, a control can be designed by regulating service mechanism. The
procedure of controlling a queueing system by control limit policy include the choice
of state dependent service rate, specification of limits to begin and stop services, suit-
able choice of vacation scheme to the servers and assigning cost structure to holding
customers in the queue and service etc. In the queueing system under N-policy the
server commences service only when the queue size reaches at least N(N ≥ 1). The
server serves the customer either one by one or in batches until the system becomes
empty or below a specified level (quorum). In each time, when the system becomes
28
empty or below the quorum size, the server waits until the queue size become at
least N to commence the service. If the server continue the service until the system
become empty is called N-policy with exhaustive service. The first contribution to
the analysis of a queueing system with N-policy without vacation was due to yadin
and Naor (1963). Bhom (1992) analyzed an M/G/1 queue with N-policy and de-
rived transient distribution of the queue length. T-policy is another control policy,
according to which the service facility is turned off for a fixed period of time T, after
the service completion epoch of all customers present in the system. Heyman (1968)
considered an M/G/1 queueing system under T-policy. An M/M/1 queueing system
with control limit policies and exponential startup time is considered by Baker (1973).
In a queueing system with D-policy, after the service completion epoch of all the units
in the system, the server re-opens as and when the total accumulated work load ex-
ceed the critical level D. The queueing system under D-policy Was first studied by
Balachandran (1973). Sivazalin (1979) and Artilejo (2001,2002) have studied M/G/1
queueing system with D-policy. Lee and Beak (2005) considered queue length of the
queueing system with BMAP arrivals under D-policy.
An M/M/1 queue under (M,N) policy is considered by Bhom and Mohanthy
(1993), where the server begins service if the queue size is at least N and continue to
serve even when the queue size is less than M after a service completion epoch. Motir
and Ramy Khalid (2010) considered the random walk process in a bi-level (M,N)-
policy queue with multiple vacations. A single and batch service M/M/1 queue with
a control on the batch size was considered by C.Baburaj (1999). A controllable
bulk service queueing system under the policy (K,N,M) was considered by C.Baburaj
(2000). The single server bulk service rule introduced by Bailey (1954) is known as
29
the usual bulk service rule, in which the server serves the customers in batches of size
not more than b, the capacity of the server. After a service completion epoch, if he
finds not more than b waiting, then he takes all of them in a batch for service and
the server is intermittently available. Neuts (1967) introduced a bulk service system,
where the server starts service only when at least a units in the system and serves
a maximum of b units at a time. If after a service completion epoch, the queue size
is less than a, the server waits till there are a units. This bulk service rule is called
general bulk service rule (GBSR). Borthakur (1971) considered the busy period of a
bulk queueing system with a general bulk service rule. Medhi and Borthakur (1972)
considered a two server Markovian queue with a general bulk service rule. Medhi
(1975) introduced the waiting time distribution in a Poisson queue with general bulk
service rule. Baburaj and Surendranath (2005) considered an M/M/1 bulk service
queue under the policy (a, c, d), where the server begins service only when the queue
size is at least a specified level c, and serves a maximum of d units in a batch. The
server continue to serve even when the queue size is less than the specified level c,
but not less than a secondary limit a (a ≤ c ≤ d), after a service completion epoch.
In many practical situations the arriving customers are considered for service with
current batch in service,with some limitation, for example, cinema hall, elevator etc.
That is, in the general service (a, b) rule, if a batch being served does not employ its
full capacity of service, late arrivals may join the ongoing service as long as the number
in that service batch is less than a pre-defined threshold d (a ≤ d < b). The service
time of the batch is not changed by inclusion of such arriving customers in course
of ongoing service. Such a batch is said to be an accessible batch (AB). However,
if the number in the service batch exceeds d, the batch becomes non-accessible for
30
late arriving customers and such a batch is called non-accessible batch (NAB). This
has been considered by Kleinrock (1975). It may be noted that the accessible batch
service has more economic utilizations in providing better service to the queue. For
example in many shuttle transportation systems, we observe units being transported
according to accessibility rule with some limitations. Sivasami.R (1990) considered a
bulk service queue with accessible and non-accessible batches. Baburaj.C and T.M
Surendranath (2006) considered an (a, c, d) policy bulk service queue with accessible
and non-accessible batches. A discrete time single and batch service queues with
accessibility to the batches was considered by Goswami and Samantha (2009). A
finite buffer single vacation queue with accessible and non-accessible batch service
was considered by Vijayalaxmi, Goswami and Yesuf (2010).
1.7 An Overview of the Study
In this study we consider queueing models under different control limit policies. The
single server bulk service rule introduced by Bailey (1954) is known as the usual bulk
service rule. Neuts (1967)introduced the general bulk service rule (GBSR). Borthakur
(1971), Medhi (1975,1984,1994), Chaudhary and Templeton (1983) and many other
researchers studied similar models. Bhat (1964), Teghem (1986),Cohen (1969) etc.
have considered bulk queues with varying batch sizes. Baburaj and Surendranath
(2005) considered an M/M/1 bulk service queue under the policy (a, c, d), where the
server begins service only when the queue size is at least a specified level c, and serves
a maximum of d units in a batch. The server continue to serve even when the queue
size is less than the specified level c, but not less than a secondary limit a(a ≤ c ≤ d),
after a service completion epoch. Here we study the (a, c, d) policy queue under
31
Bernoulli schedule, under the customer’s choice, with single and batch service in
discrete case, with state dependent service rates and with multiple vacations. Also
the performance measures of a controllable bulk service queueing system with two
servers are considered.
In chapter 2, we considered an (a, c, d) policy M/M/1 bulk service queue with
Bernoulli schedule. Here the server begins service only if the number of units in the
queue is at least c and serves a maximum of d units in a batch. If after a service
completion epoch, the queue size n is less than c but not less than a secondary limit
a (a ≤ c ≤ d), then with probability p, the server serves them altogether in a batch
and with probability 1 − p, the server becomes idle. The service time distribution
is assumed to be exponential with parameter µ and the arrival process is assumed
to be Poisson with parameter λ.The transient and steady state behavior is discussed
and obtained explicit expressions for the steady state probabilities, expected queue
length, expected busy period and expected waiting time in the queue.The optimality
problem is also discussed and obtained expressions for the optimal control limits a
and c.
An (a, c, d) policy bulk service queue with accessible batches under the customer’s
choice is considered in chapter 3, In this model the customers arrive according to
a poisson process with parameter λ. Here the server allows late arrivals to join the
ongoing service till the batch size is less than b (accessible batch) and the arriving unit
has a choice either to enter the accessible batch or to wait in the queue till the service
of the next batch begins. Here we assume that the arriving unit join the accessible
batch with probability p or with probability 1 − p he waits in the queue till the service
of the next batch begins. In this model the server continue to serve even when the
32
queue size n is less than c but not less than a secondary limit a (a ≤ c ≤ b ≤ d),
after a service completion epoch. The service time distribution is assumed to be
exponential with parameter µ. In this chapter Section 2 provides the description of
the model. Section 3 presents the analysis of the model. The transient and steady
state behavior of the model is studied in sections 4. Explicit expressions for the
steady state distribution, expected queue length and expected busy period are given
in sections 5,6 and 7. In section 8 the method of determining the optimal control
limits a and c is discussed.
Chapter 4 deals with a state dependent M/M/1 single and batch service queue
under the policy (a, c, d). The arrival process is assumed to be Poisson with parameter
λ. Here the customers are served by a single server either one at a time or in batches
with different service rates depending on the number of units in the queue. In this
model if the number of units in the queue is less than a specified limit c, the server
serves the units manually according to FCFS rule and the service time distribution
is assumed to be exponential with parameter µ1 (Type I service). If n ≥ c, the server
serves a maximum of d units in a batch and the service time distribution is assumed
to be exponential with parameter µ2 (Type II service). If after a Type II service
completion epoch, the number of units in the queue is less than c but not less than a
secondary limit a, the server serves them altogether in a batch and the service time
distribution is assumed to be exponential with parameter µ3 , (a ≤ c ≤ d) (Type
III service). When n less than a, the server begins manual service. The transient
and steady state behavior of the model is discussed and explicit expressions for the
steady state probabilities, expected queue length and busy period are obtained. The
optimality problem is also discussed.

33
Chapter 5 explains the steady state probabilities, expected queue length, expected
busy period and optimality problem of a state dependent M/M/1 single and batch
service queue under the policy (a, c, d) with multiple vacations. Here the customers
are served by a single server either one at a time or in batches with different service
rates depending on the number of units in the queue and the arrival process is assumed
to be Poisson with parameter λ. In this model if the number of units in the queue is
less than a specified limit c,the server serves the units manually according to FCFS
rule and the service time distribution is assumed to be exponential with parameter
µ1 (Type I service). If n ≥ c, the server serves a maximum of d units in a batch and
the service time distribution is assumed to be exponential with parameter µ2 (Type
II service). If after a Type II service completion epoch, the number of units in the
queue is less than c but not less than a secondary limit a, the server serves them
altogether in a batch and the service time distribution is assumed to be exponential
with parameter µ3 , (a ≤ c ≤ d) (Type III service).When n less than a, the server
will go for a vacation which is assumed to be exponential with parameter β. After
a vacation, when he returns, if the queue length is less than c, he leaves for another
vacation and so on until he finds at least c customers in the queue.
A discrete time single and batch service queue under the policy (a, c, d) is discussed
in chapter 6. Here the customers are served by a single server. The inter-arrival
times are assumed to be independent and geometrically distributed. In this model
if the number of units in the queue is less than a specified limit c, the server serves
a single customer at a time and if the number of units in the queue exceeds the
specified limit c the server adopts batch service and serves a maximum of d units
in a batch. When a batch service is started, the server may continue to serve in
34
batches even when the queue size is less than c, but not less than a secondary limit a
(a ≤ c ≤ d), after a service completion epoch. When the queue size is less than a, the
server begins manual service. The service times are also assumed to be independent
and geometrically distributed. Various characteristics of the queueing system are
presented in this Chapter.
Chapter 7 explains the transient and steady state behavior and some performance
measures of a controllable bulk service queueing system with accessible and non ac-
cessible batches. A controllable bulk service queueing system with two servers, where
the server I allows accessibility to the batches of ongoing service is considered is con-
sidered in this chapter. The arrival process is assumed to be Poisson and the service
rule is as follows: When the system size is less than or equal to the control limit b1 , the
server I is busy and he serves them altogether in a batch, incorporating accessibility
to the batches while the service is on progress and if the system size is more than b1 ,
then both the servers (server I and server II) are busy, where server I serves b1 units
in a batch and from the remaining server II serves a maximum of (b2 − b1 ), where
b2 > b1 units in a batch according to FCFS rule with out accessibility to the batches.
The service time distributions are assumed to be exponential.
Finally a concluding remarks is given in the last section.

06 Chapter1

Enviado por

Dados do documento

Título original

Direitos autorais

Formatos disponíveis

Compartilhar este documento

Compartilhar ou incorporar documento

Opções de compartilhamento

Você considera este documento útil?

Este conteúdo é inapropriado?

Direitos autorais:

Formatos disponíveis

06 Chapter1

Enviado por

Direitos autorais:

Formatos disponíveis

Chapter 1

be invoked. Queues provide services in computer science, transport and operations

represents a certain number of customers waiting for service. If a queue is finite,

of applications of queueing theory. This assumption is made because it makes the

is called queueing time.

vital informations by forcasting or predicting various characteristics and parameters

be taken as an application of stochastic processes. Also, queueing theory is generally

considered as a branch of operational research became the resources needed to provide

service. Then queueing theory is an application of stochastic processes in operations

Erlang, Agner Krarup (1909,1917), an engineer at Copenhagen telephone exchange

started to work on applying the theory of probabilities to problems of telephone traf-

trial problems (production, scheduling and maintenance), congestions in road traffic,

scheduling of air traffic at the airport, waiting in a hospital outpatient department,

inventory control, machines waiting to be serviced by repairmen, physical processes

(operation of a set of dams), epidemic process in biology, intelligent transportation

systems, call centers, networks, telecommunications terminals, advanced telecommu-

named processor or machine.

1.2 Queueing System

tenance systems, communication systems and so on. An accurate representation of a

queueing system require a detailed characterization of the underlying processes. The

following are the six basis characteristics of a queueing system.

1. Arrival pattern of customers : The arrival process consists of describing how

inter-arrival time). If the arrival pattern is deterministic, then it is fully determined

distribution, Degenerate distribution (deterministic), Erlangian type with parameter

k(k=1,2,3...), Hyper exponential distribution and Phase type distribution.

otherwise, it is called non-stationary.

2. Service pattern of servers : Much of the discussions concerning the arrival

pattern is appropriate in discussing service patterns. Service pattern can also be

it can be stationary or non-stationary. One important difference between arrival

situation, it is called state-dependent service. The problems of customer impatience

can be looked upon as ones of state-dependent arrivals.

until the customer presently in service is completed.

under what circumstance arriving customers must balk.

5. Service channels : The number of service channels refers to the number of

leaving the system. They are called tandem queues.

1.3 Queueing processes

1. Distribution of the number of customers N(t) in the system or queue at time t.

interest is the distribution of Wn .

wait has he arrive at time t.

4. Distribution of the busy period.

It is clear that N (t), t ≥ 0, W (t), t ≥ 0 and Wn , n ≥ 0 are stochastic processes, the

studied in queueing theory may be grouped as

1. Stochastic behavior of various r.v’s and evaluation of various performance mea-

2. Method of solution - exact, transform, algorithmic, asymptotic, numerical, approx-

3. Nature of solution - time dependent, limiting form etc.

4. Control and design of queues - comparison of behavior and performances under

5. Optimization of specific objective functions involving performance measures, asso-

ciated cost functions etc.

where A is inter-arrival time distribution, B is the service time distribution, X is the

M - Poisson arrival and exponential service time distribution

D - Degenerate distribution (deterministic)

Ek - Erlangian type with parameter k (k=1,2,3...)

H - Hyper exponential distribution

P H - Phase type distribution.

1. Queue length : The number of customers waiting in the queue.

queue before getting service.

for service in the system before leaving.

1.5 Different Methods for the Analysis of a Queue-

system. The solution of a queueing system depending on time is called transient

distribution. The steady state distribution is a balanced equilibrium solution, which

chain is extracted from a non-Markov process by using regeneration points due to

measures of a queueing system.

2. The Supplementary Variable Method : The use of supplementary variable, as