Escolar Documentos
Profissional Documentos
Cultura Documentos
Architecture
Maotong Xu, Chong Liu, and Suresh Subramaniam
Department of Electrical and Computer Engineering
The George Washington University
Email: htfy8927@gwu.edu, cliu15@gwu.edu and suresh@gwu.edu
I. INTRODUCTION
Power consumption, latency, and scalability are critical
factors in designing data center network (DCN) architectures.
Power consumption of data centers will reach 140 billion
kilowatt-hours annually by 2020, and it will cost $13 billion
annually [2]. On the other hand, interactive applications,
e.g., web search, social network, and stock exchange, require
low network latency; e.g., the acceptable latency range of
stock exchange is 5-100 ms [3]. Furthermore, as data volume
increases, the sizes of data centers scale up tremendously.
Microsoft owns over 1 millions servers and its Chicago data
center alone is estimated to contain over 250,000 servers [4].
Considering reduced power consumption, low latency, and
high scalability, optical networking is a promising solution for
current data centers [5]. Optical networks are commonly based
on optical circuit switching, e.g., Micro-Electro-Mechanical
Systems Switches (MEMS) and Arrayed Waveguide Grating
Router (AWGR). MEMS is a power-driven reconfigurable
optical switch and its reconfiguration time is in the order
of a few milliseconds [6], and is not best suited for fast
packet switching in DCN applications. Compared to MEMS,
AWGR is a passive optical device that does not require
reconfiguration, and can achieve contention resolution in the
wavelength domain. The cyclic routing characteristic of the
AWGR allows different inputs to reach the same output simul-
taneously by using different wavelengths. Previous AWGRbased architectures, e.g., DOS [1] and Petabit [7], employ
Tunable Wavelength Converters (TWC), which are powerhungry devices [5]. Moreover, TWC highly increases the total
capital expenditure (CapEx) of architectures.
Our solution is to employ passive optical devices, e.g.,
AWGR, coupler and demultiplexer, to reduce power consumption, while achieving low packet latency and high throughput.
Moreover, our architecture is highly scalable and it can easily
accommodate more than 2 million servers. The main contributions of our work are as follows:
We propose three different-sized architectures smallsized, medium-sized and large-sized passive optical architectures, and present the wavelength assignment and
packet transmission algorithm for each.
We obtain the packet latency and network throughput of
the three architectures through simulations. Simulation
results show that packet latencies of under 9s and a
throughput of 100% are achievable. We also present
evaluation results for a number of different architectural
parameters such as the the number of tunable transmitters
per rack,1 the type of architecture, as well as packet size.
We compare the power consumption and CapEx of our
architectures with DOS, Proteus [6], and Petabit. Results
show that our architectures can save up to 90% on power
consumption, and 88% on CapEx.
The rest of this paper is organized as follows: Section
II provides three optical data center architectures and corresponding wavelength assignment and packet transmission
algorithms. Section III describes the performance evaluation
of the architectures and the simulation results. In Section
IV, we compare the proposed architectures with three other
all-optical architectures in terms of power consumption and
CapEx. Finally, we conclude the paper in Section V.
II. P ROPOSED A RCHITECTURES AND A LGORITHMS
Our architectures are hierarchical, and use an AWGR to
interconnect racks. Racks that are connected to each port of
the AWGR are said to form a cluster, and the entire DCN is
a group of clusters. Suppose each cluster has M racks, i.e.,
d PS e=M , where S is the total number of racks and P is the
1 The term rack and ToR (for top-of-rack) are used interchangeably in
the paper.
T1,1
ToR
Switches
N1 Coupler
ToR
Switches
1N Demux
C1
D1
PP
CAWG
R1,1
... ...
A. PODCA-S
Central
Control
... ...
...
(1)
TP,1
CP
Tunable TX
1~W
RP,1
DP
...
mod P ] + 1, 1 i P, 1 c W.
...
[(i + c 2)
...
Fixed WideBand RX
1~W
(2)
(3)
Here, f F 1 because the maximum number of available wavelengths is W =F P . The coupler combines different
wavelengths from different tunable transmitters and outputs
the combined signal to the AWGR. The output signal of
the AWGR is evenly demultiplexed N ways, i.e., the first
wavelengths are demultiplexed to output port 1, the next
wavelengths to port 2, and so on. The pseudocode of the
wavelength assignment algorithm is shown in Algorithm 2.
B. PODCA-M
The PODCA-M architecture is shown in Fig 2. In this case,
P SW . We connect M (> 1) racks to each AWGR port.
An (N M )1 coupler connects all the N transmitters on each
Central
Control
ToR
Switches
M1 Coupler
RP,2
TP,1
...
RP,2
CP
DP
...
TP,M
MM
AWGR
R1,M
RP,1
TP,2
...
Fixed WideBand RX
1~W
MM
AWGR
PP
AWGR
...
...
RP,M
D1
...
...
R1,2
C1
...
... ...
...
T1,M
RP,1
1M Demux
T1,2
...
R1,M
...
Tunable TX
1~W
...
...
...
...
...
DP
ToR
Switches
R1,1
T1,1
... ...
CP
ToR
Switches
MM
AWGR
...
PP
AWGR
Central
Control
R1,2
... ...
TP,2
R1,1
...
...
D1
... ...
TP,1
...
C1
... ...
T1,M
TP,M
1NM Demux
...
...
T1,2
ToR
Switches
NM1 Coupler
... ...
... ...
T1,1
MM
AWGR
RP,M
Tunable TX
1~W
Fixed WideBand RX
1~W
(5)
(6)
(7)
(sit >ir ),
(8)
If it ir and sjr sit ir 6=, the packet only needs intercluster transmission. Titt ,jt can tune to a wavelength in
sjr sit ir , which is the wavelength that can be received by
Rrir ,jr using inter-cluster transmission.
If it ir and sjr sit ir =, Titt ,jt tunes to one of the
wavelengths in sit ir and the packet needs both inter-cluster
transmission and intra-cluster transmission to arrive at its
destination. Suppose Titt ,jt tunes to c1 , where c1 sit ir . The
packet arrives at Rir ,d M c1 e by using inter-cluster transmission.
W
In another time slot, Rir ,d M c1 e will transmit the packet to
W
Rir ,jr by using intra-cluster transmission.
If it >ir and sjr sit >ir 6=, the packet only needs
inter-cluster transmission. Titt ,jt can tune to wavelength in
sjr sit >ir , and the packet can arrive at Rrir ,jr by using only
inter-cluster transmission.
If it >ir and sjr sit >ir =, Titt ,jt tunes to one of the
wavelengths in sit >ir and the packet needs both inter-cluster
transmission and inter-cluster transmission to arrive at its
destination. Suppose Titt ,jt tunes to c2 , where c2 sit >ir . The
packet first arrives at Rir ,d M c2 e . Rir ,d M c2 e will transmit the
W
W
packet to Rir ,jr using intra-cluster transmission in another
time slot. The wavelength assignment algorithm is shown in
latency (ns)
PODCA-S
PODCA-M
PODCA-L
512
1500
2048
Latency (ns)
5000
4000
IntraTx=1, InterTx=2
IntraTx=2, InterTx=2
IntraTx=1, InterTx=3
IntraTx=2, InterTx=3
3000
2000
1000
0
200
400
600
3 The
Protues
Petabit
DOS
PODCA
1.5
1
0.5
In this section, we conduct simulations to evaluate the latency and throughput performance of PODCA. Packet arrivals
follow a Poisson process. The transmission rate of a tunable
transmitter (and wavelength capacity) is assumed to be 10
Gbps. The tuning time of tunable transmitters is 8 ns [9].
The latency consists of transmission time, queueing delay, and
tuning time. The destination rack of packets is drawn from a
uniform distribution. The shared buffer size of each ToR is
8MB. For PODCA-L, we set the threshold for determining
if the packet uses intra-cluster transmission or inter-cluster
transmission as 100 packets.
10000
9000
8000
7000
6000
5000
4000
3000
2000
1000
0
2.5
Power (Megawatts)
10K
20K
30K
40K
50K
60K
Table I
P OWER CONSUMPTION AND C OST OF THE OPTICAL COMPONENTS IN DIFFERENT ARCHITECTURES
Component
Power(Watts)
Cost (Dollars)
DOS
Proteus
Petabit
SFP transceiver
Fast Wavelength tunable transmitter (WTT)
Fixed wideband receiver (photodetector)
Flexgrid WSS
Tunable Wavelength Converter (TWC)
Arrayed waveguide grating (AWG) per port
MEMS switch per port
Circulator
Coupler
Optical (DE)MUX
Fiber Delay Line
1.0
1.5
0.9
15
20
0.0
0.25
0.0
0.0
0.0
0.0
45
1955
40
1245
8000
15
255
105
195
3000
2000
S
0
0
0
S
0
0
0
0
S
S
32S
0
0
S
0
0
4S
4S
S
2S
0
0
S
0
0
2S
6S
0
0
0
0
0
3.5
Petabit
DOS
Proteus
PODCAS
3
2.5
2
1.5
1
900
Petabit
DOS
Proteus
PODCAM
800
700
50
100
150
200
250
(a)
300
400
300
200
250
Petabit
DOS
Proteus
PODCAL
500
Large
0
2S
2S
0
0
S+P
0
0
P
P
0
600
100
0.5
0
4.5
PODCA
Medium
0
2S
2S
0
0
P
0
0
P
P
0
Small
0
2S
2S
0
0
S
0
0
S
S
0
0
300
350
400
450
500
550
10K
20K
30K
40K
50K
60K
(b)
(c)
Figure 7. CapEx comparison of different architectures: (a) PODCA-S. (b) PODCA-M. (c) PODCA-L.
V. C ONCLUSION
In this paper, we present architectures for a passive optical
data center network. The key component of our architectures
is the arrayed waveguide grating router (AWGR). Based on
simulation results, the packet latency of our architectures is
below 9s and our architectures can achieve 100% throughput.
Compared with other optical architectures in terms of power
consumption and CapEx, results show that our architectures
can save up to 90% power consumption and 88% CapEx.
R EFERENCES
[1] X. Ye, Y. Yin, S. B. Yoo, P. Mejia, R. Proietti, and V. Akella,
DOS: A scalable optical switch for datacenters, in Proceedings of
the 6th ACM/IEEE Symposium on Architectures for Networking and
Communications Systems. ACM, 2010, p. 24.
[2] Pierre Delforge, Available online: http://www.nrdc.org/, 2015.
[3] Available online: https://www.citycloud.com/.
[4] Available online: http://www.datacenterknowledge.com/.
[5] C. Kachris and I. Tomkos, A survey on optical interconnects for data
centers, in IEEE Comm. Surveys & Tutorials, 14(4):1021-1036, 2012.
[6] A. Singla, A. Singh, K. Ramachandran, L. Xu, and Y. Zhang, Proteus:
a topology malleable data center network, in Proceedings of the 9th
ACM SIGCOMM Workshop on Hot Topics in Networks. ACM, 2010.
[7] K. Xia, Y.-H. Kaob, M. Yangb, and H. Chao, Petabit optical switch
for data center networks, Polytechnic Institute of New York University,
New York, Tech. Rep, 2010.
[8] Z. Cao, R. Proietti, M. Clements, and S. Yoo, Demonstration of
scalable, flat, and high-throughput data center architecture based on
arrayed waveguide grating routers, in ECOC, 2014.
[9] S. Matsuo and T. Segawa, Microring-resonator-based widely tunable
lasers, IEEE J. Sel. Topics Quantum Electron, vol. 15, no. 3, pp. 545
554, 2009.
[10] J. Chen, Y. Gong, M. Fiorani, and S. Aleksic, Optical interconnects at
the top of the rack for energy-efficient data centers, Communications
Magazine, IEEE, vol. 53, no. 8, pp. 140148, 2015.
[11] S. Cheung, T. Su, K. Okamoto, and S. Yoo, Ultra-compact silicon
photonic 512 512 25 GHz arrayed waveguide grating router, Selected
Topics in Quantum Electronics, IEEE Journal of, vol. 20, no. 4, pp. 310
316, 2014.