This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Scale-Out Networking in the Data Center
July/August 2010 (vol. 30 no. 4)
pp. 29-41
Amin Vahdat, University of California, San Diego
Mohammad Al-Fares, University of California, San Diego
Nathan Farrington, University of California, San Diego
Radhika Niranjan Mysore, University of California, San Diego
George Porter, University of California, San Diego
Sivasankar Radhakrishnan, University of California, San Diego

Scale-out architectures supporting flexible, incremental scalability are common for computing and storage. However, the network remains the last bastion of the traditional scale-up approach, making it the data center's weak link. Through the UCSD Triton network architecture, the authors explore issues in managing the network as a single plug-and-play virtualizable fabric scalable to hundreds of thousands of ports and petabits per second of aggregate bandwidth.

1. L.A. Barroso, J. Dean, and U. Hoelzle, "Web Search for a Planet: The Google Cluster Architecture," IEEE Micro, vol. 23, no. 2, Mar.-Apr. 2003, pp. 22-28.
2. J. Dean and S. Ghemawat, "MapReduce: Simplified Data Processing on Large Clusters," Proc. Symp. Operating Systems Design & Implementation (OSDI 04), Usenix Assoc., pp. 137-150.
3. M. Isard et al., "Dryad: Distributed Data-parallel Programs from Sequential Building Blocks," ACM Special Interest Group on Operating Systems (SIGOPS) Operating Systems Rev., vol. 41, no. 3, 2007, pp. 59-72.
4. M. Al-Fares, A. Loukissas, and A. Vahdat, "A Scalable, Commodity, Data Center Network Architecture," ACM Special Interest Group on Data Communication (SIGCOMM) Computer Comm. Rev., vol. 38, no. 4, Oct. 2008, pp. 63-74.
5. B. Heller et al., "ElasticTree: Saving Energy in Data Center Networks," Proc. Usenix Symp. Networked Systems Design and Implementation (NSDI 10), Usenix Assoc., 2010; www.usenix.org/events/nsdi10/tech/full_papers heller.pdf.
6. A. Greenberg et al., "VL2: A Scalable and Flexible Data Center Network," Proc. ACM Special Interest Group on Data Communication Conf. Data Comm. (SIGCOMM 09), ACM Press, 2009, pp. 51-62.
7. W.D. Hillis and L.W. Tucker, "The CM-5 Connection Machine: A Scalable Supercomputer," Comm. ACM, vol. 36, no. 11, 1993, pp. 31-40.
8. N. Farrington, E. Rubow, and A. Vahdat, "Data Center Switch Architecture in the Age of Merchant Silicon," Proc. IEEE Symp. High Performance Interconnects (HOTI 09), IEEE CS Press, 2009, pp. 93-102.
9. C. Guo et al., "BCube: A High Performance, Server-Centric Network Architecture for Modular Data Centers," Proc. ACM Special Interest Group on Data Communication Conf. Data Comm. (SIGCOMM 09), ACM Press, 2009, pp. 63-74.
10. R.N. Mysore et al., "PortLand: A Scalable, Fault-Tolerant Layer 2 Data Center Network Fabric," Proc. ACM Special Interest Group on Data Communication Conf. Data Comm. (SIGCOMM 09), ACM Press, 2009, pp. 39-50.
11. N. McKeown et al., "OpenFlow: Enabling Innovation in Campus Networks," ACM Special Interest Group on Data Communication (SIGCOMM) Computer Comm. Rev., vol. 38, no. 2, Apr. 2008, pp. 69-74.
12. M. Al-Fares et al., "Hedera: Dynamic Flow Scheduling for Data Center Networks," Proc. Usenix Symp. Networked Systems Design and Implementation (NSDI 10), Usenix Assoc., 2010; www.usenix.org/events/nsdi10/tech/slides al-fares.pdf.
13. N. Gude et al., "NOX: Towards an Operating System for Networks," ACM Special Interest Group on Data Communication (SIGCOMM) Computer Comm. Rev., vol. 38, no. 3, 2008, pp. 105-110.
14. S. Kandula et al., "The Nature of Data Center Traffic: Measurements & Analysis," Proc. ACM Special Interest Group on Data Communication (SIGCOMM) Internet Measurement Conf. (IMC 09), ACM Press, 2009, pp. 202-208.
15. A. Phanishayee et al., "Measurement and Analysis of TCP Throughput Collapse in Cluster-based Storage Systems," Proc. Usenix Conf. File and Storage Technologies (FAST 08), Usenix Assoc., 2008, pp. 175-188; www.usenix.org/events/fast08/tech/full_papers/ phanishayeephanishayee.pdf.
16. M. Gupta and S. Singh, "Greening of the Internet," Proc. Conf. Applications, Technologies, Architectures, and Protocols for Computer Comm., ACM Press, 2003, pp. 19-26.
1. A. Greenberg et al., "VL2: A Scalable and Flexible Data Center Network," Proc. ACM Special Interest Group on Data Communication Conf. Data Comm. (SIGCOMM 09), ACM Press, 2009, pp. 51-62.
2. C. Guo et al., "DCell: A Scalable and Fault-Tolerant Network Structure for Data Centers," Proc. ACM Special Interest Group on Data Communication (SIGCOMM) Computer Comm. Rev., vol. 38, no. 4, 2008, pp. 75-86.
3. C. Guo et al., "BCube: A High Performance, Server-Centric Network Architecture for Modular Data Centers," Proc. ACM Special Interest Group on Data Communication Conf. Data Comm. (SIGCOMM 09), ACM Press, 2009, pp. 63-74.
4. J. Touch and R. Perlman, Transparent Interconnection of Lots of Links (TRILL): Problem and Applicability Statement, IETF RFC 5556, May 2009; www.rfc-editor.org/rfcrfc5556.txt.
5. C. Kim, M. Caesar, and J. Rexford, "Floodless in SEATTLE: A Scalable Ethernet Architecture for Large Enterprises," Proc. ACM Special Interest Group on Data Communication Conf. Data Comm. (SIGCOMM 08), ACM Press, 2008, pp. 3-14.
6. M. Scott and J. Crowcroft, "MOOSE: Addressing the Scalability of Ethernet," poster session, ACM Special Interest Group on Operating Systems (SIGOPS) European Conf. Computer Systems, 2008; www.cl.cam.ac.uk/~mas90/MOOSEEuroSys2008-Poster+abstract.pdf .
7. A. Greenberg et al., "Towards a Next Generation Data Center Architecture: Scalability and Commoditization," Proc. ACM Workshop Programmable Routers for Extensible Services of Tomorrow (PRESTO 08), ACM Press, 2008, pp. 57-62.
8. S. Kandula et al., "Walking the Tightrope: Responsive Yet Stable Traffic Engineering," ACM Special Interest Group on Data Communication (SIGCOMM) Computer Comm. Rev., vol. 35, no. 4, 2005, pp. 253-264.
9. A. Elwalid et al., "MATE: MPLS Adaptive Traffic Engineering," Proc. IEEE Int'l Conf. Computer Comm. (INFOCOM 01), IEEE Press, vol. 3, 2001, pp. 1300-1309.
10. S. Sinha, S. Kandula, and D. Katabi, "Harnessing TCPs Burstiness Using Flowlet Switching," Proc. ACM Special Interest Group on Data Communication (SIGCOMM) Workshop Hot Topics in Networks (HotNets 04), ACM Press, 2004; http://nms.csail.mit.edu/papersflare-hotnet04.ps .
11. M. Zhang et al., "RR-TCP: A Reordering-Robust TCP with DSACK," Proc. IEEE Int'l Conf. Network Protocols (ICNP 03), IEEE CS Press, 2003, p. 95.
12. S. Bohacek et al., "A New TCP for Persistent Packet Reordering," IEEE/ACM Trans. Networking (TON), vol. 14, no. 2, 2006, pp. 369-382.

Index Terms:
datacenter network architecture, scale-out networking, merchant silicon, L2 fabric, datacenter routing, network fault-tolerance.
Citation:
Amin Vahdat, Mohammad Al-Fares, Nathan Farrington, Radhika Niranjan Mysore, George Porter, Sivasankar Radhakrishnan, "Scale-Out Networking in the Data Center," IEEE Micro, vol. 30, no. 4, pp. 29-41, July-Aug. 2010, doi:10.1109/MM.2010.72
Usage of this product signifies your acceptance of the Terms of Use.