TerraServer SAN-Cluster Architecture and Operations Experience
Tom Barclay, Jim Gray

TL;DR
This paper discusses the design, implementation, and three-year operational experience of the TerraServer SAN-Cluster architecture, highlighting its high availability and practical challenges faced.
Contribution
It provides a detailed case study of deploying a large-scale, high-availability SAN-cluster system for a major online geospatial database.
Findings
Achieved over 99.99% system availability
Operational mistakes reduced actual availability to 99.9%
System demonstrated scalability and reliability over three years
Abstract
Microsoft TerraServer displays aerial, satellite, and to-pographic images of the earth in a SQL database available via the Internet. It is one of the most popular online at-lases, presenting seventeen terabytes of image data from the United States Geological Survey (USGS). Initially de-ployed in 1998, the system demonstrated the scalability of PC hardware and software - Windows and SQL Server - on a single, mainframe-class processor. In September 2000, the back-end database application was migrated to 4-node active/passive cluster connected to an 18 terabyte Storage Area Network (SAN). The new configuration was designed to achieve 99.99% availability for the back-end application. This paper describes the hardware and software components of the TerraServer Cluster and SAN, and describes our experience in configuring and operating this system for three years. Not surprisingly, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Computational Techniques and Applications · Data Management and Algorithms · Advanced Database Systems and Queries
