John Heidemann / Papers / Mapping the Expansion of Google’s Serving Infrastructure

Mapping the Expansion of Google’s Serving Infrastructure
Matt Calder, Xun Fan, Zi Hu, Ethan Katz-Bassett, John Heidemann and Ramesh Govindan
USC/Information Sciences Institute

Citation

Matt Calder, Xun Fan, Zi Hu, Ethan Katz-Bassett, John Heidemann and Ramesh Govindan. Mapping the Expansion of Google’s Serving Infrastructure. Technical Report TR 13-935. University of Southern California Computer Science Department. [PDF] [alt PDF]

Abstract

Modern content-distribution networks both provide bulk content and act as “serving infrastructure” for web services in order to reduce user-perceived latency. These serving infrastructures (such as Google’s) are now critical to the online economy, making it imperative to understand their size, geographic distribution, and growth strategies. To this end, we develop techniques that enumerate servers in these infrastructures, find their geographic location, and identify the association between clients and servers. While general techniques for server enumeration and geolocation can exhibit large error, our techniques exploit the design and mechanisms of serving infrastructure to improve accuracy. We use the EDNS-client-subnet extension to DNS to measure which clients a service maps to which of its servers. We devise a novel technique that uses this mapping to geolocate servers by combining noisy information about client locations with speed-of-light constraints. We demonstrate that this technique substantially improves geolocation accurate relative to existing approaches. We also cluster servers into physical sites by measuring RTTs and adapting the cluster thresholds dynamically. Google’s serving infrastructure has grown dramatically in the last six months, and we use our methods to chart its growth and understand its content serving strategy. We find that Google has almost doubled in size, and that most of the growth has occurred by placing servers in large and small ISPs across the world, not by expanding on Google’s backbone.

Bibtex Citation

@techreport{Calder13b,
  author = {Calder, Matt and Fan, Xun and Hu, Zi and Katz-Bassett, Ethan and Heidemann, John and Govindan, Ramesh},
  title = {Mapping the Expansion of {Google's} Serving Infrastructure},
  institution = {University of Southern California Computer Science Department},
  year = {2013},
  sortdate = {2013-06-01},
  project = {ant},
  jsubject = {omit},
  number = {TR 13-935},
  month = jun,
  jlocation = {johnh: pafile},
  keywords = {CDN mapping, clustering, google},
  url = {https://ant.isi.edu/%7ejohnh/PAPERS/Calder13b.html},
  pdfurl = {https://ant.isi.edu/%7ejohnh/PAPERS/Calder13b.pdf},
  myorganization = {USC/Information Sciences Institute},
  copyrightholder = {authors}
}
Copyright © by John Heidemann