Detecting Internet Outages with Active Probing (extended)
Lin Quan and John HeidemannUSC/Information Sciences Institute
Abstract
With businesses, governments, and individuals increasingly dependent on the Internet, understanding its reliability is more important than ever. Network outages vary in scope and cause, from the intentional shutdown of the Egyptian Internet in February 2011, to outages caused by the effects of March 2011 earthquakes on undersea cables entering Japan, to the thousands of small, daily outages caused by localized accidents or human error. In this paper we present a new method to detect network outages by probing entire blocks. Using 24 datasets, each a 2-week study of 22,000 /24 address blocks randomly sampled from the Internet, we develop new algorithms to identify and visualize outages and to cluster those outages into network-level events. We validate our approach by comparing our data-plane results against control-plane observations from BGP routing and news reports, examining both major and randomly selected events. We confirm our results are stable from two different locations and over more than one and half years of observations. We show that our approach of probing all addresses in a /24 block is significantly more accurate than prior approaches that use a single representative for all routed blocks, cutting the number of mistake outage observations from 44% to under 1%. We use our approach to study several large outages such as those mentioned above. We also develop a general estimate for how much of the Internet is regularly down, finding about 0.3% of the Internet is likely to be unreachable at any time. By providing a baseline estimate of Internet outages, our work lays the groundwork to evaluate ISP reliability.Availability
This paper is available in several formats: abstract web page with pointers and cites, PDF, paper copies can be obtained by mail to the authors. Copyright terms for this paper appear below.
Reference
- Quan11a
- Lin Quan and John Heidemann. Detecting Internet Outages with Active Probing (extended). Technical Report ISI-TR-2011-672, USC/Information Sciences Institute, May, 2010. <http://www.isi.edu/~johnh/PAPERS/Quan11a.html>.
@techreport{Quan11a,
author = "Lin Quan and John Heidemann",
title = "Detecting Internet Outages with Active Probing (extended)",
institution = "USC/Information Sciences Institute",
year = "2010",
number = "ISI-TR-2011-672",
month = "May",
keywords = "routing outage detection, active probing,
ntework outages",
url = "http://www.isi.edu/~johnh/PAPERS/Quan11a.html",
pdfurl = "http://www.isi.edu/~johnh/PAPERS/Quan11a.pdf",
myorganization = "USC/Information Sciences Institute",
copyrightholder = "authors",
}
Copyright
This paper is copyright © 2010 by its authors. Permission to make digital or hard copies of part or all of this work for personal use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that new copies bear this notice and the full citation on the first page. Abstracting with credit is permitted.To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission of the authors.