Show simple item record

dc.contributor.authorDíaz, Oscar A.
dc.date.accessioned2012-07-02T20:23:58Z
dc.date.available2012-07-02T20:23:58Z
dc.date.issued2012-07-02
dc.date.submittedMay 2012
dc.identifier.urihttp://hdl.handle.net/1928/20764
dc.description.abstractWireline high-speed networks have become a critical part of modern cyberinfra-structures and provide the base substrates to support a full range of higher-layer user services and applications. Indeed, a wide range of technologies have been deployed in these domains, ranging from ultra-fast Internet Protocol (IP) packet routing systems to multi-wavelength optical switching nodes. Today these setups provide immense levels of traffic scalability, reaching well into the 100s of gigabits/second and even terabits/second ranges. Owing to this growth, network survivability is now a central concern, as even a single link or node failure can cause widespread service disruption for thousands of users or more. Now over the years, a full range of network survivability schemes have been developed for packet routing and optical switching networks. Indeed, the open research literature lists many types of solutions here, broadly classified as pre-fault protection and post-fault restoration strategies. The former schemes pro-actively set up backup (redundant) resource pools to overcome anticipated failure events. Meanwhile, the latter strategies are more reactive by design and attempt to re-establish connectivity after failures. By and large, the bulk of these solutions are only concerned with single failure recovery, i.e., either at the link or node level. In general, these are the most common types of faults events experienced in operational networks. However, recent developments and considerations are pushing the need for more capable schemes to recover from multiple failure events, i.e., as occurring during natural disasters, massive power outages, and weapon of massive destruction (WMD) type attacks. Indeed, these types of scenarios are much more challenging, as they induce large numbers of correlated failures which can quickly overwhelm most traditional single-failure recovery schemes. Along these lines, some recent studies have looked at network recovery under massive correlated network failures. The key idea here is to introduce probabilistic risk information into the path provisioning (routing, protection) processes in order to minimize vulnerability to random failures. However, even though these schemes can reduce connections failure rates, they yield very high resource inefficiencies (usage consumption). In turn, these concerns will inhibit their adoption in most practical network settings, as operators have to balance the need for improved resiliency with revenue generation. To address this challenge, this thesis proposes a novel multi-failure survivability scheme that jointly incorporates both risk mitigation and traffic engineering (TE) efficiency objectives. In particular, the approach leverages multi-path routing strategies to first compute a selection of diverse working/backup path pairs and then uses ranking methods to select the most balanced combination. This framework applies graph-theoretic principles and hence can readily be integrated into real-world traffic provisioning systems. The performance of the proposed solution is evaluated using discrete event simulation techniques for a variety of network topologies and compared against several existing schemes. Overall findings show that the scheme yields notably improved survivability rates as compared to vanilla traffic engineering policies. At the same time, it also gives much better operational resource efficiencies versus existing probabilistic risk reduction routing strategies. Hence network carriers can fully leverage this new design to achieve much-improved reliability for critical data flows without sacrificing operational revenues.en_US
dc.description.sponsorshipDTRA: Defense Threat Reduction Agencyen_US
dc.language.isoen_USen_US
dc.subjectNetwork survivabilityen_US
dc.subjectMulti-failure recoveryen_US
dc.subjectProbabilistic routingen_US
dc.subjectTraffic engineeringen_US
dc.subject.lcshComputer networks--Reliability.
dc.subject.lcshFault-tolerant computing.
dc.subject.lcshData recovery (Computer science)
dc.subject.lcshPacket switching (Data transmission)--Computer simulation.
dc.titleDesign and evaluation of network survivability schemes for correlated multi-failure scenariosen_US
dc.typeThesisen_US
dc.description.degreeComputer Engineeringen_US
dc.description.levelMastersen_US
dc.description.departmentUniversity of New Mexico. Dept. of Electrical and Computer Engineeringen_US
dc.description.advisorGhani, Nasir
dc.description.committee-memberHayat, Majeed M.
dc.description.committee-memberPattichis, Marios S.


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record