Show simple item record

Designing and reconfiguring fault-tolerant multiprocessor systems.

dc.contributor.authorDutt, Shantanuen_US
dc.contributor.advisorHayes, John P.en_US
dc.date.accessioned2014-02-24T16:26:13Z
dc.date.available2014-02-24T16:26:13Z
dc.date.issued1990en_US
dc.identifier.other(UMI)AAI9116169en_US
dc.identifier.urihttp://gateway.proquest.com/openurl?url_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation&res_dat=xri:pqm&rft_dat=xri:pqdiss:9116169en_US
dc.identifier.urihttps://hdl.handle.net/2027.42/105171
dc.description.abstractThis thesis presents a general theory for designing multiprocessor computer systems that can tolerate faulty processors. It is especially concerned with structural fault tolerance, defined as the ability to reconfigure around faults in order to preserve the interconnection structure of a multiprocessor. A major goal is to model some important practical design features not previously addressed, including applicability to any multiprocessor structure and any number of faults. Low hardware overhead and efficient reconfigurability are also important goals. The systems of interest and their faults are represented by graphs, and reconfiguration is modeled by graph-to-graph mappings that replace faulty structures by nonfaulty ones. Within this framework, two general design methodologies for fault tolerance are defined. The first approach called node covering performs reconfiguration by mapping a node (processor) to one of a specific subset of other nodes called its covers. The relation between nodes and their covers is represented efficiently by covering graphs. We show how to design k-fault-tolerant trees from their covering graphs. The resulting designs are near-optimal with respect to hardware cost. We also generalize the node-covering approach to arbitrary multiprocessor graphs, and demonstrate that the resulting fault-tolerant designs have low-cost practical implementations. Our second design theory uses graph automorphisms to represent the reconfiguration process. We demonstrate the efficacy of this theory by applying it to hypercube multiprocessors, and obtain fault-tolerant designs that are superior to those proposed in previous work. We also apply automorphisms to local sparing, which associates spare nodes with disjoint groups of processors to simplify reconfiguration. Two local sparing techniques are developed, one to tolerate fault clusters, and the other to tolerate arbitrary faults. The theory underlying the node-covering and automorphic methods, and the relation between them are studied in depth. The two design methods are shown to complement each other over a wide range of different multiprocessor systems. We also demonstrate that they successfully address all the practical design criteria stated earlier. They thus provide a powerful new methodology for designing and evaluating fault-tolerant multiprocessor systems.en_US
dc.format.extent218 p.en_US
dc.subjectEngineering, Electronics and Electricalen_US
dc.subjectComputer Scienceen_US
dc.titleDesigning and reconfiguring fault-tolerant multiprocessor systems.en_US
dc.typeThesisen_US
dc.description.thesisdegreenamePhDen_US
dc.description.thesisdegreedisciplineComputer Science and Engineeringen_US
dc.description.thesisdegreegrantorUniversity of Michigan, Horace H. Rackham School of Graduate Studiesen_US
dc.description.bitstreamurlhttp://deepblue.lib.umich.edu/bitstream/2027.42/105171/1/9116169.pdf
dc.description.filedescriptionDescription of 9116169.pdf : Restricted to UM users only.en_US
dc.owningcollnameDissertations and Theses (Ph.D. and Master's)


Files in this item

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.