A comparison of genetic algorithms and other machine learning systems on a complex classification task from common disease research.

Congdon, Clare Bates

dc.contributor.author	Congdon, Clare Bates	en_US
dc.contributor.advisor	Holland, John H.	en_US
dc.contributor.advisor	Laird, John E.	en_US
dc.date.accessioned	2014-02-24T16:21:30Z
dc.date.available	2014-02-24T16:21:30Z
dc.date.issued	1995	en_US
dc.identifier.other	(UMI)AAI9527608	en_US
dc.identifier.uri	http://gateway.proquest.com/openurl?url_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation&res_dat=xri:pqm&rft_dat=xri:pqdiss:9527608	en_US
dc.identifier.uri	https://hdl.handle.net/2027.42/104450
dc.description.abstract	This thesis investigates several well-known machine learning systems and evaluates their utility when applied to a classification task from the field of human genetics. This common-disease research task, an inquiry into genetic and biochemical factors and their association with a family history of coronary artery disease (CAD), is more complex than many pursued in machine learning research because of interactions and the inherent noise in the dataset. The task also differs from most pursued in machine learning research because there is a desire to explain the dataset with a small number of rules, even at the expense of accuracy, so that the rules will be more accessible to medical researchers who are unaccustomed to dealing with disjunctive explanations of data. Furthermore, the task is asymmetric: good explanations of the positive examples are more important than good explanations of the negative examples. The primary machine learning approach investigated in this research is genetic algorithms (GAs); decision trees, Autoclass, and Cobweb are also included. The GA performed best in terms of descriptive ability on the common-disease research task, although decision trees also demonstrated certain strengths. Autoclass and Cobweb were recognized from the outset as inappropriate for the needs of common-disease researchers (because both systems are unsupervised learners that create probabilistic structures), but were included because of their interest to the machine learning community; these systems did not perform as well as GAs and decision trees in terms of their ability to describe the data. In terms of predictive accuracy, all systems performed poorly, and the differences between any two of the three best systems are not significant. 
When positive and negative examples are considered separately, the GA does significantly better than the other systems in predicting positive examples and significantly worse in predicting negative examples. The thesis illustrates that investigating "real" problems posed by researchers in other fields can lead machine learning researchers to challenge their systems in ways they might not otherwise have considered, and may lead these researchers to a symbiotic relationship that benefits multiple research communities.	en_US
dc.format.extent	204 p.	en_US
dc.subject	Health Sciences, Public Health	en_US
dc.subject	Artificial Intelligence	en_US
dc.subject	Computer Science	en_US
dc.title	A comparison of genetic algorithms and other machine learning systems on a complex classification task from common disease research.	en_US
dc.type	Thesis	en_US
dc.description.thesisdegreename	PhD	en_US
dc.description.thesisdegreediscipline	Computer Science and Engineering	en_US
dc.description.thesisdegreegrantor	University of Michigan, Horace H. Rackham School of Graduate Studies	en_US
dc.description.bitstreamurl	http://deepblue.lib.umich.edu/bitstream/2027.42/104450/1/9527608.pdf
dc.description.filedescription	Description of 9527608.pdf : Restricted to UM users only.	en_US
dc.owningcollname	Dissertations and Theses (Ph.D. and Master's)

Files in this item

Name: 9527608.pdf
Size: 9.477 MB
Format: PDF
Description: Restricted to UM users only.
