Algorithms for Selecting Informative Marker Panels for Population Assignment

Rosenberg, Noah A.

Algorithms for Selecting Informative Marker Panels for Population Assignment

dc.contributor.author	Rosenberg, Noah A.	en_US
dc.date.accessioned	2009-07-10T19:13:22Z
dc.date.available	2009-07-10T19:13:22Z
dc.date.issued	2005-11-01	en_US
dc.identifier.citation	Rosenberg, Noah A. (2005). "Algorithms for Selecting Informative Marker Panels for Population Assignment." Journal of Computational Biology 12(9): 1183-1201 <http://hdl.handle.net/2027.42/63393>	en_US
dc.identifier.uri	https://hdl.handle.net/2027.42/63393
dc.identifier.uri	http://www.ncbi.nlm.nih.gov/sites/entrez?cmd=retrieve&db=pubmed&list_uids=16305328&dopt=citation	en_US
dc.description.abstract	Given a set of potential source populations, genotypes of an individual of unknown origin at a collection of markers can be used to predict the correct source population of the individual. For improved efficiency, informative markers can be chosen from a larger set of markers to maximize the accuracy of this prediction. However, selecting the loci that are individually most informative does not necessarily produce the optimal panel. Here, using genotypes from eight species—carp, cat, chicken, dog, fly, grayling, human, and maize—this univariate accumulation procedure is compared to new multivariate "greedy" and "maximin" algorithms for choosing marker panels. The procedures generally suggest similar panels, although the greedy method often recommends inclusion of loci that are not chosen by the other algorithms. In seven of the eight species, when applied to five or more markers, all methods achieve at least 94% assignment accuracy on simulated individuals, with one species—dog— producing this level of accuracy with only three markers, and the eighth species—human— requiring ∼13–16 markers. The new algorithms produce substantial improvements over use of randomly selected markers; where differences among the methods are noticeable, the greedy algorithm leads to slightly higher probabilities of correct assignment. Although none of the approaches necessarily chooses the panel with optimal performance, the algorithms all likely select panels with performance near enough to the maximum that they all are suitable for practical use.	en_US
dc.format.extent	664491 bytes
dc.format.extent	2489 bytes
dc.format.mimetype	application/pdf
dc.format.mimetype	text/plain
dc.publisher	Mary Ann Liebert, Inc., publishers	en_US
dc.title	Algorithms for Selecting Informative Marker Panels for Population Assignment	en_US
dc.type	Article	en_US
dc.subject.hlbtoplevel	Health Sciences	en_US
dc.description.peerreviewed	Peer Reviewed	en_US
dc.identifier.pmid	16305328	en_US
dc.description.bitstreamurl	http://deepblue.lib.umich.edu/bitstream/2027.42/63393/1/cmb.2005.12.1183.pdf
dc.identifier.doi	doi:10.1089/cmb.2005.12.1183	en_US
dc.identifier.source	Journal of Computational Biology	en_US
dc.identifier.source	Journal of Computational Biology	en_US
dc.owningcollname	Interdisciplinary and Peer-Reviewed

Files in this item

Name:: cmb.2005.12.1183.pdf
Size:: 648.9KB
Format:: PDF

View/Open

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.