Algorithms for Selecting Informative Marker Panels for Population Assignment
dc.contributor.author | Rosenberg, Noah A. | en_US |
dc.date.accessioned | 2009-07-10T19:13:22Z | |
dc.date.available | 2009-07-10T19:13:22Z | |
dc.date.issued | 2005-11-01 | en_US |
dc.identifier.citation | Rosenberg, Noah A. (2005). "Algorithms for Selecting Informative Marker Panels for Population Assignment." Journal of Computational Biology 12(9): 1183-1201 <http://hdl.handle.net/2027.42/63393> | en_US |
dc.identifier.uri | https://hdl.handle.net/2027.42/63393 | |
dc.identifier.uri | http://www.ncbi.nlm.nih.gov/sites/entrez?cmd=retrieve&db=pubmed&list_uids=16305328&dopt=citation | en_US |
dc.description.abstract | Given a set of potential source populations, genotypes of an individual of unknown origin at a collection of markers can be used to predict the correct source population of the individual. For improved efficiency, informative markers can be chosen from a larger set of markers to maximize the accuracy of this prediction. However, selecting the loci that are individually most informative does not necessarily produce the optimal panel. Here, using genotypes from eight species—carp, cat, chicken, dog, fly, grayling, human, and maize—this univariate accumulation procedure is compared to new multivariate "greedy" and "maximin" algorithms for choosing marker panels. The procedures generally suggest similar panels, although the greedy method often recommends inclusion of loci that are not chosen by the other algorithms. In seven of the eight species, when applied to five or more markers, all methods achieve at least 94% assignment accuracy on simulated individuals, with one species—dog— producing this level of accuracy with only three markers, and the eighth species—human— requiring ∼13–16 markers. The new algorithms produce substantial improvements over use of randomly selected markers; where differences among the methods are noticeable, the greedy algorithm leads to slightly higher probabilities of correct assignment. Although none of the approaches necessarily chooses the panel with optimal performance, the algorithms all likely select panels with performance near enough to the maximum that they all are suitable for practical use. | en_US |
dc.format.extent | 664491 bytes | |
dc.format.extent | 2489 bytes | |
dc.format.mimetype | application/pdf | |
dc.format.mimetype | text/plain | |
dc.publisher | Mary Ann Liebert, Inc., publishers | en_US |
dc.title | Algorithms for Selecting Informative Marker Panels for Population Assignment | en_US |
dc.type | Article | en_US |
dc.subject.hlbtoplevel | Health Sciences | en_US |
dc.description.peerreviewed | Peer Reviewed | en_US |
dc.identifier.pmid | 16305328 | en_US |
dc.description.bitstreamurl | http://deepblue.lib.umich.edu/bitstream/2027.42/63393/1/cmb.2005.12.1183.pdf | |
dc.identifier.doi | doi:10.1089/cmb.2005.12.1183 | en_US |
dc.identifier.source | Journal of Computational Biology | en_US |
dc.identifier.source | Journal of Computational Biology | en_US |
dc.owningcollname | Interdisciplinary and Peer-Reviewed |
Files in this item
Remediation of Harmful Language
The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.
Accessibility
If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.