Show simple item record

Computational Methods for Characterizing Post-translational and Chemical Modifications Found in Open Searches

dc.contributor.authorGeiszler, Daniel
dc.date.accessioned2022-09-06T16:11:35Z
dc.date.available2022-09-06T16:11:35Z
dc.date.issued2022
dc.date.submitted2022
dc.identifier.urihttps://hdl.handle.net/2027.42/174422
dc.description.abstractPost-translational modifications (PTMs) govern many processes within cells and understanding their function is critical to both the basic and biomedical sciences. However, identifying modified peptides, particularly unexpected and rare modifications, remains a challenge to proteomics researchers. Recent advances in proteomics search tools have expanded the capacity to identify the entire modification landscape in an unbiased way, but the modifications identified in this manner—called “open searching”—require extensive post-processing to elucidate their identities. In this dissertation, I develop computational methods to characterize and identify modifications derived from open searches. In Chapter Two, I develop a method for comprehensively characterizing open search results, PTM-Shepherd, enabling new applications for unbiased PTM discovery. PTM-Shepherd automates characterization of PTM profiles detected in open searches based on attributes such as amino acid localization, fragmentation spectra similarity, retention time shifts, and relative modification rates. I show how open searches can be used to profile experimental artifacts by identifying a set of PTMs common across several formalin-fixed paraffin-embedded datasets that researchers can include in future analyses, identifying a range of Cys-specific artifacts in a commonly used high-quality dataset, finding two previously undescribed PTMs in synthetic peptide data and TMT data, and tracing major site-specific PTM batch effects in a multi-university consortium’s proteomics data back to sample processing. In Chapter Three, I extend the algorithm developed in Chapter Two, introducing additional metrics that allow researchers to peer into the spectra of PTMs and extract PTM-specific fragmentation patterns and diagnostic ions. I find new diagnostic for multiple common PTMs, including identifying new fragmentation patterns for glycopeptides under high energy fragmentation, new diagnostic ions for sialic acid under high energy fragmentation, and new diagnostic ions and peptide remainder masses for ADP-ribosylation, as well as examining general trends in the utility of PTM-specific diagnostic features such as the inverse relationship between an ion’s average intensity and its specificity to the modification. In Chapter Four, I expound my methods’ utility by applying it in multiple settings to characterize synthetic and chemical PTMs. In each case, I show how these methods aid in interpretation of results or increase coverage of the proteome by recovering additional modified peptides. For chemoproteomics probes, I demonstrate how expensive isotopic labeling to identify fragmentation patterns can be avoided, finding multiple novel diagnostic ions for a Cys-specific triazole biotin probe. For RNA crosslinked data, I show how the number of recovered identifications increases by up to 50% over existing state of the art methods when incorporating fragmentation information discovered by PTM-Shepherd. Finally, for protein-protein crosslinking, I show how PTM-Shepherd can derive fragmentation patterns for non-cleavable crosslinkers without computationally expensive or custom workflows, discovering that auto crosslinks can be used to identify fragment remainder masses for that can reduce computational complexity during searching. The ability to survey the entire post-translational modification landscape has major implications across proteomics subdisciplines. In total, the work described herein represents a major milestone in the interpretation of open search results and opens the door to better understandings of cellular processes and disease by facilitating new modes of analysis.
dc.language.isoen_US
dc.subjectbioinformatics
dc.subjectproteomics
dc.subjectmass spectrometry
dc.subjectpost-translational modifications
dc.subjectcomputational biology
dc.titleComputational Methods for Characterizing Post-translational and Chemical Modifications Found in Open Searches
dc.typeThesis
dc.description.thesisdegreenamePhDen_US
dc.description.thesisdegreedisciplineBioinformatics
dc.description.thesisdegreegrantorUniversity of Michigan, Horace H. Rackham School of Graduate Studies
dc.contributor.committeememberNesvizhskii, Alexey
dc.contributor.committeememberHakansson, Kristina I
dc.contributor.committeememberKarnovsky, Alla
dc.contributor.committeememberMills, Ryan Edward
dc.contributor.committeememberRao, Arvind
dc.subject.hlbsecondlevelComputer Science
dc.subject.hlbsecondlevelMolecular, Cellular and Developmental Biology
dc.subject.hlbsecondlevelChemistry
dc.subject.hlbsecondlevelScience (General)
dc.subject.hlbtoplevelEngineering
dc.subject.hlbtoplevelHealth Sciences
dc.subject.hlbtoplevelScience
dc.description.bitstreamurlhttp://deepblue.lib.umich.edu/bitstream/2027.42/174422/1/geiszler_1.pdf
dc.identifier.doihttps://dx.doi.org/10.7302/6153
dc.identifier.orcid0000-0002-7691-8534
dc.identifier.name-orcidGeiszler, Daniel; 0000-0002-7691-8534en_US
dc.working.doi10.7302/6153en
dc.owningcollnameDissertations and Theses (Ph.D. and Master's)


Files in this item

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.