Towards an Algorithmic Account of Phonological Rules and Representations
dc.contributor.author | Belth, Caleb | |
dc.date.accessioned | 2023-09-22T15:32:10Z | |
dc.date.available | 2023-09-22T15:32:10Z | |
dc.date.issued | 2023 | |
dc.date.submitted | 2023 | |
dc.identifier.uri | https://hdl.handle.net/2027.42/177942 | |
dc.description.abstract | The development of computer science in the middle of the twentieth century provided a valuable tool for the study of language as a cognitive system, by allowing linguistic theories to be stated in computational terms. The resulting theories have traditionally placed emphasis on describing the space of possible human languages, and viewed this delineated space as antecedent to a theory of how such a language might be learned from linguistic data. In the domain of phonology—the study of the structure of linguistic sound—this dissertation takes steps approaching the problem from the opposite direction, by framing the problem as that of identifying the learning procedure(s) by which humans construct a language in response to linguistic exposure. The object of study is shifted from the investigation of how a learner will discover a supposed target grammar, to the investigation of the ontogenetic process by which humans develop computational, phonological systems. The proposed algorithmic approach identifies independently-established psychological mechanisms available to a learner, and then uses these as the components of a hypothesized learning procedure. The dissertation includes an algorithmic account of how graph-based representations of words, which render long-distance dependencies as local in that graph structure and are known as phonological tiers, arise naturally from a learning algorithm sensitive to only adjacent dependencies. The dissertation also proposes an algorithmic account of when abstract representations of morphemes are needed for effective generalization to unseen words in the face of the sparsity of linguistic input, and how rules can be constructed to map between these abstract representations and their concrete realizations. Stated in explicit, computational terms, the proposed learning system is evaluated on realistic natural language data, and makes precise, testable predictions. The learner constructs accurate linguistic generalizations from naturalistic data: across languages evaluated, the learner achieves, on average, 0.96 accuracy on held-out test words, and never lower than 0.92. These results are achieved with training data of no more than a thousand words. Moreover, the models' predictions are consistently borne out in developmental predictions and experimental settings, including a novel experiment carried out to directly test this model. When compared to a prominent alternative learning-based model—neural networks—the proposed model achieves higher accuracy, while producing comparatively interpretable outputs, and—critically—providing an intelligible algorithm, which brings greater understanding to the mechanisms underlying phonological development. | |
dc.language.iso | en_US | |
dc.subject | phonology | |
dc.subject | computational modeling | |
dc.subject | language acquisition | |
dc.subject | low-resource NLP | |
dc.subject | representations | |
dc.subject | psycholinguistics | |
dc.title | Towards an Algorithmic Account of Phonological Rules and Representations | |
dc.type | Thesis | |
dc.description.thesisdegreename | PhD | en_US |
dc.description.thesisdegreediscipline | Computer Science & Engineering | |
dc.description.thesisdegreegrantor | University of Michigan, Horace H. Rackham School of Graduate Studies | |
dc.contributor.committeemember | Coetzee, Andries W | |
dc.contributor.committeemember | Koutra, Danai | |
dc.contributor.committeemember | Lewis, Richard L | |
dc.contributor.committeemember | Wang, Lu | |
dc.contributor.committeemember | Yang, Charles | |
dc.subject.hlbsecondlevel | Computer Science | |
dc.subject.hlbsecondlevel | Linguistics | |
dc.subject.hlbtoplevel | Engineering | |
dc.subject.hlbtoplevel | Humanities | |
dc.description.bitstreamurl | http://deepblue.lib.umich.edu/bitstream/2027.42/177942/1/cbelth_1.pdf | |
dc.identifier.doi | https://dx.doi.org/10.7302/8399 | |
dc.identifier.orcid | 0000-0002-6256-3381 | |
dc.identifier.name-orcid | Belth, Caleb; 0000-0002-6256-3381 | en_US |
dc.working.doi | 10.7302/8399 | en |
dc.owningcollname | Dissertations and Theses (Ph.D. and Master's) |
Files in this item
Remediation of Harmful Language
The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.
Accessibility
If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.