Show simple item record

Neural Language Generation for Content Adaptation: Explainable, Efficient Low-Resource Text Simplification and Evaluation

dc.contributor.authorGarbacea, Georgeta-Cristina
dc.date.accessioned2023-09-22T15:38:35Z
dc.date.available2023-09-22T15:38:35Z
dc.date.issued2023
dc.date.submitted2023
dc.identifier.urihttps://hdl.handle.net/2027.42/178028
dc.description.abstractThere are rich opportunities to reduce the language complexity of professional content (either human-written or computer-generated) and make it accessible to a broad audience. As a sub-task of natural language generation (NLG), text simplification has considerable potential to improve the fairness and transparency of text information systems. Recent approaches to text simplification usually complete the task in an end-to-end fashion, employing neural machine translation models in a monolingual setting regardless of the type of simplifications to be done. These models are limited on the one hand due to the absence of large-scale parallel (complex → simple) monolingual training data, and on the other hand due to the lack of interpretability of their black-box procedures. Furthermore, despite fast development of algorithms, there is an urgency to fill the huge gap in evaluating NLG systems in general (including text simplification systems). Indeed, given no clear model of text quality and no agreed objective criterion for comparing the “goodness of texts”, the evaluation of NLG systems is inherently difficult. The present work addresses these problems: i) sample-efficient approaches to NLG that improve the fairness and transparency of text information systems by adapting their content to the literacy level of the target audience, ii) systematic analysis of evaluation metrics for NLG models informed by theory and empirical evidence. In particular, we show that text simplification can be decomposed into a compact pipeline of tasks to ensure the transparency and explainability of the process; low-resource text simplification can be framed from a task and domain adaptation perspective which can be decomposed into multiple adaptation steps via meta-learning and transfer learning; and evaluators for NLG can be evaluated at scale and compared with human judgements. Beyond the problem of low-resource text simplification, the methodology proposed in this dissertation (explainable decomposition, chain of adaptations to new tasks and domains, and meta-evaluation) may benefit other research areas related to generative artificial intelligence (AI).
dc.language.isoen_US
dc.subjectNeural Language Generation
dc.subjectLow-Resource Text Simplification
dc.subjectContent Adaptation
dc.subjectExplainable Prediction of Text Complexity
dc.subjectNatural Language Evaluation
dc.subjectArtificial Intelligence
dc.titleNeural Language Generation for Content Adaptation: Explainable, Efficient Low-Resource Text Simplification and Evaluation
dc.typeThesis
dc.description.thesisdegreenamePhDen_US
dc.description.thesisdegreedisciplineComputer Science & Engineering
dc.description.thesisdegreegrantorUniversity of Michigan, Horace H. Rackham School of Graduate Studies
dc.contributor.committeememberMei, Qiaozhu
dc.contributor.committeememberCollins-Thompson, Kevyn
dc.contributor.committeememberChai, Joyce
dc.contributor.committeememberMower Provost, Emily
dc.contributor.committeememberWang, Lu
dc.subject.hlbsecondlevelComputer Science
dc.subject.hlbtoplevelEngineering
dc.description.bitstreamurlhttp://deepblue.lib.umich.edu/bitstream/2027.42/178028/1/garbacea_1.pdf
dc.identifier.doihttps://dx.doi.org/10.7302/8485
dc.identifier.orcid0000-0001-5340-594X
dc.identifier.name-orcidGarbacea, Georgeta-Cristina; 0000-0001-5340-594Xen_US
dc.working.doi10.7302/8485en
dc.owningcollnameDissertations and Theses (Ph.D. and Master's)


Files in this item

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.