Show simple item record

Detecting Machine-obfuscated Plagiarism

dc.contributor.authorFoltynek, Tomas
dc.contributor.authorRuas, Terry
dc.contributor.authorScharpf, Philipp
dc.contributor.authorMeuschke, Norman
dc.contributor.authorSchubotz, Moritz
dc.contributor.authorGrosky, William
dc.contributor.authorGipp, Bela
dc.date.accessioned2019-12-13T13:52:54Z
dc.date.available2019-12-13T13:52:54Z
dc.date.issued2019-12-13
dc.identifier.urihttps://hdl.handle.net/2027.42/152346
dc.descriptionRelated dataset is at https://doi.org/10.7302/bewj-qx93 and also listed in the dc.relation field of the full item record.
dc.description.abstractResearch on academic integrity has identified online paraphrasing tools as a severe threat to the effectiveness of plagiarism detection systems. To enable the automated identification of machine-paraphrased text, we make three contributions. First, we evaluate the effectiveness of six prominent word embedding models in combination with five classifiers for distinguishing human-written from machine-paraphrased text. The best performing classification approach achieves an accuracy of 99.0% for documents and 83.4% for paragraphs. Second, we show that the best approach outperforms human experts and established plagiarism detection systems for these classification tasks. Third, we provide a Web application that uses the best performing classification approach to indicate whether a text underwent machine-paraphrasing. The data and code of our study are openly available.en_US
dc.language.isoen_USen_US
dc.relationhttps://doi.org/10.7302/bewj-qx93
dc.titleDetecting Machine-obfuscated Plagiarismen_US
dc.typeConference Paperen_US
dc.subject.hlbsecondlevelComputer Science
dc.subject.hlbtoplevelEngineering
dc.description.peerreviewedPeer Revieweden_US
dc.contributor.affiliationumUniversity of Michigan Dearbornen_US
dc.contributor.affiliationumUniversity of Michigan Dearbornen_US
dc.contributor.affiliationotherUniversity of Wuppertal, Mendel University in Brnoen_US
dc.contributor.affiliationotherUniversity of Wuppertalen_US
dc.contributor.affiliationotherUniversity of Konstanzen_US
dc.contributor.affiliationotherUniversity of Wuppertal, University of Konstanzen_US
dc.contributor.affiliationotherUniversity of Wuppertalen_US
dc.contributor.affiliationotherUniversity of Wuppertalen_US
dc.contributor.affiliationumcampusDearbornen_US
dc.description.bitstreamurlhttps://deepblue.lib.umich.edu/bitstream/2027.42/152346/1/Foltynek2020_Paraphrase_Detection.pdf
dc.identifier.orcid0000-0001-8412-5553en_US
dc.identifier.orcid0000-0002-9440-780Xen_US
dc.identifier.orcid0000-0002-4212-0508en_US
dc.identifier.orcid0000-0003-4648-8198en_US
dc.identifier.orcid0000-0001-7141-4997en_US
dc.identifier.orcid0000-0002-2775-2806en_US
dc.identifier.orcid0000-0001-6522-3019en_US
dc.description.filedescriptionDescription of Foltynek2020_Paraphrase_Detection.pdf : Foltynek2020_Paraphrase_Detection
dc.identifier.name-orcidSchubotz, Moritz; 0000-0001-7141-4997en_US
dc.identifier.name-orcidFoltýnek, Tomáš; 0000-0001-8412-5553en_US
dc.identifier.name-orcidMeuschke, Norman; 0000-0003-4648-8198en_US
dc.identifier.name-orcidGrosky, William; 0000-0002-2775-2806en_US
dc.identifier.name-orcidRuas, Terry; 0000-0002-9440-780Xen_US
dc.identifier.name-orcidScharpf, Philipp; 0000-0002-4212-0508en_US
dc.identifier.name-orcidGipp, Béla; 0000-0001-6522-3019en_US
dc.owningcollnameComputer and Information Science, Department of (UM-Dearborn)


Files in this item

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.