Custom Word Embeddings for Sentiment Analysis

Murray, T.; Nair, V.; Annamalai, A.; Shah, P.; Mo, E.; Jin, S.; Gogia, R.

Custom Word Embeddings for Sentiment Analysis

dc.contributor.author	Murray, T.
dc.contributor.author	Nair, V.
dc.contributor.author	Annamalai, A.
dc.contributor.author	Shah, P.
dc.contributor.author	Mo, E.
dc.contributor.author	Jin, S.
dc.contributor.author	Gogia, R.
dc.date.accessioned	2021-04-29T19:12:17Z
dc.date.available	2021-04-29T19:12:17Z
dc.date.issued	2020
dc.identifier.uri	https://hdl.handle.net/2027.42/167248
dc.identifier.uri	https://youtu.be/WzIbeCQQ7n8
dc.description.abstract	Our work analyzed the relationship between the domain type of the word embeddings used to create sentiment analysis models and the domain of the sentiment analysis task. We used 3 corpora, (NYT, LION poems, and Book blurbs) to create our custom word embeddings and developed 9 different sentiment analysis models from our best word embeddings for each of the corpora. To compare our custom word embeddings against, we used 8 other pre-trained sentiment models including XLNet’s model, BERT (6 variations), and a simple skip-gram model used with Google vectors. Though we were hoping to prove that there was a clear advantage to using similar domains for the word embeddings as the dataset your sentiment model would be tested on, we concluded that more generally trained embeddings will outperform in-domain word embeddings.
dc.subject	machine learning
dc.subject	sentiment analysis
dc.subject	custom corpora
dc.title	Custom Word Embeddings for Sentiment Analysis
dc.type	Technical Report
dc.subject.hlbtoplevel	Engineering
dc.contributor.affiliationum	Computer Science Engineering
dc.description.bitstreamurl	http://deepblue.lib.umich.edu/bitstream/2027.42/167248/1/Honors_Capstone_Murray_ProQuest_Sentiment_Analysis-Taylor_Murray.pdf
dc.description.bitstreamurl	http://deepblue.lib.umich.edu/bitstream/2027.42/167248/2/Honors_Capstone_Presentation_Murray_ProQuest_Sentiment_Analysis-Taylor_Murray.pdf
dc.identifier.doi	https://dx.doi.org/10.7302/923
dc.working.doi	10.7302/923	en
dc.owningcollname	Honors Program, The College of Engineering

Files in this item

Name:: Honors_Capstone_Murray_ProQues ...
Size:: 1.869MB
Format:: PDF
Description:: Honors Capstone Final Report

View/Open

Name:: Honors_Capstone_Presentation_M ...
Size:: 1.203MB
Format:: PDF

View/Open

Honors Program, The College of Engineering

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.