Preventive healthcare policies in the US: solutions for disease management using Big Data Analytics

Batarseh, Feras A.; Ghassib, Iya; Chong, Deri (Sondor); Su, Po-Hsuan

Preventive healthcare policies in the US: solutions for disease management using Big Data Analytics

dc.contributor.author	Batarseh, Feras A.
dc.contributor.author	Ghassib, Iya
dc.contributor.author	Chong, Deri (Sondor)
dc.contributor.author	Su, Po-Hsuan
dc.date.accessioned	2022-08-10T18:49:42Z
dc.date.available	2022-08-10T18:49:42Z
dc.date.issued	2020-06-23
dc.identifier.citation	Journal of Big Data. 2020 Jun 23;7(1):38
dc.identifier.uri	https://doi.org/10.1186/s40537-020-00315-8
dc.identifier.uri	https://hdl.handle.net/2027.42/174005	en
dc.description.abstract	Abstract Data-driven healthcare policy discussions are gaining traction after the Covid-19 outbreak and ahead of the 2020 US presidential elections. The US has a hybrid healthcare structure; it is a system that does not provide universal coverage, albeit few years ago enacted a mandate (Affordable Care Act-ACA) that provides coverage for the majority of Americans. The US has the highest health expenditure per capita of all western and developed countries; however, most Americans don’t tap into the benefits of preventive healthcare. It is estimated that only 8% of Americans undergo routine preventive screenings. On a national level, very few states (15 out of the 50) have above-average preventive healthcare metrics. In literature, many studies focus on the cure of diseases (research areas such as drug discovery and disease prediction); whilst a minority have examined data-driven preventive measures—a matter that Americans and policy makers ought to place at the forefront of national issues. In this work, we present solutions for preventive practices and policies through Machine Learning (ML) methods. ML is morally neutral, it depends on the data that train the models; in this work, we make the case that Big Data is an imperative paradigm for healthcare. We examine disparities in clinical data for US patients by developing correlation and imputation methods for data completeness. Non-conventional patterns are identified. The data lifecycle followed is methodical and deliberate; 1000+ clinical, demographical, and laboratory variables are collected from the Centers for Disease Control and Prevention (CDC). Multiple statistical models are deployed (Pearson correlations, Cramer’s V, MICE, and ANOVA). Other unsupervised ML models are also examined (K-modes and K-prototypes for clustering). Through the results presented in the paper, pointers to preventive chronic disease tests are presented, and the models are tested and evaluated.
dc.title	Preventive healthcare policies in the US: solutions for disease management using Big Data Analytics
dc.type	Journal Article
dc.description.bitstreamurl	http://deepblue.lib.umich.edu/bitstream/2027.42/174005/1/40537_2020_Article_315.pdf
dc.identifier.doi	https://dx.doi.org/10.7302/5736
dc.language.rfc3066	en
dc.rights.holder	The Author(s)
dc.date.updated	2022-08-10T18:49:41Z
dc.owningcollname	Interdisciplinary and Peer-Reviewed

Files in this item

Name:: 40537_2020_Article_315.pdf
Size:: 2.956MB
Format:: PDF

View/Open

Interdisciplinary and Peer-Reviewed

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.