Semester
Fall
Date of Graduation
2022
Document Type
Thesis
Degree Type
MS
College
Statler College of Engineering and Mineral Resources
Department
Lane Department of Computer Science and Electrical Engineering
Committee Chair
Donald Adjeroh
Committee Co-Chair
James Bardes
Committee Member
James Bardes
Committee Member
Gianfranco Doretto
Abstract
In the United States, cardiovascular disease is the leading cause of death for both men and women. The factors that contribute to heart disease are well known but it is still an ever-evolving field of research. The possible benefits to being able to determine an individual’s Health Risk could provide the early insights needed to combat cardiovascular disease. However, two major problems within this area are the public availability of a dataset and the label imbalances for the dataset. In this thesis, we present a framework for exploring health risk using a combination of the NHANES dataset and open-source machine learning tools. We apply this framework to both cardiovascular disease risk detection and hypertension risk detection. To address the challenge of data imbalance, we also explore different sampling strategies built on a novel modification to KMeans Under-Sampling. Results are included to demonstrate the performance of the proposed methods.
Recommended Citation
Cole, Levi Butcher, "Machine Learning Approach to Health Risk Determination" (2022). Graduate Theses, Dissertations, and Problem Reports. 11456.
https://researchrepository.wvu.edu/etd/11456
Embargo Reason
Publication Pending