2018 IEEE International Conference on Healthcare Informatics (ICHI) (2018)
New York City, NY, USA
Jun 4, 2018 to Jun 7, 2018
In this paper, we address the problem of research data availability and access in the healthcare sector, by proposing Virtual Patient Model (VPM) a process that combines the use of an optimization algorithm, statistical analysis and machine learning techniques to generate synthetic time series data and report their effectiveness in predictive models. We validate the proposed model by implementing a genetic algorithm that captures important features of a real-world patient time-series data (original) and in applying constraints in the generative process, outputs a best fit synthetic candidate solution of the original time series data. Experimental results using statistical verification tests on both the synthetic and original datasets showed that the synthetic dataset preserved features from the original dataset. We used machine learning prediction models that integrated the synthetic dataset into classification learners and compare their outcomes with those from learners trained with the original dataset. We found promising results in machine learner's ability to discriminate between the different classes when synthetic data is used for training.
forecasting theory, genetic algorithms, health care, learning (artificial intelligence), medical administrative data processing, regression analysis, statistical analysis, time series
R. Shamsuddin, B. M. Maweu, M. Li and B. Prabhakaran, "Virtual Patient Model: An Approach for Generating Synthetic Healthcare Time Series Data," 2018 IEEE International Conference on Healthcare Informatics (ICHI), New York City, NY, USA, 2018, pp. 208-218.