Mining Health Data in Multimodal Data Series for Disease Prediction
Keywords:
Data Mining, Disease Prediction, HealthCare, Multimodal, K-means, SVM classificationAbstract
Disease Prediction plays a major role in health care community. With data mining process, disease will be predicted from large number of data. Dataset may be structured or unstructured. If the dataset is unstructured then the latent factor model is used to convert unstructured to structured data and it is very complex to predict a disease using unstructured data. Therefore we use synthetic data, which is structured. We concentrate on different kind of diseases. We propose a convolutional neural network based multimodal disease risk prediction (CNN-MDRP). Here datasets are stored as HER records. K-means clustering algorithm is used to group the datasets. Semi-Supervised Heterogeneous algorithm is applied to grouped data to predict the disease.
References
Min Chen, Yixue Hao, Kai Hwang, Fellow, IEEE, Lu Wang, and Lin Wang*, “Disease Prediction by Machine Learning over Big Data from Healthcare Communities”, vol.5, Pages: 8869 – 8879,2017.
P. B. Jensen, L. J. Jensen, and S. Brunak, “Mining electronic health records: towards better research applications and clinical care,” Nature Reviews Genetics, vol. 13, no. 6, pp. 395–405, 2012.
D. Oliver, F. Daly, F. C. Martin, and M. E. McMurdo, “Risk factors and risk assessment tools for falls in hospital in-patients: a systematic review,” Age and ageing, vol. 33, no. 2, pp. 122–130, 2004.
P. Groves, B. Kayyali, D. Knott, and S. V. Kuiken, “The’big data’revolution in healthcare: Accelerating value and innovation,” 2016.
M. Chen, S. Mao, and Y. Liu, “Big data: A survey,” Mobile Networks and Applications, vol. 19, no. 2, pp. 171–209, 2014.
B. Qian, X. Wang, N. Cao, H. Li, and Y.-G. Jiang, “A relative similarity based method for interactive patient risk prediction,” Data Mining and Knowledge Discovery, vol. 29, no. 4, pp. 1070–1093, 2015.
A. Singh, G. Nadkarni, O. Gottesman, S. B. Ellis, E. P. Bottinger, and J. V. Guttag, “Incorporating temporal ehr data in predictive models for risk stratification of renal function deterioration,” Journal of biomedical
informatics, vol. 53, pp. 220–228, 2015.
J. Wan, S. Tang, D. Li, S. Wang, C. Liu, H. Abbas and A. Vasilakos, “A Manufacturing Big Data Solution for Active Preventive Maintenance”, IEEE Transactions on Industrial Informatics, DOI: 10.1109/TII. 2017.2670505, 2017.
W. Yin and H. Sch¨utze, “Convolutional neural network for paraphrase identification.” in HLT-NAACL, 2015, pp. 901–911.
Nori, H. Kashima, K. Yamashita, H. Ikai, and Y. Imanaka, “Simultaneous modeling of multiple diseases for mortality prediction in acute hospital care,” in Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 2015,pp. 855–864.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors contributing to this journal agree to publish their articles under the Creative Commons Attribution 4.0 International License, allowing third parties to share their work (copy, distribute, transmit) and to adapt it, under the condition that the authors are given credit and that in the event of reuse or distribution, the terms of this license are made clear.