Random Forest Model for Predicting Grayscale Digits on Images
Keywords:
Random forest, grayscale images, RF_classification, RF_regression classification, MNIST datasetAbstract
The random forest is one of the fastest, powerful and easy to use techniques with the ability of performing regression and classification task recorded in the biblical study of machine learning. It grows into forest with some number of decision trees as a hybrid system. The problem of missing data which results from over-fitting can best be handled by random forest. The process places a very big barrier on how well the existing methods and functions. This also affects the success metrics in the process of training, testing and validating classifiers in order to produce a more generalized and accurate results for predicting grayscale digits on images. This model is usually employed in replacing the continuous variables with the median values while computing the correct weighted proximity average of all the missing data values. Our aim is to develop an efficienct random forest model using the classification and regressor classifiers with some high-quality of input data by introducing weighted average and the best in majority vote. This model has been trained and tested using the MNIST dataset with some adjusted hyper-parameter values which defined and improved the general performance of the combined trees in the forest. The simulation was done using python programming language and resulted to 99% and 90% metrics of accuracy in comparison with the random forest classification and regression approach respectively.
References
H. T. Malazi, and M. David, (2018). "Combining Emerging Patterns with Random Forest for Complex Activity Recognition in Smart Homes," Applied Intelligence, Vol. 48, Issue. 2, pp.315-330, 2018.
Y. K. Chen, and J. F. Wang, "Segmentation of Single- or Multiple-Touching handwritting Numerial String Using Background and Foreground Analysis," IEEE Transaction on Pattern Analysis and Machine Intelligence, Vol. 22, Issue. 11, pp.1304-1317, 2000
V. Y. Kulkami, and P. K. Sinha, "random Forest Classifiers: A Survey and Future Research Directions," International Journal of Advanced Computing Vol. 36, Issue. 1, pp.1144-1153,2013.
L. Breiman, "Random Forest," Machine Learning, Vol. 45, Issue. 1, pp.279-290, 2001.
R. Anitha, and S. R. D. Siva, "Development of Computer-aided Approach for Brain Tumor Detection using Random Forest Classifier," International Journal of Imaging Systems and technology, Vol. 28, Issue. 1, pp.48-53,2018.
V. Lepetit, P. Fua, (2006). "Keypoint Recognition Using Randomized trees," IEEE Trans. Pattern Anaysis Machine Intelligence Vol. 28, Issue. 9, 1465-1479, 2006.
S. Bernard, S. Adam, and L. Heutte, " Using Random Forests for Handwritten Digit Recognition," Proceedings of the 9th IAPR/IEEE International Conference on Document Analysis and Recognition(ICDAR), pp.1043-1047, 2007.
N. Arica, and F. T. Yarman-Vural, "Optical Character Recognition for Cursive Handwritting," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 24, Issue. 6, pp.801-113, 2002.
L. Kuncheva, "Combining Pattern Recognition: Methods and Algorithms," John Wiley and Sons, pp.60-75, 2004.
Y. K. Vrushali, and K. S. Pradeep, "Effective Learning and Classification Using Random Forest Algorithm," International Journal of Engineering and Innovative Technology(IJEIT) Vol. 3, Issue. 11, pp.267-273, 2014.
A. Jehad, K. Rehanulla, A. Nasir, and M. Imran, "Random Forest and Decision Trees," International Journal of Computer Science(IJCS), Vol. 9, Issue. 5, pp.273-278, 2012
H. Mishra, S. K. Pathak, and A. Srivastava, "Handwritten Digits Recognition Using Machine Learning Algorithms", International Research Journal of Engineering and Technology(IRJET), Vol. 07, Issue. 05, pp.8073-8079,2020.
V. Garg, "Handwritten Digit Classification Using Machine Learning Models," International Research Journal of Engineering and Technology(IRJET), Vol. 06, Issue. 11, pp.3593-3597, 2019.
K. N. H. Al-Behadili, "Classification Algorithms for Determining Handwritten Digit," Iraqi Journal for Electrical and Electronic Engineering(IJEEE), Vol. 12, Issue. 1, pp.96-102, 2016.
V. P. Dhaka, and M. K. Sharma, "An Efficient Segmentation Technique for Devanagari Offline-Handwritten Scripts using the Feedforward Neural Network," Neural Computing and Applications, Vol. 26, Issue. 8, pp.1881-1893, 2015.
A. Singh, and R. Sathyaraj, "A Comparison Between Classification Algorithms on Different Datasets Methodologies using Rapidminer," International Journal of Advanced Research in Computer and Communication Engineering(IJARCCE), Vol. 5, Issue 5, pp.560-563, 2016.
R. R. Rani, P. Manikadan, and R. D. Chitra, "An Empirical Analysis of Classification Trees Algorithm for Protein Datasets," International Journal of Advanced Research in Computer Science(IJARCS), Vol. 6, Issue. 6, pp.34-39, 2015.
Y. Chherawala, P. P. Roy, and M. Cheriet, "Feature set Evaluation for Offline Handwriting Recognition systems: Application to the Recurrent Neural Network Model," IEEE transactions on cybernetics, Vol. 46, Issue. 12, pp.2825-2836, 2016.
S. Joseph, and J. George, "Feature Extraction and Classification Techniques of MODI Script Character Recognition," Pertanika Journal of Science and Technology(PJST), Vol. 27, Issue. 4, pp.1649 - 1669, 2019
R. Anushri, N. Ashitha, and Mamath, "Digit Recohnition," International Advanced Research Journal in Science, Engineering and Technology(IARJSET), Vol. 4, Issue. 8, pp.44-50, 2017.
G. Pirlo, and D. Impedovo, "Fuzzy Zoning Based Classification for Handwritten Characters," IEEE Transaction on Pattern Recognition and Machine Intelligence, Vol. 19, Issue. 04, pp.780-785, 2011.
S. Vinneet, and P. L. Sunil, “Digits recognition using single layer neural Network with principal component analysis,” Computer Science and Engineering (APWC on CSE), Asia-Pacific World Congress IEEE, pp.4-5, 2014.
J. Puzicha, M. Held, J. Ketterer, J. M. Buhmann, and D. W. Fellner, "On Spatial Quantization of Color Images," IEEE Transactions on Image Processing 9, pp.666–682, 2000.
T. Horiuchi, "Grayscale Image Segmentation using Color Space," Institute of Electronics, Information and Communication Engineers(IEICE) Trans. Infomation System, Vol. E89-D, Issue. 3, pp.1231-1235, 2006.
S. Ziweritin, C. C. Ukegbu, and E. U. Ezeorah, Building Data Mining Classification Model for Pixilated Digit Recognition System," SSRG International Journal of Computer Science and Engineering(IJCSE), Vol. 7, Issue. 10, pp.6-12, 2020.
A. Liaw, and M. Wiener, "Classification and Regression by Random Forest," R News, Vol. 2, Issue. 3, pp.18–22, 2002
L. Kuncheva, and C. Whitaker, "Measures of Diversity in Classifier Ensembles and their Relationship with the Ensemble Accuracy," Machine learning, Vol. 51. Issue. 2, pp.181–207, 2003.
S. Talla, P. Venigalla, A. Shaik, and M. Vuyyuru, "Multiclass Classification Using Random Forest Classifier," International Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), Vol. 5, Issue. 02, pp.493- 496, 2019.
J. J. Rodriguez, L. I. Kuncheva, and C. J. Alonso, "Rotation forest: A New Classifier Ensemble Method," IEEE Transactions on Pattern Analysis and Machine Intelligence(TPAMI), Vol. 28, Issue. 10, pp.1619 - 1630, 2006.
E. Kremic, and A. Subasi, "Performance of Random Forest and SVM in Face Recognition," International Arab Journal of Information Technology(IAJI), Vol. 13, Issue. 2, pp.288-292, 2016.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors contributing to this journal agree to publish their articles under the Creative Commons Attribution 4.0 International License, allowing third parties to share their work (copy, distribute, transmit) and to adapt it, under the condition that the authors are given credit and that in the event of reuse or distribution, the terms of this license are made clear.