Welcome to Francis Academic Press

Academic Journal of Computing & Information Science, 2021, 4(6); doi: 10.25236/AJCIS.2021.040613.

Handwriting digit classification using PCA-transformed image features

Author(s)

Yingqiang Yuan

Corresponding Author:
Yingqiang Yuan
Affiliation(s)

Washington State University 910 NE Providence CT Emerald Down C103 Pullman WA 99163

Abstract

Machine learning model is capable of achieving an incredible great accuracy by training a large amount of data with many dimensions of features in it. To accomplish such a goal, we are currently in desperate need of powerful computers with CPUs (Computation Processing Unit) and GPUs (Graphic Processing Unit) that are competent of the workload of the training process of those machine learning models. Due to the problem that there are multiple disparate computationally intensive tasks that are lengthy period of time during training phase of machine learning, devices like smartphone and personal computer cannot achieve great efficiency during the training phase of machine learning. Instead of enhancing computational performance of devices and improve the machine learning algorithm, another more feasible way to bypass such problem is to apply dimensionality reduction methods on raw data in hope of reducing the number of features that are passing into the machine learning model during the training phase. By doing so, we can shrink the originally large number of features to several dimensions that is accomplishable for our device to compute and meanwhile, sufficient enough to have a model with acceptable accuracy trained. The purpose of this study to explore and investigate the potential benefits and disadvantages of dimensionality reduction on multiple distinct machine learning algorithm by comparing the performances of each machine learning algorithms when passing the features of raw datasets in and when passing the dataset with the features that survived after dimensionality reduction. The performance will be measured by completing a typical machine learning task: handwriting digit classification. Among all the dimensionalities methods that have been discovered, a traditional and conventional dimensionality reduction method, PCA (Principal Components Analysis), is selected. Such method has been proven successful and efficient in generating the set of data with lower dimension features and apply on an image classification task using several supervised machine learning methods, including KNN (k-nearest neighbors), SVM (Support vector machine) and CNN (Convolutional Neural Network).

Keywords

Dimensionality Reduction, PCA (Principle Components Analysis), KNN (k-nearest neighbors), SVM (support vector machine), CNN(Convolutional Neural Network)

Cite This Paper

Yingqiang Yuan. Handwriting digit classification using PCA-transformed image features. Academic Journal of Computing & Information Science (2021), Vol. 4, Issue 6: 78-81. https://doi.org/10.25236/AJCIS.2021.040613.

References

[1] Yanbin Zheng,Hongxu Yun,Fu Wang,Yong Ding,Yongzhong Huang,Wenfen Liu. Defence Against Adversarial Attacks Using Clustering Algorithm[A]. ICPCSEE Steering Committee.Abstracts of the 5th International Conference of Pioneering Computer Scientists,Engineers and Educators(ICPCSEE 2019)Part I[C].ICPCSEE Steering Committee

[2] Aissam Jadli, Hain Mustapha, Chergui Adil. Handwritten Documents Validation Using Pattern Recognition and Transfer Learning[J]. International Journal of Web - Based Learning and Teaching Technologies, 2022, 17(5):1-13.

[3] Wang Yu-Chun, Chuang Chia-Min, Wu Chun-Kai, et al. Cross-language article linking with deep neural network based paragraph encoding[J]. Computer Speech & Language, 2022, 72