Welcome to Francis Academic Press

Academic Journal of Engineering and Technology Science, 2024, 7(3); doi: 10.25236/AJETS.2024.070325.

An Incremental Clustering Algorithm for Arbitrary Shaped in IoT Data


Tianzhen Chen1,2

Corresponding Author:
Tianzhen Chen

1EIT Data Science and Communication College, Zhejiang Yuexiu University of Foreign Languages, Shaoxing, China

2School of Information Industry, Meizhouwan Vocational Technical College, Putian, China


With the rapid development of Internet of Things (IoT) technology, a vast number of IoT devices continuously generate large and ever-growing datasets, which typically exhibit high-dimensional and dynamically changing characteristics. To effectively manage these data, this study introduces a novel incremental clustering algorithm for arbitrary shaped in IoT data named Arbitrary Shaped Incremental Clustering (ASIC). The ASIC algorithm adapts to ongoing changes in data distributions by updating clustering results in real-time, thus effectively managing dynamic data without the need to reprocess the entire dataset. By comparing with several existing clustering algorithms, this study demonstrates the advantages of ASIC in handling large-scale and dynamic datasets. Experiments conducted on multiple datasets, including AWS IoT, show that the ASIC algorithm excels in clustering quality and operational efficiency. This algorithm is particularly suitable for applications requiring rapid response to environmental changes and real-time updates of data processing results.


Data Mining, IoT Data, Incremental Learning, Clustering Analysis, Data Analysis

Cite This Paper

Tianzhen Chen. An Incremental Clustering Algorithm for Arbitrary Shaped in IoT Data. Academic Journal of Engineering and Technology Science (2024) Vol. 7, Issue 3: 178-188. https://doi.org/10.25236/AJETS.2024.070325.


[1] Ullah, Inam, et al. "Data Science Meets Intelligent Internet of Things." Future Communication Systems Using Artificial Intelligence, Internet of Things and Data Science. 2024, CRC Press 73-91.

[2] Shen, Li, et al. "On efficient training of large-scale deep learning models: A literature review." arXiv preprint arXiv:2304.03589, 2023.

[3] Zhang, C., et al. "Deep Learning-Integrated Incremental Clustering for Streaming Data." IEEE Transactions on Neural Networks and Learning Systems, 2022.

[4] Naghib, Arezou, et al. "A comprehensive and systematic literature review on the big data management techniques in the internet of things." Wireless Networks, 29.3, 2023: 1085-1144.

[5] Oyewole, Gbeminiyi John, and George Alex Thopil. "Data clustering: application and trends." Artificial Intelligence Review, 56.7, 2023: 6439-6475.

[6] Yang, Xiaoqing. "Research on incremental clustering algorithm for big data." Applied Mathematics and Nonlinear Sciences, 8.2, 2023: 169-180.

[7] Chefrour, Aida. "A modified Incremental Density Based Clustering Algorithm." 2022 7th International Conference on Image and Signal Processing and their Applications (ISPA). IEEE, 2022.

[8] Inoubli, Wissem, et al. "A distributed and incremental algorithm for large-scale graph clustering." Future Generation Computer Systems, 134, 2022: 334-347.

[9] Zhang, Yuejin, Jiajia Zhang, and Dongmei Zhao. "Text clustering incremental algorithm in sensitive topic detection." Int J Inf Commun Sci, 3, 2018: 88.

[10] Yang, Xiaoqing. "Research on incremental clustering algorithm for big data." Applied Mathematics and Nonlinear Sciences, 8.2, 2023: 169-180.

[11] Eidheim, Ole Christian. "Online Arbitrary Shaped Clustering through Correlated Gaussian Functions." arXiv preprint arXiv:2302.06335, 2023.

[12] Aws-Samples. Aws-samples/aws-iotexamples. https://github.com/aws-samples/aws-iotexamples /tree/master/predictionDataSimulator, 2016.

[13] Dookeran, Keith A., et al. "Associations of two-pore domain potassium channels and triple negative breast cancer subtype in The Cancer Genome Atlas: systematic evaluation of gene expression and methylation." BMC research notes, 10, 2017: 1-9.

[14] Chen, Weijie, et al. "Self-supervised noisy label learning for source-free unsupervised domain adaptation." 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2022.

[15] Pinto, Joylin Priya, Soumya Kelur, and Jyothi Shetty. "Iris flower species identification using machine learning approach." 2018 4th International Conference for Convergence in Technology (I2CT). IEEE, 2018.

[16] Kumar, Manish, et al. "Quality assessment and monitoring of river water using IoT infrastructure." IEEE Internet of Things Journal, 2023.

[17] Huang, Bin, et al. "Review of data-driven prognostics and health management techniques: lessions learned from PHM data challenge competitions." Machine Failure Prevention Technology, 2017, 1-17.