Optimization and Scalability of Collaborative Filtering Algorithms in Large Language Models

<p>Haowei Yang<sup>1</sup>, Longfei Yun<sup>2</sup>, Jinghan Cao<sup>3</sup>, Qingyi Lu<sup>4</sup>, Yuming Tu<sup>5</sup></p>

doi:10.25236/AJCIS.2024.071201

Academic Journal of Computing & Information Science, 2024, 7(12); doi: 10.25236/AJCIS.2024.071201.

Optimization and Scalability of Collaborative Filtering Algorithms in Large Language Models

Author(s)

Haowei Yang¹, Longfei Yun², Jinghan Cao³, Qingyi Lu⁴, Yuming Tu⁵

Corresponding Author:

Haowei Yang

Affiliation(s)

¹Industrial Engineering, University of Houston, Houston, USA

²Computer Science,University of California San Diego, San Diego, USA

³Computer Science, San Francisco State University, Seattle, USA

⁴Computer Science,Brown University, Providence, USA

⁵Independent Researcher, New Jersey, USA

Download PDF
|
Download: 43
|
View: 3021

Abstract

With the rapid development of large language models (LLMs) and the growing demand for personalized content, recommendation systems have become critical in enhancing user experience and driving engagement. Collaborative filtering algorithms, being core to many recommendation systems, have garnered significant attention for their efficiency and interpretability. However, traditional collaborative filtering approaches face numerous challenges when integrated into large-scale LLM-based systems, including high computational costs, severe data sparsity, cold start problems, and lack of scalability. This paper investigates the optimization and scalability of collaborative filtering algorithms in large language models, addressing these limitations through advanced optimization strategies. Firstly, we analyze the fundamental principles of collaborative filtering algorithms and their limitations when applied in LLM-based contexts. Next, several optimization techniques such as matrix factorization, approximate nearest neighbor search, and parallel computing are proposed to enhance computational efficiency and model accuracy. Additionally, strategies such as distributed architecture and model compression are explored to facilitate dynamic updates and scalability in data-intensive environments.

Keywords

Collaborative filtering algorithm; large language models; recommendation systems; algorithm optimization; scalability

Cite This Paper

Haowei Yang, Longfei Yun, Jinghan Cao, Qingyi Lu, Yuming Tu. Optimization and Scalability of Collaborative Filtering Algorithms in Large Language Models. Academic Journal of Computing & Information Science (2024), Vol. 7, Issue 12: 1-8. https://doi.org/10.25236/AJCIS.2024.071201.

References

[1] Huang B, Lu Q, Huang S, et al. Multi-modal clothing recommendation model based on large model and VAE enhancement[J]. arXiv preprint arXiv:2410.02219, 2024.

[2] Huang S, Yang H, Yao Y, et al. Deep adaptive interest network: personalized recommendation with context-aware learning[J]. arXiv preprint arXiv:2409.02425, 2024.

[3] Wu Z. An efficient recommendation model based on knowledge graph attention-assisted network (kgatax)[J]. arXiv preprint arXiv:2409.15315, 2024.

[4] Xiang A, Huang B, Guo X, et al. A neural matrix decomposition recommender system model based on the multimodal large language model[J]. arXiv preprint arXiv:2407.08942, 2024.

[5] Xiang, Ao, et al. "A Multimodal Fusion Network For Student Emotion Recognition Based on Transformer and Tensor Product." arXiv preprint arXiv:2403.08511 (2024).

[6] Liu, Jiabei, et al. "Application of deep learning-based natural language processing in multilingual sentiment analysis." Mediterranean Journal of Basic and Applied Sciences (MJBAS) 8.2 (2024): 243-260.

[7] Zhao Q, Hao Y, Li X. Stock price prediction based on hybrid CNN-LSTM model[J]. 2024.

[8] Gao D, Shenoy R, Yi S, et al. Synaptic resistor circuits based on Al oxide and Ti silicide for concurrent learning and signal processing in artificial intelligence systems[J]. Advanced Materials, 2023, 35(15): 2210484.

[9] Wu X, Sun Y, Liu X. Multi-class classification of breast cancer gene expression using PCA and XGBoost[J]. 2024.

[10] Xu Q, Wang T, Cai X. Energy market price forecasting and financial technology risk management based on generative AI[J]. 2024.

[11] Diao S, Wei C, Wang J, et al. Ventilator pressure prediction using recurrent neural network[J]. arXiv preprint arXiv:2410.06552, 2024.

[12] Zhao Y, Hu B, Wang S. Prediction of brent crude oil price based on lstm model under the background of low-carbon transition[J]. arXiv preprint arXiv:2409.12376, 2024.

[13] Zhang W, Huang J, Wang R, et al. Integration of Mamba and Transformer--MAT for Long-Short Range Time Series Forecasting with Application to Weather Dynamics[J]. arXiv preprint arXiv:2409.08530, 2024.

[14] Mo K, Chu L, Zhang X, et al. DRAL: Deep reinforcement adaptive learning for multi-UAVs navigation in unknown indoor environment[J]. arXiv preprint arXiv:2409.03930, 2024.

[15] Tang X, Wang Z, Cai X, et al. Research on heterogeneous computation resource allocation based on data-driven method[C]//2024 6th International Conference on Data-driven Optimization of Complex Systems (DOCS). IEEE, 2024: 916-919.

[16] Ma D, Yang Y, Tian Q, et al. Comparative analysis of x-ray image classification of pneumonia based on deep learning algorithm[J]. Theoretical and Natural Science, 2024, 56: 52-59.

[17] Wang L, Cheng Y, Xiang A, et al. Application of Natural Language Processing in Financial Risk Detection[J]. arXiv preprint arXiv:2406.09765, 2024.

[18] Zhang J, Zhang W, Tan C, et al. YOLO-PPA based efficient traffic sign detection for cruise control in autonomous driving[J]. arXiv preprint arXiv:2409.03320, 2024.

[19] Li X, Cao H, Zhang Z, et al. Artistic Neural Style Transfer Algorithms with Activation Smoothing[J]. arXiv preprint arXiv:2411.08014, 2024.

[20] Yang H, Sui M, Liu S, et al. Research on Key Technologies for Cross-Cloud Federated Training of Large Language Models[J]. arXiv preprint arXiv:2410.19130, 2024.

[21] Li X, Wang X, Qi Z, et al. DTSGAN: Learning Dynamic Textures via Spatiotemporal Generative Adversarial Network[J]. Academic Journal of Computing & Information Science, 7(10): 31-40.

[22] Qi Z, Ding L, Li X, et al. Detecting and Classifying Defective Products in Images Using YOLO[J]. Academic Journal of Computing & Information Science, 7(10): 47-54.

[23] Yang H, Cheng Z, Zhang Z, et al. Analysis of Financial Risk Behavior Prediction Using Deep Learning and Big Data Algorithms[J]. arXiv preprint arXiv:2410.19394, 2024.

[24] Wu Z. Mpgaan: Effective and efficient heterogeneous information network classification[J]. Journal of Computer Science and Technology Studies, 2024, 6(4): 08-16.

[25] Wang Z, Chen Y, Wang F, et al. Improved Unet model for brain tumor image segmentation based on ASPP-coordinate attention mechanism[J]. arXiv preprint arXiv:2409.08588, 2024.

[26] Wu Z. Deep learning with improved metaheuristic optimization for traffic flow prediction[J]. Journal of Computer Science and Technology Studies, 2024, 6(4): 47-53.

[27] Qu M. High precision measurement technology of geometric parameters based on binocular stereo vision application and development prospect of the system in metrology and detection[J]. Journal of Computer Technology and Applied Mathematics, 2024, 1(3): 23-29.

[28] Xu L, Liu J, Zhao H, et al. Autonomous navigation of unmanned vehicle through deep reinforcement learning[J]. arXiv preprint arXiv:2407.18962, 2024.

[29] Xiang J, Chen J. Imitation Learning-Based Convex Approximations of Probabilistic Reachable Sets[C]//AIAA AVIATION FORUM AND ASCEND 2024. 2024: 4356.