Abstract
Accurate and reliable bearing fault diagnosis is critical for ensuring the safe operation of mechanical equipment. Previous data-driven methods encounter challenges in training advanced deep learning models, primarily due to the scarcity of fault data and inconsistencies in data distributions. Additionally, these methods often suffer from limited interpretability and reliability, as they lack constraint-guided learning based on the physical mechanisms underlying bearing failures, which hampers their utility in machine condition monitoring. Recent advancements in large language models (LLMs) demonstrate their potential to address these challenges. To this end, we aim to enhance the generalization and interpretability of bearing fault diagnosis by leveraging the capabilities of multimodal LLMs. Specifically, a novel framework called DiagLLM is designed to achieve this goal. DiagLLM leverages the powerful reasoning capabilities of large language models and incorporates contextual information from both envelope spectrum images and expert knowledge to accurately diagnose bearing faults. To effectively tune DiagLLM, diagnostic visual instruction-following data are constructed to link fault feature descriptions with signal characteristics, and the entire model is fine-tuned using a parameter-efficient training pipeline. Extensive experiments are conducted on two publicly available bearing fault diagnosis datasets, and the results show that DiagLLM outperforms leading baselines, particularly in scenarios with limited data and cross-data generalization.
Similar content being viewed by others
References
Chen H, Liu Z, Alippi C, et al. Explainable intelligent fault diagnosis for nonlinear dynamic systems: from unsupervised to supervised learning. IEEE Trans Neural Netw Learn Syst, 2024, 35: 6166–6179
Zhong T, Qin C J, Shi G, et al. A residual denoising and multiscale attention-based weighted domain adaptation network for tunnel boring machine main bearing fault diagnosis. Sci China Tech Sci, 2024, 67: 2594–2618
Matania O, Dattner I, Bortman J, et al. A systematic literature review of deep learning for vibration-based fault diagnosis of critical rotating machinery: limitations and challenges. J Sound Vib, 2024, 590: 118562
Zhang K, Wen Q, Zhang C, et al. Self-supervised learning for time series analysis: taxonomy, progress, and prospects. IEEE Trans Pattern Anal Mach Intell, 2024, 46: 6775–6794
Zhao Z, Jiao Y. A fault diagnosis method for rotating machinery based on CNN with mixed information. IEEE Trans Ind Inf, 2022, 19: 9091–9101
Mohammad-Alikhani A, Nahid-Mobarakeh B, Hsieh M F. One-dimensional LSTM-regulated deep residual network for data-driven fault detection in electric machines. IEEE Trans Ind Electron, 2024, 71: 3083–3092
He Y L, Lu Z Y, Zhu Q X. Novel manifold autoencoder for industrial process fault diagnosis. IEEE Trans Ind Inf, 2024, 21: 858–865
Yu J, Liu G. Knowledge extraction and insertion to deep belief network for gearbox fault diagnosis. Knowl-Based Syst, 2020, 197: 105883
Fan J, Yuan X, Miao Z, et al. Full attention wasserstein GAN with gradient normalization for fault diagnosis under imbalanced data. IEEE Trans Instrum Meas, 2022, 71: 1–16
Ji M, Zhao G. DEViT: deformable convolution-based vision transformer for bearing fault diagnosis. IEEE Trans Instrum Meas, 2024, 73: 1–13
Hu W, Xin G, Wu J, et al. Vibration-based bearing fault diagnosis of high-speed trains: a literature review. High-speed Railway, 2023, 1: 219–223
Lei Y, Liu H, Li N, et al. Condition monitoring and fault diagnosis of industrial robots: a review. Sci China Tech Sci, 2025, 68: 1110301
Xie Z, Chen J, Feng Y, et al. End to end multi-task learning with attention for multi-objective fault diagnosis under small sample. J Manuf Syst, 2022, 62: 301–316
Wei Y, Xiao Z, Liu S, et al. A novel data augmentation and composite multiscale network for mechanical fault diagnosis. IEEE Trans Instrum Meas, 2023, 72: 1–12
Li W, Zhong X, Shao H, et al. Multi-mode data augmentation and fault diagnosis of rotating machinery using modified ACGAN designed with new framework. Adv Eng Inf, 2022, 52: 101552
Zhu J, Chen N, Shen C. A new multiple source domain adaptation fault diagnosis method between different rotating machines. IEEE Trans Ind Inf, 2021, 17: 4788–4797
Ding Y, Jia M, Zhuang J, et al. Deep imbalanced domain adaptation for transfer learning fault diagnosis of bearings under multiple working conditions. Reliab Eng Syst Saf, 2023, 230: 108890
An Y, Zhang K, Chai Y, et al. Gaussian mixture variational-based transformer domain adaptation fault diagnosis method and its application in bearing fault diagnosis. IEEE Trans Ind Inf, 2024, 20: 615–625
Qian Q, Luo J, Qin Y. Adaptive intermediate class-wise distribution alignment: a universal domain adaptation and generalization method for machine fault diagnosis. IEEE Trans Neural Netw Learn Syst, 2025, 36: 4296–4310
Shi Y, Deng A, Deng M, et al. Domain transferability-based deep domain generalization method towards actual fault diagnosis scenarios. IEEE Trans Ind Inf, 2023, 19: 7355–7366
Zhu Y, Zi Y, Li J, et al. PhysiCausalNet: a causal- and physics-driven domain generalization network for cross-machine fault diagnosis of unseen domain. IEEE Trans Ind Inf, 2024, 20: 8488–8498
Tama B A, Vania M, Lee S, et al. Recent advances in the application of deep learning for fault diagnosis of rotating machinery using vibration signals. Artif Intell Rev, 2023, 56: 4667–4709
Wang H, Liu Z, Peng D, et al. Interpretable convolutional neural network with multilayer wavelet for noise-robust machinery fault diagnosis. Mech Syst Signal Process, 2023, 195: 110314
Wang S, Liu Z, Jia Z, et al. Intermittent fault diagnosis of analog circuit based on enhanced one-dimensional vision transformer and transfer learning strategy. Eng Appl Artif Intell, 2024, 127: 107281
Li Y, Zhou Z, Sun C, et al. Variational attention-based interpretable transformer network for rotary machine fault diagnosis. IEEE Trans Neural Netw Learn Syst, 2022, 35: 6180–6193
He C, Shi H, Li R, et al. Interpretable modulated differentiable STFT and physics-informed balanced spectrum metric for freight train wheelset bearing cross-machine transfer fault diagnosis under speed fluctuations. Adv Eng Inf, 2024, 62: 102568
Zhou T, Han T, Droguett E L. Towards trustworthy machine fault diagnosis: a probabilistic Bayesian deep learning framework. Reliab Eng Syst Saf, 2022, 224: 108525
Xiao Y, Shao H, Wang J, et al. Bayesian variational transformer: a generalizable model for rotating machinery fault diagnosis. Mech Syst Signal Process, 2024, 207: 110936
Qaid H A A M, Zhang B, Li D, et al. FD-LLM: large language model for fault diagnosis of machines. 2024. ArXiv:2412.01218
Tao L, Liu H, Ning G, et al. LLM-based framework for bearing fault diagnosis. Mech Syst Signal Process, 2025, 224: 112127
Driess D, Xia F, Sajjadi M S M, et al. PaLM-E: an embodied multimodal language model. In: Proceedings of the International Conference on Machine Learning, 2023. 8469–8488
Li J, Li D, Savarese S, et al. BLIP-2: bootstrapping language-image pre-training with frozen image encoders and large language models. In: Proceedings of the International Conference on Machine Learning, 2023. 19730–19742
Wei J, Bosma M, Zhao V Y, et al. Finetuned language models are zero-shot learners. 2022. ArXiv:2109.01652
Zhu D, Chen J, Shen X, et al. MiniGPT-4: enhancing vision-language understanding with advanced large language models. 2023. ArXiv:2304.10592
Liu H, Li C, Wu Q, et al. Visual instruction tuning. In: Proceedings of the Advances in Neural Information Processing Systems, 2024. 34892–34916
Dai W, Li J, Li D, et al. InstructBLIP: towards general-purpose vision-language models with instruction tuning. In: Proceedings of the Advances in Neural Information Processing Systems, 2023
Ye Q, Xu H, Xu G, et al. mPLUG-Owl: modularization empowers large language models with multimodality. 2023. ArXiv:2304.14178
Moon S, Madotto A, Lin Z, et al. AnyMAL: an efficient and scalable any-modality augmented language model. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2024. 1314–1332
Hu E J, Shen Y, Wallis P, et al. LoRA: low-rank adaptation of large language models. In: Proceedings of International Conference on Learning Representations. 2022
Wang B, Lei Y, Li N, et al. A hybrid prognostics approach for estimating remaining useful life of rolling element bearings. IEEE Trans Rel, 2018, 69: 401–412
Lessmeier C, Kimotho J K, Zimmer D, et al. Condition monitoring of bearing damage in electromechanical drive systems by using motor current signals of electric motors: a benchmark data set for data-driven classification. In: Proceedings of the PHM Society European Conference, 2016
Zhang W, Peng G, Li C, et al. A new deep learning model for fault diagnosis with good anti-noise and domain adaptation ability on raw vibration signals. Sensors, 2017, 17: 425
Chen Z, Gryllias K, Li W. Intelligent fault diagnosis for rotary machinery using transferable convolutional neural network. IEEE Trans Ind Inf, 2019, 16: 339–349
Liao J X, Dong H C, Sun Z Q, et al. Attention-embedded quadratic network (Qttention) for effective and interpretable bearing fault diagnosis. IEEE Trans Instrum Meas, 2023, 72: 1–13
Wang H, Liu Z, Peng D, et al. Understanding and learning discriminant features based on multiattention 1DCNN for wheelset bearing fault diagnosis. IEEE Trans Ind Inf, 2020, 16: 5735–5745
Acknowledgements
This work was supported by Fundamental Research Funds for the Central Universities (Grant No. 2682025CX105) and National Natural Science Foundation of China (Grant No. U2468207).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Wang, J., Li, T., Yang, Y. et al. DiagLLM: multimodal reasoning with large language model for explainable bearing fault diagnosis. Sci. China Inf. Sci. 68, 160103 (2025). https://doi.org/10.1007/s11432-024-4333-7
Received:
Revised:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1007/s11432-024-4333-7


