DiagLLM: multimodal reasoning with large language model for explainable bearing fault diagnosis

Wang, Jie; Li, Tianrui; Yang, Yan; Chen, Shiqian; Zhai, Wanming

doi:10.1007/s11432-024-4333-7

DiagLLM: multimodal reasoning with large language model for explainable bearing fault diagnosis

Research Paper
Published: 22 May 2025

Volume 68, article number 160103, (2025)
Cite this article

Science China Information Sciences Aims and scope Submit manuscript

Jie Wang^1,2,3,
Tianrui Li^1,2,
Yan Yang^1,2,
Shiqian Chen³ &
…
Wanming Zhai³

2189 Accesses
42 Citations
Explore all metrics

Abstract

Accurate and reliable bearing fault diagnosis is critical for ensuring the safe operation of mechanical equipment. Previous data-driven methods encounter challenges in training advanced deep learning models, primarily due to the scarcity of fault data and inconsistencies in data distributions. Additionally, these methods often suffer from limited interpretability and reliability, as they lack constraint-guided learning based on the physical mechanisms underlying bearing failures, which hampers their utility in machine condition monitoring. Recent advancements in large language models (LLMs) demonstrate their potential to address these challenges. To this end, we aim to enhance the generalization and interpretability of bearing fault diagnosis by leveraging the capabilities of multimodal LLMs. Specifically, a novel framework called DiagLLM is designed to achieve this goal. DiagLLM leverages the powerful reasoning capabilities of large language models and incorporates contextual information from both envelope spectrum images and expert knowledge to accurately diagnose bearing faults. To effectively tune DiagLLM, diagnostic visual instruction-following data are constructed to link fault feature descriptions with signal characteristics, and the entire model is fine-tuned using a parameter-efficient training pipeline. Extensive experiments are conducted on two publicly available bearing fault diagnosis datasets, and the results show that DiagLLM outperforms leading baselines, particularly in scenarios with limited data and cross-data generalization.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+

from €37.37 /Month

Starting from 10 chapters or articles per month
Access and download chapters and articles from more than 300k books and 2,500 journals
Cancel anytime

Buy Now

Price includes VAT (Netherlands)

Instant access to the full article PDF.

Institutional subscriptions

Efficient Input Data Strategies for LLMs (Large Language Models)-Based Bearing Fault Diagnosis

Article 27 April 2026

Multi-label fault diagnosis of rolling bearing based on meta-learning

Article 17 September 2020

Intelligent fault diagnosis of rolling bearings based on LSTM with large margin nearest neighbor algorithm

Article 29 May 2022

References

Chen H, Liu Z, Alippi C, et al. Explainable intelligent fault diagnosis for nonlinear dynamic systems: from unsupervised to supervised learning. IEEE Trans Neural Netw Learn Syst, 2024, 35: 6166–6179
Article Google Scholar
Zhong T, Qin C J, Shi G, et al. A residual denoising and multiscale attention-based weighted domain adaptation network for tunnel boring machine main bearing fault diagnosis. Sci China Tech Sci, 2024, 67: 2594–2618
Article Google Scholar
Matania O, Dattner I, Bortman J, et al. A systematic literature review of deep learning for vibration-based fault diagnosis of critical rotating machinery: limitations and challenges. J Sound Vib, 2024, 590: 118562
Article Google Scholar
Zhang K, Wen Q, Zhang C, et al. Self-supervised learning for time series analysis: taxonomy, progress, and prospects. IEEE Trans Pattern Anal Mach Intell, 2024, 46: 6775–6794
Article Google Scholar
Zhao Z, Jiao Y. A fault diagnosis method for rotating machinery based on CNN with mixed information. IEEE Trans Ind Inf, 2022, 19: 9091–9101
Article Google Scholar
Mohammad-Alikhani A, Nahid-Mobarakeh B, Hsieh M F. One-dimensional LSTM-regulated deep residual network for data-driven fault detection in electric machines. IEEE Trans Ind Electron, 2024, 71: 3083–3092
Article Google Scholar
He Y L, Lu Z Y, Zhu Q X. Novel manifold autoencoder for industrial process fault diagnosis. IEEE Trans Ind Inf, 2024, 21: 858–865
Article Google Scholar
Yu J, Liu G. Knowledge extraction and insertion to deep belief network for gearbox fault diagnosis. Knowl-Based Syst, 2020, 197: 105883
Article Google Scholar
Fan J, Yuan X, Miao Z, et al. Full attention wasserstein GAN with gradient normalization for fault diagnosis under imbalanced data. IEEE Trans Instrum Meas, 2022, 71: 1–16
Google Scholar
Ji M, Zhao G. DEViT: deformable convolution-based vision transformer for bearing fault diagnosis. IEEE Trans Instrum Meas, 2024, 73: 1–13
Google Scholar
Hu W, Xin G, Wu J, et al. Vibration-based bearing fault diagnosis of high-speed trains: a literature review. High-speed Railway, 2023, 1: 219–223
Article Google Scholar
Lei Y, Liu H, Li N, et al. Condition monitoring and fault diagnosis of industrial robots: a review. Sci China Tech Sci, 2025, 68: 1110301
Article Google Scholar
Xie Z, Chen J, Feng Y, et al. End to end multi-task learning with attention for multi-objective fault diagnosis under small sample. J Manuf Syst, 2022, 62: 301–316
Article Google Scholar
Wei Y, Xiao Z, Liu S, et al. A novel data augmentation and composite multiscale network for mechanical fault diagnosis. IEEE Trans Instrum Meas, 2023, 72: 1–12
Google Scholar
Li W, Zhong X, Shao H, et al. Multi-mode data augmentation and fault diagnosis of rotating machinery using modified ACGAN designed with new framework. Adv Eng Inf, 2022, 52: 101552
Article Google Scholar
Zhu J, Chen N, Shen C. A new multiple source domain adaptation fault diagnosis method between different rotating machines. IEEE Trans Ind Inf, 2021, 17: 4788–4797
Article Google Scholar
Ding Y, Jia M, Zhuang J, et al. Deep imbalanced domain adaptation for transfer learning fault diagnosis of bearings under multiple working conditions. Reliab Eng Syst Saf, 2023, 230: 108890
Article Google Scholar
An Y, Zhang K, Chai Y, et al. Gaussian mixture variational-based transformer domain adaptation fault diagnosis method and its application in bearing fault diagnosis. IEEE Trans Ind Inf, 2024, 20: 615–625
Article Google Scholar
Qian Q, Luo J, Qin Y. Adaptive intermediate class-wise distribution alignment: a universal domain adaptation and generalization method for machine fault diagnosis. IEEE Trans Neural Netw Learn Syst, 2025, 36: 4296–4310
Article Google Scholar
Shi Y, Deng A, Deng M, et al. Domain transferability-based deep domain generalization method towards actual fault diagnosis scenarios. IEEE Trans Ind Inf, 2023, 19: 7355–7366
Article Google Scholar
Zhu Y, Zi Y, Li J, et al. PhysiCausalNet: a causal- and physics-driven domain generalization network for cross-machine fault diagnosis of unseen domain. IEEE Trans Ind Inf, 2024, 20: 8488–8498
Article Google Scholar
Tama B A, Vania M, Lee S, et al. Recent advances in the application of deep learning for fault diagnosis of rotating machinery using vibration signals. Artif Intell Rev, 2023, 56: 4667–4709
Article Google Scholar
Wang H, Liu Z, Peng D, et al. Interpretable convolutional neural network with multilayer wavelet for noise-robust machinery fault diagnosis. Mech Syst Signal Process, 2023, 195: 110314
Article Google Scholar
Wang S, Liu Z, Jia Z, et al. Intermittent fault diagnosis of analog circuit based on enhanced one-dimensional vision transformer and transfer learning strategy. Eng Appl Artif Intell, 2024, 127: 107281
Article Google Scholar
Li Y, Zhou Z, Sun C, et al. Variational attention-based interpretable transformer network for rotary machine fault diagnosis. IEEE Trans Neural Netw Learn Syst, 2022, 35: 6180–6193
Article Google Scholar
He C, Shi H, Li R, et al. Interpretable modulated differentiable STFT and physics-informed balanced spectrum metric for freight train wheelset bearing cross-machine transfer fault diagnosis under speed fluctuations. Adv Eng Inf, 2024, 62: 102568
Article Google Scholar
Zhou T, Han T, Droguett E L. Towards trustworthy machine fault diagnosis: a probabilistic Bayesian deep learning framework. Reliab Eng Syst Saf, 2022, 224: 108525
Article Google Scholar
Xiao Y, Shao H, Wang J, et al. Bayesian variational transformer: a generalizable model for rotating machinery fault diagnosis. Mech Syst Signal Process, 2024, 207: 110936
Article Google Scholar
Qaid H A A M, Zhang B, Li D, et al. FD-LLM: large language model for fault diagnosis of machines. 2024. ArXiv:2412.01218
Google Scholar
Tao L, Liu H, Ning G, et al. LLM-based framework for bearing fault diagnosis. Mech Syst Signal Process, 2025, 224: 112127
Article Google Scholar
Driess D, Xia F, Sajjadi M S M, et al. PaLM-E: an embodied multimodal language model. In: Proceedings of the International Conference on Machine Learning, 2023. 8469–8488
Google Scholar
Li J, Li D, Savarese S, et al. BLIP-2: bootstrapping language-image pre-training with frozen image encoders and large language models. In: Proceedings of the International Conference on Machine Learning, 2023. 19730–19742
Google Scholar
Wei J, Bosma M, Zhao V Y, et al. Finetuned language models are zero-shot learners. 2022. ArXiv:2109.01652
Google Scholar
Zhu D, Chen J, Shen X, et al. MiniGPT-4: enhancing vision-language understanding with advanced large language models. 2023. ArXiv:2304.10592
Google Scholar
Liu H, Li C, Wu Q, et al. Visual instruction tuning. In: Proceedings of the Advances in Neural Information Processing Systems, 2024. 34892–34916
Google Scholar
Dai W, Li J, Li D, et al. InstructBLIP: towards general-purpose vision-language models with instruction tuning. In: Proceedings of the Advances in Neural Information Processing Systems, 2023
Google Scholar
Ye Q, Xu H, Xu G, et al. mPLUG-Owl: modularization empowers large language models with multimodality. 2023. ArXiv:2304.14178
Google Scholar
Moon S, Madotto A, Lin Z, et al. AnyMAL: an efficient and scalable any-modality augmented language model. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2024. 1314–1332
Google Scholar
Hu E J, Shen Y, Wallis P, et al. LoRA: low-rank adaptation of large language models. In: Proceedings of International Conference on Learning Representations. 2022
Google Scholar
Wang B, Lei Y, Li N, et al. A hybrid prognostics approach for estimating remaining useful life of rolling element bearings. IEEE Trans Rel, 2018, 69: 401–412
Article Google Scholar
Lessmeier C, Kimotho J K, Zimmer D, et al. Condition monitoring of bearing damage in electromechanical drive systems by using motor current signals of electric motors: a benchmark data set for data-driven classification. In: Proceedings of the PHM Society European Conference, 2016
Google Scholar
Zhang W, Peng G, Li C, et al. A new deep learning model for fault diagnosis with good anti-noise and domain adaptation ability on raw vibration signals. Sensors, 2017, 17: 425
Article Google Scholar
Chen Z, Gryllias K, Li W. Intelligent fault diagnosis for rotary machinery using transferable convolutional neural network. IEEE Trans Ind Inf, 2019, 16: 339–349
Article Google Scholar
Liao J X, Dong H C, Sun Z Q, et al. Attention-embedded quadratic network (Qttention) for effective and interpretable bearing fault diagnosis. IEEE Trans Instrum Meas, 2023, 72: 1–13
Article Google Scholar
Wang H, Liu Z, Peng D, et al. Understanding and learning discriminant features based on multiattention 1DCNN for wheelset bearing fault diagnosis. IEEE Trans Ind Inf, 2020, 16: 5735–5745
Article Google Scholar

Download references

Acknowledgements

This work was supported by Fundamental Research Funds for the Central Universities (Grant No. 2682025CX105) and National Natural Science Foundation of China (Grant No. U2468207).

Author information

Authors and Affiliations

School of Computing and Artificial Intelligence, Southwest Jiaotong University, Chengdu, 611756, China
Jie Wang, Tianrui Li & Yan Yang
Engineering Research Center of Sustainable Urban Intelligent Transportation, Ministry of Education, Chengdu, 611756, China
Jie Wang, Tianrui Li & Yan Yang
State Key Laboratory of Rail Transit Vehicle System, Chengdu, 610031, China
Jie Wang, Shiqian Chen & Wanming Zhai

Authors

Jie Wang
View author publications
Search author on:PubMed Google Scholar
Tianrui Li
View author publications
Search author on:PubMed Google Scholar
Yan Yang
View author publications
Search author on:PubMed Google Scholar
Shiqian Chen
View author publications
Search author on:PubMed Google Scholar
Wanming Zhai
View author publications
Search author on:PubMed Google Scholar

Corresponding author

Correspondence to Tianrui Li.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wang, J., Li, T., Yang, Y. et al. DiagLLM: multimodal reasoning with large language model for explainable bearing fault diagnosis. Sci. China Inf. Sci. 68, 160103 (2025). https://doi.org/10.1007/s11432-024-4333-7

Download citation

Received: 30 November 2024
Revised: 09 February 2025
Accepted: 05 March 2025
Published: 22 May 2025
Version of record: 22 May 2025
DOI: https://doi.org/10.1007/s11432-024-4333-7

Keywords

Profiles

Tianrui Li View author profile

Access this article

Log in via an institution

Subscribe and save

Springer+

from €37.37 /Month

Starting from 10 chapters or articles per month
Access and download chapters and articles from more than 300k books and 2,500 journals
Cancel anytime

Buy Now

Price includes VAT (Netherlands)

Instant access to the full article PDF.

Institutional subscriptions

DiagLLM: multimodal reasoning with large language model for explainable bearing fault diagnosis

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Efficient Input Data Strategies for LLMs (Large Language Models)-Based Bearing Fault Diagnosis

Multi-label fault diagnosis of rolling bearing based on meta-learning

Intelligent fault diagnosis of rolling bearings based on LSTM with large margin nearest neighbor algorithm

Explore related subjects

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Profiles

Subscribe and save

Buy Now