Skip to main content
Log in

DiagLLM: multimodal reasoning with large language model for explainable bearing fault diagnosis

  • Research Paper
  • Published:
Science China Information Sciences Aims and scope Submit manuscript

Abstract

Accurate and reliable bearing fault diagnosis is critical for ensuring the safe operation of mechanical equipment. Previous data-driven methods encounter challenges in training advanced deep learning models, primarily due to the scarcity of fault data and inconsistencies in data distributions. Additionally, these methods often suffer from limited interpretability and reliability, as they lack constraint-guided learning based on the physical mechanisms underlying bearing failures, which hampers their utility in machine condition monitoring. Recent advancements in large language models (LLMs) demonstrate their potential to address these challenges. To this end, we aim to enhance the generalization and interpretability of bearing fault diagnosis by leveraging the capabilities of multimodal LLMs. Specifically, a novel framework called DiagLLM is designed to achieve this goal. DiagLLM leverages the powerful reasoning capabilities of large language models and incorporates contextual information from both envelope spectrum images and expert knowledge to accurately diagnose bearing faults. To effectively tune DiagLLM, diagnostic visual instruction-following data are constructed to link fault feature descriptions with signal characteristics, and the entire model is fine-tuned using a parameter-efficient training pipeline. Extensive experiments are conducted on two publicly available bearing fault diagnosis datasets, and the results show that DiagLLM outperforms leading baselines, particularly in scenarios with limited data and cross-data generalization.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+
from €37.37 /Month
  • Starting from 10 chapters or articles per month
  • Access and download chapters and articles from more than 300k books and 2,500 journals
  • Cancel anytime
View plans

Buy Now

Price includes VAT (Netherlands)

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Chen H, Liu Z, Alippi C, et al. Explainable intelligent fault diagnosis for nonlinear dynamic systems: from unsupervised to supervised learning. IEEE Trans Neural Netw Learn Syst, 2024, 35: 6166–6179

    Article  Google Scholar 

  2. Zhong T, Qin C J, Shi G, et al. A residual denoising and multiscale attention-based weighted domain adaptation network for tunnel boring machine main bearing fault diagnosis. Sci China Tech Sci, 2024, 67: 2594–2618

    Article  Google Scholar 

  3. Matania O, Dattner I, Bortman J, et al. A systematic literature review of deep learning for vibration-based fault diagnosis of critical rotating machinery: limitations and challenges. J Sound Vib, 2024, 590: 118562

    Article  Google Scholar 

  4. Zhang K, Wen Q, Zhang C, et al. Self-supervised learning for time series analysis: taxonomy, progress, and prospects. IEEE Trans Pattern Anal Mach Intell, 2024, 46: 6775–6794

    Article  Google Scholar 

  5. Zhao Z, Jiao Y. A fault diagnosis method for rotating machinery based on CNN with mixed information. IEEE Trans Ind Inf, 2022, 19: 9091–9101

    Article  Google Scholar 

  6. Mohammad-Alikhani A, Nahid-Mobarakeh B, Hsieh M F. One-dimensional LSTM-regulated deep residual network for data-driven fault detection in electric machines. IEEE Trans Ind Electron, 2024, 71: 3083–3092

    Article  Google Scholar 

  7. He Y L, Lu Z Y, Zhu Q X. Novel manifold autoencoder for industrial process fault diagnosis. IEEE Trans Ind Inf, 2024, 21: 858–865

    Article  Google Scholar 

  8. Yu J, Liu G. Knowledge extraction and insertion to deep belief network for gearbox fault diagnosis. Knowl-Based Syst, 2020, 197: 105883

    Article  Google Scholar 

  9. Fan J, Yuan X, Miao Z, et al. Full attention wasserstein GAN with gradient normalization for fault diagnosis under imbalanced data. IEEE Trans Instrum Meas, 2022, 71: 1–16

    Google Scholar 

  10. Ji M, Zhao G. DEViT: deformable convolution-based vision transformer for bearing fault diagnosis. IEEE Trans Instrum Meas, 2024, 73: 1–13

    Google Scholar 

  11. Hu W, Xin G, Wu J, et al. Vibration-based bearing fault diagnosis of high-speed trains: a literature review. High-speed Railway, 2023, 1: 219–223

    Article  Google Scholar 

  12. Lei Y, Liu H, Li N, et al. Condition monitoring and fault diagnosis of industrial robots: a review. Sci China Tech Sci, 2025, 68: 1110301

    Article  Google Scholar 

  13. Xie Z, Chen J, Feng Y, et al. End to end multi-task learning with attention for multi-objective fault diagnosis under small sample. J Manuf Syst, 2022, 62: 301–316

    Article  Google Scholar 

  14. Wei Y, Xiao Z, Liu S, et al. A novel data augmentation and composite multiscale network for mechanical fault diagnosis. IEEE Trans Instrum Meas, 2023, 72: 1–12

    Google Scholar 

  15. Li W, Zhong X, Shao H, et al. Multi-mode data augmentation and fault diagnosis of rotating machinery using modified ACGAN designed with new framework. Adv Eng Inf, 2022, 52: 101552

    Article  Google Scholar 

  16. Zhu J, Chen N, Shen C. A new multiple source domain adaptation fault diagnosis method between different rotating machines. IEEE Trans Ind Inf, 2021, 17: 4788–4797

    Article  Google Scholar 

  17. Ding Y, Jia M, Zhuang J, et al. Deep imbalanced domain adaptation for transfer learning fault diagnosis of bearings under multiple working conditions. Reliab Eng Syst Saf, 2023, 230: 108890

    Article  Google Scholar 

  18. An Y, Zhang K, Chai Y, et al. Gaussian mixture variational-based transformer domain adaptation fault diagnosis method and its application in bearing fault diagnosis. IEEE Trans Ind Inf, 2024, 20: 615–625

    Article  Google Scholar 

  19. Qian Q, Luo J, Qin Y. Adaptive intermediate class-wise distribution alignment: a universal domain adaptation and generalization method for machine fault diagnosis. IEEE Trans Neural Netw Learn Syst, 2025, 36: 4296–4310

    Article  Google Scholar 

  20. Shi Y, Deng A, Deng M, et al. Domain transferability-based deep domain generalization method towards actual fault diagnosis scenarios. IEEE Trans Ind Inf, 2023, 19: 7355–7366

    Article  Google Scholar 

  21. Zhu Y, Zi Y, Li J, et al. PhysiCausalNet: a causal- and physics-driven domain generalization network for cross-machine fault diagnosis of unseen domain. IEEE Trans Ind Inf, 2024, 20: 8488–8498

    Article  Google Scholar 

  22. Tama B A, Vania M, Lee S, et al. Recent advances in the application of deep learning for fault diagnosis of rotating machinery using vibration signals. Artif Intell Rev, 2023, 56: 4667–4709

    Article  Google Scholar 

  23. Wang H, Liu Z, Peng D, et al. Interpretable convolutional neural network with multilayer wavelet for noise-robust machinery fault diagnosis. Mech Syst Signal Process, 2023, 195: 110314

    Article  Google Scholar 

  24. Wang S, Liu Z, Jia Z, et al. Intermittent fault diagnosis of analog circuit based on enhanced one-dimensional vision transformer and transfer learning strategy. Eng Appl Artif Intell, 2024, 127: 107281

    Article  Google Scholar 

  25. Li Y, Zhou Z, Sun C, et al. Variational attention-based interpretable transformer network for rotary machine fault diagnosis. IEEE Trans Neural Netw Learn Syst, 2022, 35: 6180–6193

    Article  Google Scholar 

  26. He C, Shi H, Li R, et al. Interpretable modulated differentiable STFT and physics-informed balanced spectrum metric for freight train wheelset bearing cross-machine transfer fault diagnosis under speed fluctuations. Adv Eng Inf, 2024, 62: 102568

    Article  Google Scholar 

  27. Zhou T, Han T, Droguett E L. Towards trustworthy machine fault diagnosis: a probabilistic Bayesian deep learning framework. Reliab Eng Syst Saf, 2022, 224: 108525

    Article  Google Scholar 

  28. Xiao Y, Shao H, Wang J, et al. Bayesian variational transformer: a generalizable model for rotating machinery fault diagnosis. Mech Syst Signal Process, 2024, 207: 110936

    Article  Google Scholar 

  29. Qaid H A A M, Zhang B, Li D, et al. FD-LLM: large language model for fault diagnosis of machines. 2024. ArXiv:2412.01218

    Google Scholar 

  30. Tao L, Liu H, Ning G, et al. LLM-based framework for bearing fault diagnosis. Mech Syst Signal Process, 2025, 224: 112127

    Article  Google Scholar 

  31. Driess D, Xia F, Sajjadi M S M, et al. PaLM-E: an embodied multimodal language model. In: Proceedings of the International Conference on Machine Learning, 2023. 8469–8488

    Google Scholar 

  32. Li J, Li D, Savarese S, et al. BLIP-2: bootstrapping language-image pre-training with frozen image encoders and large language models. In: Proceedings of the International Conference on Machine Learning, 2023. 19730–19742

    Google Scholar 

  33. Wei J, Bosma M, Zhao V Y, et al. Finetuned language models are zero-shot learners. 2022. ArXiv:2109.01652

    Google Scholar 

  34. Zhu D, Chen J, Shen X, et al. MiniGPT-4: enhancing vision-language understanding with advanced large language models. 2023. ArXiv:2304.10592

    Google Scholar 

  35. Liu H, Li C, Wu Q, et al. Visual instruction tuning. In: Proceedings of the Advances in Neural Information Processing Systems, 2024. 34892–34916

    Google Scholar 

  36. Dai W, Li J, Li D, et al. InstructBLIP: towards general-purpose vision-language models with instruction tuning. In: Proceedings of the Advances in Neural Information Processing Systems, 2023

    Google Scholar 

  37. Ye Q, Xu H, Xu G, et al. mPLUG-Owl: modularization empowers large language models with multimodality. 2023. ArXiv:2304.14178

    Google Scholar 

  38. Moon S, Madotto A, Lin Z, et al. AnyMAL: an efficient and scalable any-modality augmented language model. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2024. 1314–1332

    Google Scholar 

  39. Hu E J, Shen Y, Wallis P, et al. LoRA: low-rank adaptation of large language models. In: Proceedings of International Conference on Learning Representations. 2022

    Google Scholar 

  40. Wang B, Lei Y, Li N, et al. A hybrid prognostics approach for estimating remaining useful life of rolling element bearings. IEEE Trans Rel, 2018, 69: 401–412

    Article  Google Scholar 

  41. Lessmeier C, Kimotho J K, Zimmer D, et al. Condition monitoring of bearing damage in electromechanical drive systems by using motor current signals of electric motors: a benchmark data set for data-driven classification. In: Proceedings of the PHM Society European Conference, 2016

    Google Scholar 

  42. Zhang W, Peng G, Li C, et al. A new deep learning model for fault diagnosis with good anti-noise and domain adaptation ability on raw vibration signals. Sensors, 2017, 17: 425

    Article  Google Scholar 

  43. Chen Z, Gryllias K, Li W. Intelligent fault diagnosis for rotary machinery using transferable convolutional neural network. IEEE Trans Ind Inf, 2019, 16: 339–349

    Article  Google Scholar 

  44. Liao J X, Dong H C, Sun Z Q, et al. Attention-embedded quadratic network (Qttention) for effective and interpretable bearing fault diagnosis. IEEE Trans Instrum Meas, 2023, 72: 1–13

    Article  Google Scholar 

  45. Wang H, Liu Z, Peng D, et al. Understanding and learning discriminant features based on multiattention 1DCNN for wheelset bearing fault diagnosis. IEEE Trans Ind Inf, 2020, 16: 5735–5745

    Article  Google Scholar 

Download references

Acknowledgements

This work was supported by Fundamental Research Funds for the Central Universities (Grant No. 2682025CX105) and National Natural Science Foundation of China (Grant No. U2468207).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tianrui Li.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, J., Li, T., Yang, Y. et al. DiagLLM: multimodal reasoning with large language model for explainable bearing fault diagnosis. Sci. China Inf. Sci. 68, 160103 (2025). https://doi.org/10.1007/s11432-024-4333-7

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Version of record:

  • DOI: https://doi.org/10.1007/s11432-024-4333-7

Keywords

Profiles

  1. Tianrui Li