Research on Intelligent Transformer Fault Diagnosis Model Based on Multimodal Data Fusion and Deep Learning

Tian, Cunjian

doi:10.17559/TV-20251211003197

Technical gazette, Vol. 33 No. 3, 2026.

Original scientific paper

https://doi.org/10.17559/TV-20251211003197

Cunjian Tian ; Extra-high Voltage Branch of State Grid Fujian Electric Power Co., Ltd., Fuzhou Fujian, 350011, China *

* Corresponding author.

Pages: 1208-1218

Abstract

To improve the accuracy of transformer fault diagnosis, this study designs a hybrid intelligent diagnosis mechanism that integrates multimodal data fusion strategies with an adaptive deep learning model. The key diagnostic information is derived from analysis of the gases dissolved in the oil and serves as the input feature of the deep model. For training, an adaptive learning mechanism is proposed that dynamically adjusts the learning rate during iteration according to the real-time convergence trend, improving both the convergence accuracy and the training efficiency of the model. Important parameters of the adaptive deep learning model, such as the number of hidden layers and the learning rate adjustment coefficient, were determined through specific cases. The experimental results show that the proposed method performs well in feature extraction and analysis, with faster convergence and higher convergence accuracy, and can significantly improve the accuracy of transformer fault diagnosis. To address the problem that data alignment and series fusion are often ignored in traditional multimodal data fusion, this paper further proposes an image-text multimodal fusion model based on the cross-attention mechanism. The model first uses BERT and ConvNeXt to extract text and image features, respectively. The attention mechanism in the Image Transformer then extracts further detail from the feature map output by ConvNeXt, yielding higher-level image features and ensuring that the image and text features have the same dimension. Finally, the cross-attention module aligns and fuses the image and text features. Experiments on three datasets, MSAW-Single, MSAW-Multiple and MMSD, show that the image-text multimodal fusion model based on cross-attention reaches classification accuracies of 75.21%, 73.15% and 85.85%, respectively, verifying the effectiveness of the method.
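The abstract does not state the exact rule used to adjust the learning rate, so the following is only a minimal sketch of one common way such a mechanism can work. It uses the well-known "bold driver" heuristic rather than the paper's own method: the learning rate grows slightly after a step that lowers the loss and shrinks sharply after a step that raises it. The function name, the coefficients `up` and `down`, and the toy quadratic objective are all assumptions made for illustration.

```python
def bold_driver_descent(grad, loss, w0, lr=0.1, up=1.05, down=0.5, steps=50):
    """Gradient descent on one scalar parameter with a bold-driver step size.

    `up` and `down` are hypothetical adjustment coefficients; the values
    used in the paper are not given in the abstract.
    """
    w, prev = w0, loss(w0)
    history = [(w, lr, prev)]  # (parameter, learning rate, loss) per iteration
    for _ in range(steps):
        candidate = w - lr * grad(w)
        cur = loss(candidate)
        if cur <= prev:
            # Loss did not get worse: accept the step and grow the rate a little.
            w, prev = candidate, cur
            lr *= up
        else:
            # Loss got worse: reject the step and shrink the rate.
            lr *= down
        history.append((w, lr, prev))
    return w, history


def loss(w):
    # Toy objective standing in for a training loss; minimum at w = 3.
    return (w - 3.0) ** 2


def grad(w):
    # Analytic gradient of the toy objective.
    return 2.0 * (w - 3.0)


w_final, history = bold_driver_descent(grad, loss, w0=0.0)
print(f"final w = {w_final:.6f}, final lr = {history[-1][1]:.4f}")
```

Because a step is kept only when the loss does not increase, the recorded loss never rises, and an initial learning rate that is too large is cut back automatically. In a real diagnosis model the same accept, grow and shrink logic would be driven by the training or validation loss of the network instead of a toy function.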

Keywords

adaptive deep learning model; fault diagnosis; learning rate; multimodal analysis; multimodal data fusion; transformer

Hrčak ID: 346732

URI: https://hrcak.srce.hr/346732

Publication date: 30.4.2026.
