Robust fault detection and classification in power transmission lines via ensemble machine learning models

Anwar, Tahir; Mu, Chaoxu; Yousaf, Muhammad Zain; Khan, Wajid; Khalid, Saqib; Hourani, Ahmad O.; Zaitsev, Ievgen

doi:10.1038/s41598-025-86554-2

Download PDF

Article
Open access
Published: 20 January 2025

Robust fault detection and classification in power transmission lines via ensemble machine learning models

Tahir Anwar¹^na1,
Chaoxu Mu¹,
Muhammad Zain Yousaf^2,3^na1,
Wajid Khan¹,
Saqib Khalid⁴,
Ahmad O. Hourani⁵ &
…
Ievgen Zaitsev^6,7

Scientific Reports volume 15, Article number: 2549 (2025) Cite this article

4678 Accesses
2 Citations
Metrics details

Subjects

Abstract

Transmission lines are vital for delivering electricity over long distances, yet they face reliability challenges due to faults that can disrupt power supply and pose safety risks. This research introduces a novel approach for fault detection and classification by analyzing voltage and current patterns across transmission line phases. Leveraging a comprehensive dataset of diverse fault scenarios, various machine learning algorithms—including Random Forest (RF), K-Nearest Neighbors (KNN), and Long Short-Term Memory (LSTM) networks—are evaluated. An ensemble methodology, RF-LSTM Tuned KNN, is proposed to enhance detection accuracy and robustness. Results indicate that RF-LSTM Tuned KNN achieves a remarkable accuracy of 99.96% on a multi-label dataset, outperforming RF (97.50%) and KNN (96.55%). In binary classification, KNN attains the highest accuracy of 99.85%, closely followed by RF at 99.72%. This methodology provides significant advancements in fault detection capabilities, offering valuable insights for improving grid reliability and stability, and ensuring a more resilient power supply.

Bayesian-optimized LSTM-DWT approach for reliable fault detection in MMC-based HVDC systems

Article Open access 02 August 2024

Enhancing HVDC transmission line fault detection using disjoint bagging and bayesian optimization with artificial neural networks and scientometric insights

Article Open access 09 October 2024

Integrating fault detection and classification in microgrids using supervised machine learning considering fault resistance uncertainty

Article Open access 18 November 2024

Introduction

Transmission lines play a critical role in the transportation of electricity over long distances; however, they are susceptible to faults that can disrupt operations and cause significant economic losses¹. The reliability of these lines heavily depends on effective fault identification and classification. The power system is divided into three main stages: generation, transmission, and distribution. Each stage is designed to ensure a stable and continuous electricity supply². Due to their exposure to the environment, parts of the electrical system are more prone to faults, making fault management essential. Transmission line faults are generally categorized into two types: short circuit (shunt faults) and series (open conductor faults). Shunt faults occur when there is contact between two or more conductors or between a conductor and the ground. In contrast, open conductor faults occur due to a break in the conductor. These faults can arise from various causes such as lightning strikes, short circuits between lines, accidents, unforeseen incidents, or human error³.

The advancements in Artificial Intelligence (AI) have brought about significant improvements in the detection and classification of faults in transmission lines. AI techniques have demonstrated their capability to enhance the accuracy and speed of fault detection and classification, which is essential for maintaining the stability and reliability of power systems^4,5,6. AI-driven methods leverage advanced algorithms and machine learning for automatic fault detection and classification, offering improved accuracy and efficiency over traditional approaches. These methods incorporate optimization techniques for fine-tuning, enhancing both training effectiveness and overall system performance⁷. AI models identify real-time anomalies, enabling swift responses to critical events, pre-emptive maintenance, and risk mitigation. Furthermore, AI streamline’s fault classification, reducing reliance on manual analysis and expertise^8,9. By analyzing patterns in sensor readings, performance metrics, and historical data, AI algorithms accurately classify faults such as line-to-ground, line-to-line, and open conductor faults.

Integrating AI into fault identification and classification offers numerous benefits for the transmission line industry. Automating these processes significantly reduces human error and response time, enhances system reliability, and minimizes downtime. AI models continuously adapt to evolving conditions, improving fault detection accuracy over time^10,11. As data availability increases and AI technologies advance, AI-driven fault identification and classification will play an essential role in ensuring the seamless operation of electrical grids, driving efficiency, reliability, and innovation in the industry¹². AI can greatly enhance fault detection in VSC-based MTdc systems by improving adaptability and reliability in protection strategies, addressing grid complexities, and supporting renewable energy integration¹³. A model for locating dc-link faults in MT-HVdc networks combines DWT, Bayesian optimization, and ANN, achieving robust detection even in noisy environments¹⁴. ANFIS effectively integrates ANN learning with fuzzy logic for fault location and classification^3,15,16. A deep learning-based protection scheme for meshed high-voltage direct current (HVDC) grids employs LSTM networks and discrete wavelet transforms to enhance fault detection reliability without complex thresholding¹⁷.

In 2021,¹⁸ developed a robust disturbance classification method addressing PMU data quality issues, using a univariate temporal convolutional denoising autoencoder (UTCN-DAE) and a multivariable temporal convolutional denoising network (MTCDN) for feature extraction, achieving 97.69% accuracy with high computational efficiency. Similarly,¹⁹ presented a fully automated deep-learning approach for fault classification in grid networks using a fully convolutional network (FCN). Tested on an IEEE 30-bus system, their method achieved an impressive accuracy of 99.27%, outperforming existing methods and validated through tenfold cross-validation and various performance metrics.

In Reference³³, Tong et al. introduced an intelligent fault diagnosis approach for rolling bearings that utilizes Gramian Angular Difference Field (GADF) and an improved dual attention residual network. This method highlights the potential of advanced feature extraction techniques, such as GADF, in capturing complex spatiotemporal patterns from sensor data, which can be beneficial for improving fault classification accuracy in power transmission line systems. The dual attention mechanism used by Tong et al. provides an additional layer of feature prioritization, which could inspire further refinement of fault detection models, particularly in identifying the most relevant features in noisy, multidimensional datasets. In Reference³⁴, Ma et al. explored relaying-assisted communications for demand response in smart grids, incorporating cost modeling, game strategies, and algorithms. While this paper primarily addresses communication systems, the underlying optimization strategies for resource allocation and decision-making in smart grids offer valuable insights for enhancing the coordination between fault detection systems and grid management tools. Such strategies could be adapted to improve the efficiency and speed of fault detection responses in transmission line systems. Reference³⁵ by Hang et al. presented a method for diagnosing interturn short-circuit faults and fault-tolerant control in DTP-PMSMs using subspace current residuals. This work emphasizes the importance of fault-tolerant control, which is highly relevant for power transmission systems where ensuring continued operation in the event of faults is critical. The use of current residuals for fault detection in rotating machinery can provide a framework for integrating fault-tolerant mechanisms with fault detection algorithms, enhancing the robustness of the overall system. Similarly, Reference³⁶ by Hang et al. proposed an improved fault diagnosis method for permanent magnet synchronous machine (PMSM) systems using lightweight multi-source information data layer fusion. This approach underlines the benefit of combining multiple data sources to improve fault diagnosis accuracy. The concept of multi-source data fusion is particularly relevant for power transmission systems, where data from various sensors and sources can be integrated to strengthen fault detection and classification models. Finally, in Reference³⁷, Ni et al. developed an explainable neural network by integrating the Jiles-Atherton model and nonlinear auto-regressive exogenous models for modeling universal hysteresis. This work contributes to the growing field of explainable AI by focusing on model transparency and interpretability, which is essential for high-stakes applications like fault detection in power systems. The integration of explainable AI in fault detection systems could enhance the trust and adoption of machine learning-based approaches by providing more transparent insights into the decision-making process of the models.

Addressing the challenge of limited labeled fault data in power distribution systems,²⁰ proposed a novel multi-task learning framework that leverages latent structures in unlabeled data, achieving an accuracy of 99.02% on distribution-level phasor devices and demonstrating robustness against measurement noise. Reference²¹ introduced a method for fault diagnosis of power transformers using dissolved gas analysis (DGA) and data transformation techniques, employing six optimized machine learning methods to achieve a high predictive accuracy of 90.61%, thereby enhancing power system reliability through early fault detection.

Reference²² further advanced the field with a 3D deep learning algorithm for accurately classifying power system faults using RGB channels from 4D images of transformed fault currents. Their model, which addressed overfitting with dropout provisions, achieved classification accuracies of 93.75% and 100% with dropout values of 0.4 and 0.5, respectively. Meanwhile,²³ developed an anomaly-based technique for fault detection in electrical power systems using One-Class SVM and PCA-based models, achieving accuracies of 79.84% and 79.28%, respectively, on the VSB Power Line Fault Detection dataset from Kaggle.

In a similar²⁴ proposed an unsupervised framework utilizing a capsule network with sparse filtering (CNSF) for fault detection and classification in transmission lines. The CNSF model demonstrated high accuracy ranging from 97 to 99%, showcasing robustness against noise and high impedance faults (HIF) across different TL topologies and fault conditions. Reference²⁵ introduced open-source software for generating synthetic power quality disturbances (PQDs) to benchmark classification techniques. Their deep-learning-based classifiers achieved accuracies of 99.28–99.75% (without noise) and 96.28–98.13% (with noise), facilitating the development and comparison of PQD classification algorithms.

Reference²⁶ proposed an LSTM-based 'end-to-end’ machine learning approach for fault detection and classification in power transmission networks, achieving a high accuracy of 99.00% across diverse fault scenarios and operational conditions on the WSCC 9-bus system. Additionally, in²⁷, a CNN transformer model was introduced for detecting and localizing faults in power lines, achieving 97.53% accuracy in fault classification and 96.14% in fault localization on the IEEE 14-bus system. In²⁸, a data-driven approach for fault location and classification in power distribution systems is presented, combining wavelet transformation with optimized Convolutional Neural Networks (CNNs) and achieving fault detection accuracy of 91.4%, branch identification accuracy of 93.77%, and fault type classification accuracy of 94.93%. Meanwhile,²⁹ proposes a method for fault detection and classification in DC transmission lines using LSTM networks and discrete wavelet transform (DWT), achieving 99.04% accuracy through a unique relay system and Bayesian optimization for hyperparameter tuning. Additionally, a hybrid CNN-LSTM model integrated with real-time RFID data enhances rotor angle stability and fault detection in microgrids, achieving a classification accuracy of 94.93%³⁰.

Lastly,³¹ developed an ensemble learning model using PMU data for transmission line fault classification, achieving an accuracy of 99.88% with Explainable AI (XAI) used to interpret predictions, validated on the IEEE 14-bus system. In response to existing research that prioritizes high accuracy while complicating models with ensemble techniques, this study proposes a single optimized ensemble method, RF-LSTM Tuned KNN, which balances performance with computational efficiency. This method achieves 99.93% accuracy in multi-label fault classification, enhancing transmission system reliability. Various machine learning models, including Random Forest (RF), K-Nearest Neighbors (KNN), and Long Short-Term Memory (LSTM), were evaluated across binary and multi-label datasets, establishing a robust benchmark for fault detection.

The integration of AI techniques in power transmission systems offers several benefits, including reduced human intervention, faster response times, enhanced predictive maintenance capabilities, and improved adaptability to changing conditions. By focusing on high-performing models, this research enhances fault detection capabilities and contributes to the development of resilient and intelligent power grids while promoting efficiency and reliability.

Recent advancements in fault detection for transmission lines and power networks using machine learning and deep learning have shown significant progress, as illustrated in Table 1. These contributions range from hybrid ensemble models that combine various algorithms to deep learning approaches leveraging neural networks. Research in this field has demonstrated promising results in enhancing the accuracy and reliability of fault detection, which is crucial for maintaining the stability of power systems, given the vulnerabilities inherent in transmission line operations. The continuous innovation in this area is essential to address the evolving challenges in power system reliability and to ensure an uninterrupted electricity supply to consumers.

Table 1 Overview of the recent contributions.

Full size table

Despite the progress, previous studies have often concentrated on individual algorithms or lacked comprehensive ensemble approaches. Moreover, the analysis of diverse fault scenarios, especially for binary and multi-label data, has been limited, which impacts detection accuracy. This paper aims to fill these gaps by introducing a novel ensemble technique, the RF-LSTM Voting Ensemble, which significantly enhances fault detection performance. By leveraging a comprehensive dataset and evaluating multiple machine learning algorithms, this study offers critical insights into improving grid reliability and stability.

The primary objective of this research is to develop a robust fault detection and classification system for power transmission lines. We evaluated the performance of individual machine learning models, such as Random Forest, KNN, and LSTM, on both binary and multi-label datasets. We also implemented an advanced ensemble technique, the RF-LSTM Voting Ensemble, aiming to achieve high accuracy while managing model complexity. Evaluation metrics such as accuracy, precision, recall, and F1-score were utilized to assess the models’ performance.

The RF-LSTM Voting Stacked Tune KNN showed strong results, with KNN and Random Forest emerging as top performers on binary datasets. These findings highlight the effectiveness of our approach in advancing fault detection capabilities and enhancing the resilience of power grids.

Methodology

The study used machine learning algorithms such as Random Forest (RF), K-Nearest Neighbors (KNN), and Long Short-Term Memory (LSTM) to identify and classify transmission line faults. Additionally, an RF-LSTM Stacked Tune KNN Ensemble was implemented. The framework involved data pre-processing and prediction modeling, with Particle Swarm Optimization (PSO) employed for hyperparameter tuning. Models were evaluated using confusion matrices, cross-validation, ROC curves, and learning curve analysis. We used TensorFlow for deep learning to develop and train our detection and classification models. Our experiments were conducted on a system with an Intel Core i5-4590 CPU, Nvidia GeForce GTX 750Ti GPU, and 8 GB of RAM. This setup provided adequate computational resources to train, test, and evaluate the hybrid models, allowing us to effectively validate their performance in electric fault detection and classification.

Dataset analysis

The Electrical Fault Detection and Classification dataset from Kaggle is crucial for enhancing fault detection in power transmission networks. It includes line currents and voltages recorded under various fault conditions from a system with four 11 kV generators and transformers at the midpoint of the transmission line. This dataset supports the development of algorithms for accurate fault detection and classification, improving network reliability and minimizing downtime. The dataset comprises two parts: a binary classification dataset with 12,001 rows and 9 columns for detecting fault presence, and a multi-class classification dataset with 7861 rows and 10 columns for identifying specific fault types. To ensure the final model’s robustness, a real-time simulated dataset with 11,701 rows and 10 columns is generated by applying faults at different lengths of transmission lines, as shown in Table 3, is used for training and testing. This dataset reflects practical scenarios with dynamic environmental conditions and operational variations, providing a more comprehensive evaluation framework. Tables 2 and 3 outline the details of these datasets, while Table 4 lists the six fault categories, including No-Fault, Line to Ground, Line to Line, Line to Line to Ground, Line to Line to Line, and Line to Line to Line to Ground. These datasets provide a comprehensive foundation for both detecting and classifying faults in electrical relays and transmission lines.

Table 2 Dataset overview (from³²).

Full size table

Table 3 RTD-dataset overview.

Full size table

Table 4 Target label information.

Full size table

The dataset analysis is depicted through several figures. Figure 1 shows scatter plots of voltage versus current for three lines (a, b, c), highlighting the voltage-current relationships and line-specific behaviors. Figure 2 details classification tasks: Fig. 2a represents binary classification with 54.2% “fault” and 45.8% "no-fault" instances, while Fig. 2b illustrates a multi-class classification with six fault types and a "no-fault" category, including specific distributions for each fault type. Finally, Fig. 3 displays unimodal distributions for current and voltage in both classification scenarios, showing typical values and aiding in outlier detection.

Figure 3 presents correlation matrices for binary, multi-label datasets and RTD-multi-label datasets labeled as (a), (b) and (c) respectively, to identify the exact set of input parameters in machine learning. The binary data correlation matrix in (a) showcases the relationships among variables in a binary classification context, while (b) and (c) illustrate correlations within a multi-classification framework. Rows and columns correspond to variables, and the values indicate correlation coefficients, reflecting the strength and direction of relationships. Diagonal elements, always displaying a correlation of 1, represent self-correlation, while off-diagonal elements reveal pairwise correlations. Values range from − 1 to 1: a correlation of 1 signifies a robust positive relationship, − 1 represents a strong negative correlation, and 0 indicates no correlation. These correlation matrices uncover patterns, identify potential multi collinearity issues, and provide insights into the dataset’s structure. However, correlation does not imply causation, necessitating further analysis to establish definitive causal relationships.

Data preprocessing

After analyzing the dataset’s characteristics, it was divided into training (75%) and testing (25%) sets, ensuring representation from each scenario. The target column was label encoded to a numeric format. Principal Component Analysis (PCA) was applied to input features to reduce noise and correlation, enhancing model performance by streamlining feature space dimensionality. In binary classification, PCA highlighted features closely linked to fault occurrences, while in multi classification, six principal components captured the dataset’s variance, retaining essential information for accurate predictions. Representation of the data preparation stages for input features is depicted in Fig. 4.

To address the class imbalance, the Synthetic Minority Over-Sampling Technique (SMOTE) generated synthetic samples for minority class instances using Eq. 1.³¹

$${X}_{g}= {X}_{m} + \delta ({X}_{n} - {X}_{m})$$

(1)

where δ is a random number in the range [0, 1]. This balanced the training data and improved model accuracy. Feature standardization was achieved using z-score normalization (Eq. 2):

$$z= \frac{X-\mu }{\sigma }$$

(2)

ensuring features had a mean of zero and a standard deviation of one, aiding in model training. Supervised classifiers were trained and evaluated on binary and multi-class datasets. To mitigate overfitting due to the small dataset size, fourfold cross-validation and hyperparameter tuning using the Particle Swarm Optimization (PSO) algorithm (Eqs. 3 and 4) were employed.

$${v}_{(i,j)}^{(t+1)} = w. {v}_{(i,j)}^{t}+{c}_{1}{r}_{1} ({p}_{(i,j)}-{v}_{(i,j)}^{t}) + {c}_{2}{r}_{2}({g}_{j}-{x}_{(i,j)}^{t})$$

(3)

$${x}_{i,j}^{t+1}= {x}_{i,j}^{t} + {v}_{i,j}^{t+1}$$

(4)

In PSO for hyperparameter tuning, each particle’s position x represents a set of hyperparameters. The algorithm iteratively updates each particle’s position and velocity, guided by its personal best position p and the swarm’s global best position g, to find the optimal hyperparameters for improved model performance.

Proposed framework

The proposed framework first detects faults and then classifies them into different categories. This section outlines the methodology for binary classification, which was then refined for multi-class classification, as shown in Fig. 5. At this stage, the dataset has already been pre-processed and split into training and testing sets.

The initial datasets, ${D}_{binary}$ and ${D}_{multi}$ containing binary and multi-label classifications respectively, are described in Eq. 5. To ensure independent and identically distributed data for cross-validation and evaluation, each dataset was divided into four subsets, denoted as follows:

$${{D}_{binary}}_{i}= \left\{{x}_{i}, {y}_{i}\right\}fori=\text{1,2},\text{3,4},\dots$$

(5a)

$${D}_{m{ulticlass}_{i}}= \left\{{x}_{i}, {y}_{i}\right\}fori=\text{1,2},\text{3,4},\dots$$

(5b)

In this iterative process, the dataset is partitioned into four subsets over four rounds (j = 1, 2, 3, 4). In each round, three subsets form the training set ${D}_{train}^{j}$, while the remaining subset acts as the test set, as shown in Eq. 6.

$${D}_{train}^{j}= {{D}_{binary}}_{i} \cup {{D}_{binary}}_{i+1}\cup {{D}_{binary}}_{i+3}\{j=1, 2, 3, \dots i, i+1, i+2 \ne j$$

(6a)

$${D}_{test}^{j}= {{D}_{binary}}_{j}j=1, 2, 3,\dots$$

(6b)

The same partitioning process applies to the multi-label classification dataset ${D}_{multi}$, ensuring each subset is used once as the test set for robust cross-validation. We applied three classification models to both datasets: Random Forest (RF), K-Nearest Neighbors (KNN), and Long Short-Term Memory (LSTM).

Random Forest constructs multiple decision trees. For binary classification, the final prediction ${y}_{RF\left(x\right)}$ is determined by majority voting among Mtrees (Eq. 7a). For multi-label classification, the prediction ${y}_{RF\left(x\right)}^{k}$ for each label ${y}^{k}$ is also based on majority voting among M trees (Eq. 7b).

$${y}_{RF}\left(x\right)={argmax}_{c{\in }_{\{\text{0,1}\}}}{\sum }_{i=1}^{M}I({y}_{RF}\left(x\right)=c)$$

(7a)

$${y}_{RF\left(x\right)}^{k}={argmax}_{c{\in }_{(\text{0,1})}}{\sum }_{i=1}^{M}I({{y}_{RF}}_{i}(x)=c)$$

(7b)

Long Short-Term Memory (LSTM) networks build a recurrent neural network during training. For binary classification, the final prediction ${y}_{LSTM}(x)$ of is obtained by applying a sigmoid activation function σ to the output of the last time step. Specifically, this prediction is calculated using the hidden state ${h}_{t}$ the weight matrix ${W}_{o}$ and the bias term ${b}_{o}$ for the output layer (Eq. 8a). For multi-label classification, the prediction ${{y}_{GRU}(x)}^{k}$ for each label k is similarly determined by applying σ to the output of the last time step for that label, using the hidden state ${h}_{t}$ weight matrix ${{W}_{o}}^{k}$ and bias term ${b}_{o}^{k}$(Eq. 8b).

$${y}_{GRU}(x)=\sigma ({W}_{o}{h}_{t}+ {b}_{o})$$

(8a)

$${{y}_{GRU}(x)}^{k}=\sigma ({{W}_{o}}^{k}{h}_{t}+ {b}_{o}^{k})$$

(8b)

k-Nearest Neighbors (k-NN) predicts based on the labels of the k closest data points. For binary classification, the final prediction ${y}_{KNN}\left(x\right)$ is determined by majority voting among the k nearest neighbors, where ${y}_{{KNN}_{i}}\left(x\right)$ is the label of the i-th nearest neighbor and I is the indicator function (Eq. 9a). For multi-label classification, the prediction ${{y}_{KNN}(x)}^{k}$ for each ${y}^{k}$ is also determined by majority voting among the k nearest neighbors for that label (Eq. 9b).

$${y}_{kNN}\left(x\right)={argmax}_{c{\in }_{(\text{0,1})}} \left( {\sum }_{i=1}^{k}I({y}_{{kNN}_{i}}\left(x\right)\right)$$

(9a)

$${{y}_{kNN}(x)}^{k}={argmax}_{c{\in }_{(\text{0,1})}} \left( {\sum }_{i=1}^{k}I({y}_{{kNN}_{i}}^{K}(x)\right)$$

(9b)

After the analysis of the individual models Random Forest (RF) and Long Short-Term Memory (LSTM) the strengths and limitations in multi-fault classification were identified. To enhance overall performance, a stacked tune ensemble technique is implemented, combining the predictions of both models for each label. This ensemble approach integrates the strengths of each model, resulting in more robust and accurate classifications. In multi-fault classification, the RF-LSTM Stacked Tune KNN ensemble model combines predictions from Random Forest and LSTM. Let ${y}_{RF\left(x\right)}^{k}$ and ${{y}_{LSTM}(x)}^{k}$ denote the predictions for the k-th fault label. The final prediction ${y}_{stack}{(x)}^{k}$ is determined as in Eqs. 10 and 11:

$$z{(x)}^{k}={(y}_{RF\left(x\right)}^{k}, {{y}_{LSTM}(x)}^{k})$$

(10)

$${{y}_{stack}(x)}^{k}={argmax}_{c{\in }_{(\text{0,1})}}{\sum }_{i=1}^{k}I({{y}_{stack}(x)}^{k}=c )$$

(11)

Here, ${{y}_{stack}(x)}^{k}$ is the final prediction for the k-th label, I is the indicator function that equals 1 if the statement is true and 0 otherwise, and ${argmax}_{c{\in }_{(\text{0,1})}}$ selects the class c that is predicted by the majority of the nearest neighbors.

Evaluation of the methodology

To evaluate the performance of a classification model, we used several metrics, including the confusion matrix, accuracy, precision, recall, F1 score, and ROC curve with AUC. The confusion matrix is a table that shows the counts of true positives (TP), true negatives (TN), false positives (FP), and false negatives (FN), providing a comprehensive overview of the model’s performance. Accuracy is calculated as the ratio of correctly predicted instances (both TP and TN) to the total number of instances, as shown in Eq. 12^8,32.

$$Accuracy=\frac{TP+TN}{TP+TN+FP+FN}$$

(12)

Accuracy provides the overall correctness of the model but may be misleading in scenarios with imbalanced datasets, where certain fault types are more frequent than others. Precision measures the accuracy of positive predictions and is crucial when the importance of false positives is high, such as incorrectly identifying a fault that isn’t there, as shown in Eq. 13. Recall, or sensitivity, as shown in Eq. 14, evaluates the model’s ability to correctly identify actual faults. This is vital when the importance of missing a fault (false negatives) is high, as it could lead to severe consequences in the power system’s operation. The F1 score balances precision and recall, providing a single metric that is useful when dealing with imbalanced classes of faults as explained in Eq. 15.

$$Precission=\frac{TP}{TP+FP}$$

(13)

$$Recall=\frac{TP}{TF+FN}$$

(14)

$$F1 score=2*\frac{Precision\times Recall}{recision+Recall}$$

(15)

$$FPR=\frac{FP}{FP+TN}, TPR= \frac{TP}{TP+FN}$$

(16)

$$AUC= {\sum }_{i=1}^{n-1}\left[\left({FPR}_{i+1}-{FPR}_{i}\right).\frac{{TPR}_{i} + {TPR}_{i+1}}{2}\right]$$

(17)

The Receiver Operating Characteristic (ROC) curve plots the true positive rate (Recall) against the false positive rate (FPR), as described in Eq. 16. The Area Under the ROC Curve (AUC), defined in Eq. 17, summarizes the model’s performance, with an AUC of 1 indicating a perfect model and 0.5 indicating no discriminative ability. These metrics provide a comprehensive assessment of our model’s performance, ensuring robustness, reliability, and effectiveness in real-world fault identification and classification for transmission lines.

Results and discussions

Following the discussion on the model and its mathematical framework, this section detailed the evolution of our proposed methodology, beginning with binary classification and advancing to multi-class classification. It thoroughly examined the performance and results using comprehensive evaluation metrics, providing a clear context for our analysis.

Binary classification

Outstanding performance and reliability in binary fault classification have been demonstrated by the models. Detailed evaluations and comparisons underscore the strengths of each model in accurately identifying faults. As illustrated in Fig. 6, near-perfect performance is revealed by the confusion matrices. Notably, remarkable metrics were achieved by the Random Forest (RF) model, with an accuracy of 99.72%, precision of 99.72%, recall of 99.65%, and an F1 score of 99.78%. Slightly better performance was exhibited by the K-Nearest Neighbors (KNN) model, with an accuracy of 99.75%, while impressive performance was also demonstrated by the LSTM model, with an accuracy of 98.70%. These metrics underscore the robust capabilities of all models in binary fault classification.

In addition to high accuracy, precision, recall, and F1 scores, the performance of the models is further emphasized by ROC curves and comprehensive analyses. As shown in Fig. 7, an AUC of 1.0 is achieved by each ROC curve, indicating perfect classification ability for the Random Forest, K-Nearest Neighbors, and LSTM models. Despite slightly lower overall accuracy, the LSTM model maintains an AUC of 1.0, showcasing strong classification capability.

A detailed analysis of each model’s effectiveness in binary fault classification is provided in Table 5, revealing minimal errors and excellent discrimination between fault and no-fault cases. These findings collectively highlight the exceptional performance, high effectiveness, and reliability of the models in binary fault detection.

Table 5 Performance of classifier on binary label data set.

Full size table

Multi-label classification

After the models were evaluated for binary classification, the analysis was extended to multi-class scenarios to assess their performance in more complex tasks involving multiple fault categories. An RF-LSTM Voting Ensemble model is also proposed to further enhance fault detection accuracy and robustness. The confusion matrices in Fig. 8 align closely with the ROC values in Fig. 9, providing detailed insights into each model’s performance. Notably, the Random Forest (RF) and LSTM models consistently achieved impressive ROC values of 0.99 for classes 2 and 5, and perfect ROC values of 1 for other classes. High accuracy was demonstrated, with RF achieving 97.50% and LSTM achieving 96.62%. Precision, recall, and F1 scores were also notable, reflecting the models’ ability to maintain high performance across different fault categories.

In contrast, slightly lower ROC values of 0.94 and 0.95 were observed for classes 2 and 5, respectively, when using the K-Nearest Neighbors (KNN) model. Despite this, the KNN model still achieved respectable accuracy (96.55%) and other performance metrics, showcasing its capability in fault classification. A comprehensive summary of the models’ performance metrics, including accuracy, precision, recall, F1 score, and mean cross-validation (CV) score, is provided in Table 6.

Table 6 Performance of classifier on multi-label data.

Full size table

The performance of the ensemble model, as illustrated by the confusion matrix in Fig. 10 and the ROC curve in Fig. 11, demonstrates its exceptional capabilities. A comprehensive evaluation, detailed in Table 6, reveals that this RF-LSTM Stacked Tune KNN ensemble model surpasses the individual Random Forest (RF) and Long Short-Term Memory (LSTM) models. While the RF and LSTM models perform well individually, the RF-LSTM Stacked Tuned KNN ensemble achieves an impressive accuracy of 99.96% and a mean cross-validation score of 98.96%, which closely aligns with the model’s performance on Kaggle standard data. This result underscores the effectiveness of ensemble learning, where the combination of diverse models leverages their collective strengths, leading to robust and reliable predictions for transmission line fault classification.

The learning curves of the K-Nearest Neighbors (KNN) model and the RF-LSTM Stacked Tuned KNN model are demonstrated to showcase superior performance across different data types. The KNN model is shown to excel with binary label data, while the RF-LSTM Stacked Tuned KNN model exhibits outstanding performance with multi-label data. Both models maintain consistent accuracy and efficient learning progression over multiple epochs and data splits, validating their robustness and effectiveness.

In Fig. 12, the normalized learning curve of the KNN model is illustrated. The X-axis represents the size of the training set, which ranges from 0 to 1, while the Y-axis displays accuracy, ranging from 0.7 to 1, for improved visualization.

Figure 13 depicts the normalized learning curve of the RF-LSTM Stacked Tuned KNN model, where the X-axis also represents the size of the training set, ranging from 0 to 1, and the Y-axis shows accuracy, ranging from 0.9 to 1, to enhance visualization.

To evaluate the robustness of the RF-LSTM Stacked Tuned KNN ensemble model for practical applications, the model is trained on real-time simulation data (RTD), as presented in Table 3. On this dataset, the ensemble achieves a remarkable accuracy of 99.97% and a cross-validation score of 99.96%, with a training time of 14 min, as shown in Table 6. The performance on RTD data is illustrated in the confusion matrix and the ROC curve shown in Fig. 14, highlighting the model’s exceptional capability in accurately classifying transmission line faults. An in-depth assessment provided in Table 6 confirms that the RF-LSTM Stacked Tuned KNN ensemble model maintains high accuracy on both the Kaggle dataset and the real-time simulation dataset, demonstrating robustness and reliability across various fault categories. This affirms the model’s practical applicability in real-world fault detection scenarios.

Table 7 presents the memory usage and training time for various models in both multi-label and binary classification tasks. The Random Forest (RF) model demonstrates efficiency with a training time of 17 s and memory usage of 13 MB for binary classification, while the K-Nearest Neighbors (KNN) model takes 2.3 min and uses 197 MB. The LSTM model, although powerful, requires 7.3 min and 983 MB of memory. In the multi-label scenario, RF requires 17 s and 171 MB, while KNN takes 3 min and consumes 291 MB. The LSTM model’s training time increases to 9.4 min with memory usage of 731 MB. In contrast, the RF-LSTM Stacked Tuned KNN ensemble achieves a training time of 14 min and memory usage of 1.3 GB. This ensemble effectively balances accuracy and resource utilization, making it suitable for complex classification tasks, thereby highlighting its efficiency and scalability in real-world applications.

Table 7 Comparison of models’ training time and memory usage.

Full size table

Discussion

This study investigates the effectiveness of various machine learning techniques for fault classification in transmission lines, emphasizing both binary and multi-label scenarios. Our systematic evaluation focuses on three distinct models—RF (Random Forest), LSTM (Long Short-Term Memory), and KNN (K-Nearest Neighbors)—across these classification tasks. Each model’s performance and suitability for detecting faults in power system data are thoroughly tested.

In binary classification, our KNN model achieves an impressive accuracy of 99.85%, significantly outperforming traditional techniques like SVM PCA, which reported an accuracy of 79.84%. This highlights the efficacy of simpler models in achieving high accuracy with reduced complexity.

For multi-label classification, we utilize advanced ensemble techniques to enhance predictive performance. Specifically, we combine RF and LSTM models using the RF-LSTM Stack Tune KNN method. This approach excels with an outstanding accuracy of 99.96%, demonstrating its capability to handle complex datasets and achieve superior performance compared to other methodologies.

To provide deeper insights into model performance, our evaluation includes ROC curves and confusion matrices. The ROC curves (see Figs. 7, 9, 11) graphically illustrate the trade-off between the true positive rate and false positive rate, with higher Area Under the Curve (AUC) values indicating better discriminatory performance. Our RF-LSTM Stack Tune KNN model exhibits strong AUC values, further validating its effectiveness in accurately distinguishing fault classes. Additionally, the confusion matrices (see Figs. 6, 8, 10) offer a detailed view of model performance across different fault categories, showcasing the robustness of our approach.

Table 8 presents a comparative analysis of fault classification models applied to power systems, emphasizing the performance of the models alongside recent studies. In multi-label classification, the RF-LSTM Stacked Tune KNN ensemble model is shown to significantly outperform traditional deep learning approaches such as CNN (99.27%)¹⁹ and WTO-CNN (94.93%)²⁸, as well as other ensemble methods like EXAI (99.88%)³¹. These findings highlight the strategic advantage of combining RF and LSTM models within ensemble frameworks, which not only improve predictive accuracy but also reduce computational complexity in comparison to more complex deep learning architectures. Furthermore, Fig. 15 visually depicts the comparative performance of both multi-label and binary-label models, reinforcing the benefits identified in this analysis.

Table 8 Comparative analysis of methodologies: literature review vs our fault classification models.

Full size table

Furthermore, this study advance’s fault classification methodologies by demonstrating the effectiveness of selective model integration and ensemble learning techniques tailored to the characteristics of power system data. The RF-LSTM Stacked Tuned KNN model is established as a new benchmark for fault classification accuracy, emphasizing its practical applicability in real-world scenarios. To evaluate the robustness of this ensemble model for practical applications, it is trained on real-time simulation data (RTD), achieving a remarkable accuracy of 99.96% and a cross-validation score of 99.68%, as shown by confusion matrix and ROC curve in Fig. 14. This ensemble model demonstrated a training time of 14 min and utilized 1.3 GB of memory, highlighting its efficiency. In conclusion, the strategic combination of RF and LSTM models proves instrumental in achieving superior fault classification performance across binary and multi-label tasks. Through the application of ensemble learning, accuracy is enhanced while operational efficiency is ensured, presenting a promising solution for reliable power system fault detection and classification.

Conclusion

This study has demonstrated the transformative impact of machine learning on fault detection in transmission lines, which is essential for enhancing the reliability and efficiency of power grids. The performance of individual models, such as Random Forest (RF), Long Short-Term Memory (LSTM), and K-Nearest Neighbors (KNN), as well as innovative ensemble techniques, was rigorously evaluated to identify optimal strategies for both binary and multi-label fault classification tasks. In binary classification, simpler models like KNN achieved exceptional accuracy, reaching 99.75%, while RF demonstrated strong performance with an accuracy of 99.72%. In multi-label classification, the RF-LSTM Stack Tune KNN model excelled, delivering a remarkable accuracy of 99.93%. This approach demonstrated the effectiveness of combining RF and LSTM models to handle complex datasets and temporal dependencies, achieving higher performance than models discussed in prior research. The use of ROC curves and confusion matrices further validated the robustness of the models, highlighting their ability to accurately distinguish fault classes and offering deep insights into model performance. In conclusion, this research advance’s fault classification methodologies by demonstrating the benefits of selective model integration and ensemble learning, specifically tailored to the characteristics of power system data. The strategic combination of RF and LSTM models has proven to be a practical and effective solution for achieving superior fault classification accuracy and operational efficiency.

However, there are limitations to consider ensemble models such as RF-LSTM Stack Tune KNN require large training datasets and longer training times, which can increase computational demands. This can be a challenge in real-world applications where real-time fault detection is critical, particularly in large-scale power systems. Additionally, the complexity of ensemble methods may limit their accessibility for systems with constrained computational resources. Despite these limitations, the findings present a promising path forward for enhancing grid reliability and operational efficiency through advanced machine learning techniques.

Data availability

The datasets used and/or analysed during the current study available from the corresponding author on reasonable request.

References

Dabbaghjamanesh, M. et al. Guest Editorial: advanced energy internet applications in industrial power and energy systems. IEEE Trans. Industr. Inf. 18(8), 5658–5661 (2022).
Article MATH Google Scholar
Yadav, A. & Swetapadma, A. A single ended directional fault section identifier and fault locator for double circuit transmission lines using combined wavelet and ANN approach. Int. J. Electr. Power Energy Syst. 69, 27–33 (2015).
Article Google Scholar
Lu, D. et al. Time-domain transmission line fault location method with full consideration of distributed parameters and line asymmetry. IEEE Trans. Power Delivery 35(6), 2651–2662 (2020).
Article MATH Google Scholar
Tang, Q., & H. Jung, Reliable anomaly detection and localization system: Implications on manufacturing industry. IEEE Access, (2023).
Jiang, Y., W. Wang, & C. Zhao. A machine vision-based realtime anomaly detection method for industrial products using deep learning. in 2019 Chinese Automation Congress (CAC). IEEE. (2019).
Feng, R. et al. Efficient training method for memristor-based array using 1T1M synapse. IEEE Trans. Circu. Syst. II Expr. Briefs 70(7), 2410–2414 (2023).
MATH Google Scholar
Yousaf, M. Z. et al. A novel DC fault protection scheme based on intelligent network for meshed DC grids. Int. J. Electr. Power Energy Syst. 154, 109423 (2023).
Article MATH Google Scholar
Fenton, W. G., McGinnity, T. M. & Maguire, L. P. Fault diagnosis of electronic systems using intelligent techniques: A review. IEEE Trans. Systs. Man Cybernet, Part C (Appl. Rev.) 31(3), 269–281 (2001).
Article MATH Google Scholar
Cai, Z. et al. Digital twin modeling for hydropower system based on radio frequency identification data collection. Electronics 13(13), 2576 (2024).
Article MATH Google Scholar
Xu, Y. et al. Artificial intelligence: A powerful paradigm for scientific research. The Innovation, 2021. 2(4).
Feng, R. et al. Nonintrusive load disaggregation for residential users based on alternating optimization and downsampling. IEEE Trans. Instrument. Measur. 70, 1–12 (2021).
MATH Google Scholar
Davlyatov, S. Artificial intelligence techniques: Smart way to smart grid. in 2023 International Conference on Artificial Intelligence and Smart Communication (AISC). 2023. IEEE.
Zain Yousaf, M. et al. Primary and backup fault detection techniques for multi-terminal HVdc systems: A review. IET Generat., Trans. Distribut. 14(22), 5261–5276 (2020).
Article MATH Google Scholar
Yousaf, M. Z. et al. Intelligent sensors for dc fault location scheme based on optimized intelligent architecture for HVdc systems. Sensors 22(24), 9936 (2022).
Article ADS PubMed PubMed Central MATH Google Scholar
Ngu, E. & Ramar, K. A combined impedance and traveling wave based fault location method for multi-terminal transmission lines. Int. J. Electr. Power Energy Syst. 33(10), 1767–1775 (2011).
Article MATH Google Scholar
Chavan, J.N., A.A. Kale, & S.R. Deore. Transmission Line Fault Detection Using Wavelet Transform & ANN Approach. in 2022 IEEE Integrated STEM Education Conference (ISEC). IEEE. (2022).
Yousaf, M. Z. et al. Deep learning-based robust dc fault protection scheme for meshed HVdc grids. CSEE J. Power Energy Syst. 9(6), 2423–2434 (2022).
MATH Google Scholar
Li, Z. et al. A power system disturbance classification method robust to PMU data quality issues. IEEE Trans. Ind. Informat. 18(1), 130–142 (2021).
Article MATH Google Scholar
Tikariha, A., et al. Fault classification in an IEEE 30 bus system using convolutional neural network. in 2021 4th International Conference on Recent Developments in Control, Automation & Power Engineering (RDCAPE). 2021. IEEE.
Gilanifar, M. et al. Fault classification in power distribution systems based on limited labeled data using multi-task latent structure learning. Sustain. Cities Soci. 73, 103094 (2021).
Article MATH Google Scholar
Taha, I. B. & Mansour, D. Novel power transformer fault diagnosis using optimized machine learning methods. Intell. Automat. Soft Comput. 28(3), 739–752 (2021).
Article MATH Google Scholar
Srikanth, P. & Koley, C. A novel three-dimensional deep learning algorithm for classification of power system faults. Comput. Electr. Eng. 91, 107100 (2021).
Article MATH Google Scholar
Wadi, M., & W. Elmasry. An anomaly-based technique for fault detection in power system networks. in 2021 International Conference on Electric Power Engineering–Palestine (ICEPE-P). IEEE. 2021
Fahim, S. R. et al. A deep learning based intelligent approach in detection and classification of transmission line faults. Int. J. Electr. Power Energy Syst. 133, 107102 (2021).
Article MATH Google Scholar
Machlev, R. et al. Open source dataset generator for power quality disturbances with deep-learning reference classifiers. Electr. Power Syst. Res. 195, 107152 (2021).
Article MATH Google Scholar
Rafique, F., Fu, L. & Mai, R. End to end machine learning for fault detection and classification in power transmission lines. Electr. Power Syst. Res. 199, 107430 (2021).
Article Google Scholar
Thomas, J. B. et al. CNN-based transformer model for fault detection in power system networks. IEEE Trans. Instrument. Measur. 72, 1–10 (2023).
Article MATH Google Scholar
Rizeakos, V. et al. Deep learning-based application for fault location identification and type classification in active distribution grids. Appl. Energy 338, 120932 (2023).
Article MATH Google Scholar
Khan, W. et al. Rotor angle stability of a microgrid generator through polynomial approximation based on RFID data collection and deep learning. Sci. Rep. 14(1), 28342 (2024).
Article CAS PubMed PubMed Central Google Scholar
Yousaf, M. Z. et al. Bayesian-optimized LSTM-DWT approach for reliable fault detection in MMC-based HVDC systems. Sci. Rep. 14(1), 17968 (2024).
Article CAS PubMed PubMed Central MATH Google Scholar
Bin Akter, S. et al. Ensemble learning based transmission line fault classification using phasor measurement unit (PMU) data with explainable AI (XAI). Plos one 19(2), e0295144 (2024).
Article CAS PubMed PubMed Central Google Scholar
Hossain, M. A. & Islam, M. S. A novel hybrid feature selection and ensemble-based machine learning approach for botnet detection. Sci. Rep. 13(1), 21207 (2023).
Article ADS CAS PubMed PubMed Central MATH Google Scholar
Tong, A., Zhang, J. & Xie, L. Intelligent fault diagnosis of rolling bearing based on Gramian angular difference field and improved dual attention residual network. Sensors 24(7), 2156 (2024).
Article ADS PubMed PubMed Central MATH Google Scholar
Ma, K., Yang, J. & Liu, P. Relaying-assisted communications for demand response in smart grid: Cost modeling, game strategies, and algorithms. IEEE J. Select. Areas Commun. 38(1), 48–60 (2019).
Article CAS MATH Google Scholar
Hang, J., Wang, X., Li, W. & Ding, S. Interturn short-circuit fault diagnosis and fault-tolerant control of DTP-PMSM based on subspace current residuals. IEEE Trans. Power Electr. 40(2), 3395–3404 (2024).
Article MATH Google Scholar
Hang, J., Qiu, G., Hao, M. & Ding, S. Improved fault diagnosis method for permanent magnet synchronous machine system based on lightweight multi-source information data layer fusion. IEEE Trans. Power Electr. 39(10), 13808–13817 (2024).
Article MATH Google Scholar
Ni, L. et al. An explainable neural network integrating Jiles-Atherton and nonlinear auto-regressive exogenous models for modeling universal hysteresis. Eng. Appl. Artif. Intell. 1(136), 108904 (2024).
Article MATH Google Scholar

Download references

Author information

Tahir Anwar and Muhammad Zain Yousaf contribute equal in this paper.

Authors and Affiliations

School of Electrical and Information Engineering, Tianjin University, Tianjin, 300072, China
Tahir Anwar, Chaoxu Mu & Wajid Khan
Center for Renewable Energy and Microgrids, Huanjiang Laboratory, Zhejiang University, Zhuji, 311816, Zhejiang, China
Muhammad Zain Yousaf
School of Electrical and Information Engineering, Hubei University of Automotive Technology, Shiyan, 442002, China
Muhammad Zain Yousaf
School of Electrical Engineering, The University of Lahore, Lahore, Pakistan
Saqib Khalid
Hourani Center for Applied Scientific Research, Al-Ahliyya Amman University, Amman, Jordan
Ahmad O. Hourani
Department of Theoretical Electrical Engineering and Diagnostics of Electrical Equipment, Institute of Electrodynamics, National Academy of Sciences of Ukraine, Beresteyskiy, 56, Kyiv-57, Kyiv, 03680, Ukraine
Ievgen Zaitsev
Center for Information-Analytical and Technical Support of Nuclear Power Facilities Monitoring, National Academy of Sciences of Ukraine, Akademika Palladina Avenue, 34-A, Kyiv, Ukraine
Ievgen Zaitsev

Authors

Tahir Anwar
View author publications
You can also search for this author inPubMed Google Scholar
Chaoxu Mu
View author publications
You can also search for this author inPubMed Google Scholar
Muhammad Zain Yousaf
View author publications
You can also search for this author inPubMed Google Scholar
Wajid Khan
View author publications
You can also search for this author inPubMed Google Scholar
Saqib Khalid
View author publications
You can also search for this author inPubMed Google Scholar
Ahmad O. Hourani
View author publications
You can also search for this author inPubMed Google Scholar
Ievgen Zaitsev
View author publications
You can also search for this author inPubMed Google Scholar

Contributions

Tahir Anwar, Chaoxu Mu, Wajid Khan: Conceptualization, Methodology, Software, Visualization, Investigation, Writing—Original draft preparation. Muhammad Zain Yousaf: Data curation, Validation, Supervision, Resources, Writing—Review & Editing. Saqib Khalid, Ahmad O. Hourani, Ievgen Zaitsev: Project administration, Supervision, Resources, Writing—Review & Editing.

Corresponding authors

Correspondence to Muhammad Zain Yousaf or Ievgen Zaitsev.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Anwar, T., Mu, C., Yousaf, M.Z. et al. Robust fault detection and classification in power transmission lines via ensemble machine learning models. Sci Rep 15, 2549 (2025). https://doi.org/10.1038/s41598-025-86554-2

Download citation

Received: 02 November 2024
Accepted: 13 January 2025
Published: 20 January 2025
DOI: https://doi.org/10.1038/s41598-025-86554-2