Skip to main content

Bearing fault diagnosis based on enhanced Canberra distance feature in SDP image

Abstract

Feature enhancement is important in mechanical equipment fault diagnosis. A limited set of characteristic parameters is insufficient for diagnosing bearing signals with multiple fault types. The presence of noise increases the difficulty of extracting fault features from images. To address the challenge of diagnosing rolling bearing faults in complex environments, this study presents an enhanced weighted image fusion framework aimed at enhancing fault features within the images, which enables accurate diagnosis of bearing faults using a limited number of features. The proposed method encompasses four distinct stages. In the first stage, a symmetrized dot pattern method is employed to transform one-dimensional time-series data into two-dimensional images, visualizing the signal in a 2D format. In the second stage, image binarization and an improved weighted fusion method are utilized to simplify subsequent processing and enhance the image features. The third stage involves extracting the image’s contrast and maximum singular value to improve the Canberra distance calculation. Finally, the enhanced Canberra distance is used for classifying bearing faults. Performance testing of the image feature enhancement is conducted on various datasets containing rolling bearings. Comparative experiments with alternative enhancement methods demonstrate the superiority of the proposed improved weighted image fusion framework. Comparative experiments with the original Canberra distance validate the effectiveness of the enhanced Canberra distance. Additionally, experiments conducted in noisy environments confirm the robustness of the proposed approach. Furthermore, the image feature enhancement method is applied to other bearing datasets, and the experimental results demonstrate its effectiveness in enhancing fault feature representation and achieving accurate diagnosis of rolling bearings.

Introduction

The smooth operation of mechanical equipment is essential for ensuring efficient industrial production. The health status of mechanical equipment can be determined through diagnostic techniques. Analyzing the diagnostic results can help reduce the occurrence of faults. Analyzing the health of equipment enables planned maintenance and prevents failures from occurring in advance [1].

Fault diagnosis of mechanical equipment primarily involves analyzing the collected signals [2]. Early signal processing methods focus on time domain analysis, frequency domain analysis, and time–frequency domain analysis [3]. Time domain analysis enables direct processing of the collected time series [4]. Frequency domain analysis involves separating different frequencies of signals and analyzing the frequency components of interest in detail. It is particularly suitable for processing nonstationary signals [5]. For instance, the empirical wavelet transform (EWT) based on the Fourier transform can be employed to construct adaptive wavelets. This decomposition method is suitable for analyzing nonstationary signals and allows for the adaptive decomposition of signals [6]. In [7], empirical mode decomposition (EMD) and its improved versions are discussed. The use of signal decomposition methods enables better processing and analysis of signals. In fault diagnosis, it is possible to directly extract features from signals. Differentiation of various data sets is achieved through the differences in these features [8]. Additionally, image-formatted data is better suited for novel intelligent methods [9]. For instance, machine learning and deep learning have found broad application in signal analysis [10]. In [11], 96 features were extracted from color fundus images. Accurate diagnosis of diabetic retinopathy is achieved through the established artificial intelligence classification model. This reference provides an artificial intelligence model with low computational complexity and high classification accuracy. The model effectively addresses nonlinear dynamical problems.

Various techniques can be employed to enhance images, such as histogram equalization, contrast enhancement, texture enhancement, and image filtering. It is important to adopt appropriate processing methods for different types of images. Image enhancement in mechanical fault diagnosis aims to emphasize fault features while suppressing irrelevant ones. Bai et al. [12] utilized the channel selection method to enhance performance for extracting the features of interest in the image. On one hand, this method can decrease computation requirements and mitigate the impact of noise [13]. Kim et al. [14] used a multi-scale convolution filter to enhance fault features. This processing method facilitates the diagnosis of gearbox faults. Wu et al. [15] applied an adaptive logarithmic normalization method to enhance the quality. This method enhances image details and yields favorable results in rolling bearing tests. Jablonski et al. [16] aimed to obtain a clearer spectrum image, thus employing STFT for automatic analysis of image conversion parameters. This method can make the spectrum image more clear. Enhancing image definition can result in more prominent feature components of the signal. Wodecki et al. [17] employed a nonnegative matrix decomposition method to differentiate between the image’s features and noise. This image enhancement method can mitigate the impact of noise. It enables the filtration of specific noise components for fault diagnosis in rotating machinery. Reinhold et al. [18] applied the redistribution method to process the energy distribution of the image’s time–frequency spectrum. This image enhancement method enhances the spectrum’s robustness against noise. Therefore, image enhancement methods play a significant role in using images to diagnose mechanical equipment.

There are several widely used methods for signal transformation of image in the fault diagnosis of mechanical equipment, including recursive images, scalogram, fast kurtogram, Gramian angular field, and Markov transition field. Consequently, image enhancement methods play a significant role. Enhanced images can better highlight signal characteristics, thus providing an improved basis for fault classification. Peng et al. [19] developed an adaptive image segmentation algorithm that effectively mitigates distortion in two-dimensional images obtained from scanning imaging. Long et al. [20] employed the scale invariant feature transform to extract features from the symmetrized dot pattern. Wang et al. [21] converted signals into images using short-time Fourier transform (STFT). They then used a convolutional neural network for fault classification. Tang et al. [22] employed the Gramian angular summary field to convert rotational signals into diagnostic images. This method exhibits excellent performance in processing noisy signals. Li et al. [23] utilized a weighted horizontal visibility map to convert signals into images, specifically focusing on image edge processing. The advantage of this image enhancement method lies in its ability to attenuate the influence of noise. Fang et al. [24] employed a data conversion image method for fault diagnosis. Additionally, they proposed a spatial attention mechanism for adjusting image outputs. This image enhancement method exhibits robust anti-interference capabilities. Wang et al. [25] introduced a temporal-spatial graph method for converting signals to images. They utilized the graph-mapped spectrum for diagnosing bearing faults. This method enhances the frequency features of the signal and is applicable for diagnosing bearing faults. In conclusion, image processing methods are extensively employed for fault diagnosis of mechanical equipment [26]. Therefore, image processing and enhancement methods are also a very important research direction.

The vibration signal of rolling bearings is a nonstationary signal. Traditional time-domain and frequency-domain analysis methods cannot intelligently extract fault features. Image-based fault diagnosis methods can achieve intelligent fault diagnosis. However, the symmetrized dot pattern method, which converts one-dimensional signals into two-dimensional images, increases data dimensionality. Images contain a lot of redundant information, so enhancing fault features in the image can improve the effectiveness of fault diagnosis. In practical industrial production, signals are often influenced by noise. Studying the enhancement of fault features through image denoising is meaningful. It improves fault classification effectiveness. Therefore, the purpose of this paper is to achieve accurate fault classification through feature enhancement of images.

In this paper, a new method of image enhancement is proposed to process the synchronized dot pattern after signal conversion. The method of image enhancement is used to diagnose the fault on the bearing test bed. The method of image enhancement is also verified under different noise signals. The rest of this paper is arranged as follows. In the “Methods/experimental” section, methods for image enhancement are introduced. In the “Methods” section, relevant experiments were designed, and the experimental results were analyzed. In the “Results and discussion” section, comparative experiments were designed to verify the effectiveness of this method. In the “Limitations and implications of research findings in the current research context” section, the potential challenges of SDP images are discussed. The conclusion is given in the “Conclusion” section.

Methods/experimental

Aim

This study aims to classify various types of bearing faults using the vibration signals of rolling bearings, employing techniques such as image conversion and image enhancement. Specifically, the research focuses on addressing the classification problem of bearing fault types under noisy conditions.

Design and setting

This paper presents the development of the SDP image feature enhancement method and the enhancement of Canberra features for fault detection. As illustrated in Fig. 1, the method is primarily divided into two parts: image enhancement and feature extraction. The improved grayscale-weighted average method and the utilization of maximum singular value and contrast play vital roles in enhancing the Canberra distance.

Fig. 1
figure 1

Design of the flowchart for the proposed method

The research methodology in this paper is divided into three parts. The specific process is described as follows: In the first part, a method for image feature enhancement is proposed. In the second part, an improved method for extracting Canberra distance features is presented. In the third part, a workflow for fault diagnosis of rolling bearings is proposed. The diagnostic effectiveness of this method is validated through experiments.

In the image enhancement phase, data from each working condition is selected separately. Random selection of data with a length of 6000 is performed. The selected data is divided into 10 groups. Each group has an average of 600 data points. The 10 groups of data are first converted into SDP images. These SDP images are then transformed into grayscale images. Then, an enhanced image is obtained using the improved grayscale-weighted averaging method. The purpose of this process is to highlight fault features.

In the feature extraction part, the first step is to calculate the maximum singular value and contrast of the enhanced image. Then, the Canberra distance between the 10 grayscale images and the enhanced image is calculated. Use the three obtained features to obtain improved Canberra distance features. Finally, utilize these features to achieve fault classification for rolling bearings.

In the experiment, the fault types and data are first randomly selected. Then, the data undergoes the same image enhancement and feature extraction processes. By comparing the obtained features with the known working condition features, the diagnosis of bearing fault types is achieved.

Participants or materials

The rolling bearing dataset used in this study comes from the bearing data center at Case Western Reserve University (CWRU). The experimental device is depicted in Fig. 2. The device is capable of inducing bearing faults under varying loads and fault diameters. Its primary working conditions involve inner ring, outer ring, and ball faults. The dataset is further divided into single-point defects of varying sizes. At different sampling frequencies, the original time-domain vibration signal is formed.

Fig. 2
figure 2

Experimental platform

This data set collects time-series data of drive end, fan end, and base. The time series of the driving end is used in this experiment. The specifications of the bearing used are shown in Table 1. All the data are at 12-kHz sampling frequency. The total length of each data is 120,000. Each group of data is divided into two parts. The data length of each part is 60,000. One part of the data is used for feature extraction. The other part of the data is used for experimental validation.

Table 1 Types of rolling bearings

Processes and methodologies

This study utilizes variational mode decomposition and an enhanced weighted average method to enhance the fault features in symmetrized dot pattern (SDP). The diagnostic accuracy is considered as the evaluation metric. This method is employed to achieve accurate classification of rolling bearing faults in a noisy environment and to exploit the fault data in the images. To achieve effective diagnosis, four processing methods are employed: variational mode decomposition (VMD), symmetrized dot pattern, an improved weighted average method, and improved Canberra distance feature. Firstly, the signal is denoised using variational mode decomposition. Then, the denoised signal is transformed into an image using the SDP method. The enhanced weighted average method is utilized to enhance the fault features in the image for fault data mining. Finally, the improved Canberra distance feature is used to extract image features for fault diagnosis.

Ethics approval and consent

This study does not involve human participants, data, or tissue, nor does it involve animals. Therefore, ethics approval and consent are not applicable.

Statistical analysis

Statistical analysis was not conducted in this study. Ethics approval and consent are not applicable, and statistical analysis is not performed.

Methods

Variational mode decomposition

Variational mode decomposition can adaptively decompose the original signal, which can be transformed into the corresponding variational problem construction and solution [27]. It is mainly divided into the following three steps. In Hilbert transform to find the analytical signal, by multiplying with \({e}^{-j{\omega }_{k}t}\), the formula is as follows:

$$\begin{array}{c}(\delta (t)+\frac{j}{\pi t})*{v}_{k}(t)\end{array}$$
(1)

where.

\(\delta (t)\) = Dirac function.

\(\gamma (t)\) = The distribution function.

\({v}_{k}(t)\) = The kth mode component.

The analysis signal of \({v}_{k}(t)\) is calculated by Hilbert transformation [28]. By multiplying with the operator \({e}^{-j{\omega }_{k}t}\), the central band of \({v}_{k}(t)\) is modulated to the corresponding baseband, as shown in Eq. 2. Estimate the bandwidth of the signal, that is, calculate the square norm L2 of the demodulation gradient, as in Eq. 3.

$$\begin{array}{c}[(\delta (t)+\frac{j}{\pi t})*{v}_{k}(t)]{e}^{-j{\omega }_{k}t}\end{array}$$
(2)
$$\left\{\begin{array}{c}\underset{\{{v}_{k}\}\{{w}_{k}\}}{min}\{\sum_{k}{||{\partial }_{t}[(\delta (t)+\frac{j}{\pi t})*{v}_{k}(t)]{e}^{-j{\omega }_{k}t}||}^{2}\}\\ s.t.\sum_{k}{v}_{k}=s\end{array}\right.$$
(3)

where.

\({\omega }_{k}\) = The center frequency of each component signal.

  = Convolution operation.

\(f\) = The input signal.

The VMD method adaptively decomposes the signal by searching for the optimal solution of the aforementioned constrained variational model. During the iterative solving process, the central frequency and bandwidth of each component are gradually updated, and ultimately, the IMF components are adaptively partitioned based on the signal’s own frequency domain characteristics. When solving this model, it is necessary to introduce the quadratic penalty factor \(\alpha\) and the Lagrange operator \(\lambda (t)\). This transformation transforms the above constrained variational problem into the following unconstrained variational problem:

$$\begin{array}{cc}& L(\{{v}_{k}\},\{{\omega }_{k}\},\lambda )=\\ & \langle \lambda (t),f(t)-\sum_{k}{v}_{k}(t)\rangle +{\parallel f(t)-\sum_{k}{v}_{k}(t)\parallel }_{2}^{2}+\alpha \sum_{k}{\parallel {\partial }_{t}(\sigma (t)+\frac{j}{\pi t}){v}_{k}(t){e}^{-j{\omega }_{k}t})\parallel }_{2}^{2}\end{array}$$
(4)

where.

\(\sum_{k}{v}_{k}(t)\) = The sum of IMF components.

\(\alpha\) = Quadratic penalty factor.

\(\lambda (t)\) = Lagrange operator.

By alternately updating \({v}_{k}(t)\), \({\omega }_{k}\), and \(\lambda\), the aim is to find the saddle point in the expression. The updating process of each modal component is as follows:

$$\begin{array}{c}{v}_{k}^{n+1}=\underset{{v}_{k}\in X}{\text{arg}min}\{{\parallel f(t)-\sum_{i}{v}_{i}(t)(t+\frac{\lambda (t)}{2})\parallel }_{2}^{2}+\alpha {\parallel {\partial }_{t}[(\delta (t)+\frac{j}{\pi t}){v}_{k}(t)]{e}^{-j{\omega }_{k}t}\parallel }_{2}^{2}\}\end{array}$$
(5)

Convert the above equation into a frequency domain expression as follows:

$$\begin{array}{c}{\hat{v}}_{k}^{n+1}=\underset{{\hat{v}}_{k},{v}_{k}\in X}{\text{arg}min}\{{\parallel \hat{f}(t)-\sum_{i}{\hat{v}}_{i}(\omega )+\frac{\hat{\lambda }(\omega )}{2}\parallel }_{2}^{2}+\alpha {\parallel j\omega [(1+sgn(\omega +{\omega }_{k})){\hat{v}}_{k}(\omega +{\omega }_{k})]\parallel }_{2}^{2}\}\end{array}$$
(6)

By solving the equation, the optimal solution to the optimization problem can be obtained as follows:

$$\begin{array}{c}{\hat{v}}_{k}^{n+1}(\omega )=\frac{\hat{x}(\omega )-\sum_{i=1,i\ne k}^{K}{\hat{v}}_{i}(\omega )+\frac{\hat{\lambda }(\omega )}{2}}{1+2\alpha (\omega -{\omega }_{k}{)}^{2}}\end{array}$$
(7)

where

\(K\) = the number of decomposition layers.

Principle of symmetrized dot pattern

Symmetrized dot pattern (SDP) is a simple image conversion method. It maintains the relationship between amplitude and frequency of the signal [29]. For discrete signals, this transformation can obtain corresponding points at different positions in polar coordinates. For the sampled discrete signal, this conversion can obtain corresponding points at different positions of polar coordinates. The distribution of points can directly display the features of signals. The schematic diagram of SDP conversion is shown in the Fig. 3.

Fig. 3
figure 3

Schematic diagram of symmetrized dot pattern

For a time series, \({x}_{i}\) represents the \(i\)-th sampling point of the signal, and \({x}_{(i+l)}\) is the sampling point after adjacent time \(l\). According to the SDP conversion principle, sampling points at different times can be converted to polar coordinate space \(P(r(i),\theta (i),\phi (i))\). The radius from the center point can be expressed as follows:

$$\begin{array}{c}r(i)=\frac{{x}_{i}-{x}_{mean}}{{x}_{max}-{x}_{min}}\end{array}$$
(8)

where

\({x}_{i}\) = The \(i\)-th signal value

\({x}_{min}\) = The minimum value in the signal sequence.

\({x}_{max}\) = The maximum value in the signal sequence.

\({x}_{mean}\) = The mean value of the signal sequence.

The anticlockwise rotation angle \(\theta \left(i\right)\) and the clockwise rotation angle \(\phi (i)\) along the initial line are projected by the adjacent points \({x}_{(i+l)}\). The rotation angle can be expressed as follows:

$$\begin{array}{c}\theta (i)=\varphi +\frac{{x}_{i+l}-{x}_{mean}}{{x}_{max}-{x}_{min}}\zeta \end{array}$$
(9)
$$\begin{array}{c}\phi (i)=\varphi -\frac{{x}_{i+l}-{x}_{mean}}{{x}_{max}-{x}_{min}}\zeta \end{array}$$
(10)

where.

\(\zeta\) = The angular gain factor of the plot.

\(\theta (i)\) = The angle of counterclockwise rotation along the initial line.

\(\phi (i)\) = The angle of clockwise rotation along the implementation line

\(\varphi\) = The symmetrical rotation angle of the mirror. \(\varphi =360\times m/n,(m=\text{1,2},\dots ,n)\)

\(l\) = The time interval, usually between 1 and 10.

In this paper, SDP provides a way to convert data into images. This method converts one-dimensional time series into two-dimensional space. The features of the time series are converted into features such as contrast, texture, and shape in the image. This provides a suitable data format for subsequent feature enhancement. Compared with time–frequency analysis and raw signal-based methods, the proposed method has better adaptability to nonstationary signals. The data features are integrated into the features of the image. Image data is more convenient for intelligent extraction and analysis.

Image feature enhancement

The grayscale image is composed of different grayscale values. The value of each pixel is in the range of 0 to 255 [30]. Image graying is mainly to convert color images into grayscale images. The conversion formula is shown in Eq. 7. The grayscale image converts the three-channel color image into a grayscale matrix.

$$\begin{array}{c}Gray(i,j)=a\times R(i,j)+b\times G(i,j)+c\times B(i,j)\end{array}$$
(11)

where.

\(R (i, j)\) = The red channel pixel value.

\(G (i, j)\) = The green channel pixel value.

\(B (i, j)\) = The blue channel pixel value.

\(a\), \(b\), and \(c\) = The weights of pixel values of different channels.

\(Gray (i,j)\) = The grayscale values of the \(i\) th row and \(j\) th column of the grayscale image.

Image fusion is a process that involves processing and synthesizing images. Images are collected from multiple channels to create high-quality images. The purpose of image fusion is to weaken the uninterested content in the image. It also aims to suppress noise and highlight the key information of the image. Image fusion methods are mostly used to process remote-sensing images, infrared images, and medical images. Pixel level fusion is the process of finding the mean of the corresponding pixels in different images. The calculation formula can be expressed as follows:

$$FuImage(i,j)=(\sum_{i=1}^{N}{I}_{i}(i,j))/N$$
(12)

where.

\({I}_{n}(x,y)\) = The \(i\) th input grayscale image.

\(N\) = The number of images.

\(G (x, y)\) = The fused pixel matrix.

The equation combines multiple images \(I(i,j)\) into one image \(FuImage(i,j)\). The pixel values of \(FuImage(i,j)\) are determined by the average pixel values of all images.

This simple grayscale averaging method cannot handle images well. For grayscale SDP images, this paper proposes an improved grayscale-weighted average method. Different from the original method, this method first performs statistics on the gray values of the input images. The number of pixels that meet the condition is denoted as \({N}_{px}\). The pixel values of the enhanced image are determined based on the comparison between \({N}_{px}\) and the threshold \(T\). When \({N}_{px}\) is less than the threshold, the pixel value of the enhanced image is determined by the input image. When \({N}_{px}\) is greater than the threshold, the pixel value is set to 255. The specific mathematical formula is as follows:

$$EnhanceImage (i,j)=\left\{\begin{array}{c}\begin{array}{cc}\frac{(\sum_{i=1}^{N}{\omega }_{n}\times {I}_{n}(i,j))}{(N-{N}_{px})}& {N}_{px}<T\end{array}\\ \begin{array}{cc}255& { N}_{px}\ge T\end{array}\end{array}\right.$$
(13)

where.

\({N}_{px}\) = The number of pixels with a value of 255 corresponding to multiple images

\(T\) = The threshold range between 0 and N

\(\omega\) = The weight value

\(EnhanceImage(i,j)\) = The grayscale values of the \(i\) th row and \(j\) th column of the enhanced image.

Original canberra distance

The distance reflects the similarity between two samples. Manhattan distance calculates the sum of longitudinal and transverse distances. The Canberra distance is normalized based on the Manhattan distance. This processing eliminates the dimensional influence between different distances.

For two \(n\)-dimensional vectors, calculate the distance between the corresponding elements separately. Finally, summing n distance values can obtain the distance between two vectors. The Canberra distance between two vectors is expressed as follows:

$$\begin{array}{c}d(p,q)=\sum_{m=1}^{n}|\frac{{p}_{m}-{q}_{m}}{|{p}_{m}|+|{q}_{m}|}|\end{array}$$
(14)

where.

\(p\) and \(q\) = Respectively vectors of the same length.

\(d(p,q)\) = The Canberra distance between vector \(p\) and vector \(q\)

For two \(n\times n\) matrices, first calculate the distance between the corresponding row vector. Then average the distance of \(n\) row vector to get the distance between the last two images. The Canberra distance between two matrices can be expressed as follows:

$$\begin{array}{c}{d}_{Canberra} =\frac{\sum_{i=1}^{n}d({p}_{i}-{q}_{i})}{n}\end{array}$$
(15)

where.

\({d}_{Canberra}\) = The average value of Canberra distance.

\({p}_{i}\) = The \(i\) th row vector.

\(n\) = The number of rows in the matrix.

Improved canberra distance feature in SDP image

A grayscale image is a matrix. Each element of this matrix has grayscale values between 0 and 255. Each pixel in the image corresponds to a different grayscale value. Therefore, the Canberra distance can be used to extract features between two grayscale images. However, the Canberra distance cannot fully meet the diagnosis of bearing faults. In order to improve the performance of fault diagnosis, an improved Canberra distance is proposed based on the original Canberra distance. Two parameters are mainly used for improvement. These two parameters are singular value and contrast. The improved Canberra distance is shown in Eq. 12.

$$\begin{array}{c}D=({\sigma }_{max}/(k\times {d}_{m}))Con\end{array}$$
(16)

where:

\({\sigma }_{max}\) = The maximum singular value.

\(Con\) = The contrast of an image.

\(D\) = Enhanced Canberra distance.

\(k\) = A positive integer, and its main function, is to adjust the size of features and expand differences in features.

Singular value decomposition (SVD) is used in signal processing, data fusion, data reduction, and image denoising. Its principle is to transform the original complex matrix into three basic matrices. Let the original matrix be \(S\), and the matrix is a real matrix with rank greater than 0. Then the singular value of matrix \(S\) is decomposed into the following:

$$\begin{array}{c}S=U(\begin{array}{cc}\sum & 0\\ 0& 0\end{array}){V}^{H}\end{array}$$
(17)

where.

\(S\) = The real matrix of \(m\times n\) order.

\(U\) is a unitary matrix of order \(m\times m\)

\(V\) = A unitary matrix of order \(n\times n\)

\(V\times{H}\)= The conjugate transposition of\(V\) 

\(\sum =(\begin{array}{ccc}{\sigma }_{1}& \cdots & 0\\ :& \cdots & :\\ 0& \cdots & {\sigma }_{max}\end{array})\), \({\sigma }_{max}\) represents the maximum singular value. \({\sigma }_{1}\ge {\sigma }_{2}\ge {\sigma }_{3}\ge \cdots {\sigma }_{max}>0\).

A grayscale image can also be extracted by singular value decomposition. The singular value of the matrix can represent the feature information in the matrix. Here, the maximum singular value is selected as the parameter to improve the Canberra distance.

The gray-level co-occurrence matrix (GLCM) describes the texture characteristics of an image by analyzing its gray space. Figure 4 is a schematic diagram of converting an image to a gray-level co-occurrence matrix. The gray-level co-occurrence matrix describes the correlation characteristics of gray levels in an image based on the relationship between gray levels of pixels. The principle of the gray-level co-occurrence matrix is to count the number of times a certain pixel appears at a specific angle.

Fig. 4
figure 4

Schematic diagram of gray-level co-occurrence matrix. Geometric representation (a). Original image (b). GLCM (c)

Figure 4a is a geometric representation of the gray-level co-occurrence matrix. \(F(i,j)=a\) indicates that the pixel value in the \(i\) th row and \(j\) th column of the image is \(a\). \(X\) represents the horizontal distance from the pixel value \(a\) to the pixel value \(b\). \(Y\) represents the vertical distance from the pixel value \(a\) to the pixel value \(b\). Figure 4b is a pixel value matrix of the original image. The numerical value in the box represents the grayscale value of the pixel. Figure 4c is a gray-level co-occurrence matrix. The grayscale value of the original image ranges from 0 to 2. Therefore, the horizontal and vertical directions of the gray-level co-occurrence matrix are also within the range of 0 to 2. When \(\alpha ={45}^{\circ },X=1,Y=1\), the gray level co-occurrence matrix conversion process is shown in the figure. The number of occurrences from green pixel value 1 to red pixel value 1 is 1. Therefore, the value corresponding to the yellow position of the gray-level co-occurrence matrix is 1. Different features of an image can be extracted from the gray-level co-occurrence matrix, such as the energy, correlation, and contrast of an image. Because contrast reflects the clarity and depth of the image texture. Here, contrast is used to improve the Canberra distance. The formula for calculating contrast is shown in Eq. 14.

$$\begin{array}{c}CON=\sum_{i}\sum_{j}(i-j{)}^{2}P(i,j)\end{array}$$
(18)

where.

\(P (i, j)\) = The distribution probability of the distance \((i,j)\) between pixel values.

Results and discussion

Based on the fault diagnosis scheme outlined in Fig. 1, this experimental study developed a program for feature enhancement and extraction. Additionally, precision experiments were designed for verification. During the image enhancement stage, the original signal is processed to enhance fault features and reduce noise interference. Variational mode decomposition (VMD) is employed to decompose the nonstationary bearing signals from each working condition. In Fig. 5, the original signals from 10 working conditions are decomposed into two components. The blue line represents the time series of the original signal. The red line represents the time series of the first intrinsic mode function (IMF1) after VMD, and the yellow line represents the time series of the second mode function (IMF2). Through the figure, we can observe that the first intrinsic mode function (IMF 1) retains the characteristic information of the original signal. Compared to the IMF 2 signal, the IMF 1 signal exhibits more pronounced features. Consequently, the first component is selected as the decomposed data.

Fig. 5
figure 5

Time domain images of components after VMD under different fault conditions

According to the SDP transformation method, the IMF1 signal is converted into an image format. The signals and images of different fault types are shown in Fig. 6. Due to the VMD processing, the features of the signal are more apparent. These features can be observed more intuitively through the SDP image.

Fig. 6
figure 6

Time domain image and SDP image under different working conditions

The size of each SDP image is \(750\times 750\). When the signal is converted into an image, the features of the vibration signal become the shape, texture, and thickness of the image. The images of different types of signals are different. For instance, the normal working condition displays a smoother fan flap shape, while inner ring fault 3 exhibits a slenderer shape. Outer ring fault 2 appears in a discrete state, while ball fault 3 is more concentrated. However, due to nearly identical original signals for ball fault 1 and ball fault 2, the resulting converted images also bear strong resemblance. Different types of faults can be classified through the analysis of images. However, it is still difficult to extract the features of weak faults.

In order to enhance the weak fault features in the signal, this paper proposes an image feature enhancement method. Firstly, the time series decomposed by variational mode decomposition are uniformly divided into 10 groups. The SDP transformation method is used to plot the 10 groups of data for each operating condition into images. Since the color of the images does not highlight the features of the signal, it is necessary to process the images into grayscale. The grayscale images for each operating condition are shown in Fig. 7. Each row in the figure represents one operating condition, and each operating condition contains 10 images.

Fig. 7
figure 7

Gray SDP images under different working conditions after grouping

From Fig. 7, we can observe that there are differences in the SDP images under different operating conditions. This indicates that the SDP images can reflect the fault features of the signal. However, there are also varying degrees of differences between the images under the same operating condition. This indicates that the transformed images of the signal are not stable. Additionally, random points generated by noise in the images also affect the stability of the images. The instability of the images can lead to errors in fault diagnosis. To solve this problem and obtain stable images, we propose an improved weighted averaging method for image processing.

The schematic diagram of this improved SDP image-weighted averaging method is shown in Fig. 8. It is a pixel-level image fusion algorithm. The algorithm can be divided into four parts: pixel value acquisition, pixel value judgment, counting pixel values, and pixel synthesis. The detailed explanations for these four parts are as follows:

Fig. 8
figure 8

Flow chart of improved grayscale-weighted average method

In the pixel value acquisition stage, it is necessary to traverse the pixel values of each row and column of 10 images. In the pixel value judgment stage, the pixel values of the same row and column of 10 images are compared. It determines whether the pixel value is equal to 255. In the counting pixel values stage, the number of times that the pixel value is not equal to 255 is counted. The count can range from 0 to 10. In the pixel synthesis stage, the relationship between the count and the threshold value is determined. When the count is less than the threshold value, the enhanced pixel value of the image is obtained through weighted averaging. When the count is greater than or equal to the threshold value, the enhanced pixel value of the image is set to 255.

By using the above image feature enhancement method, random points in the image can be eliminated. And the fault feature information in the 10 images is preserved. The enhanced images have more obvious feature information compared to the original SDP images. In Fig. 9, SDP images under different thresholds are listed. In the figure, each column represents an enhanced image under a different threshold. By observing, it can be noticed that a larger threshold results in clearer image contours. A smaller threshold preserves less information in the image. Different operating conditions exhibit different behaviors under different thresholds. For example, the image under normal conditions is barely visible when the threshold is less than 2. This indicates that a threshold that is too small does not effectively reflect the signal features. When the threshold is set to 10, the images between different operating conditions have a high degree of similarity. This is not conducive to fault classification.

Fig. 9
figure 9

Enhanced images with different thresholds under various conditions

In order to determine the optimal image enhancement effect at different threshold values, we extract and analyze the features under each threshold condition. After conducting an accuracy test, the results are presented in Fig. 10. The height of the histogram represents the average accuracy under different threshold conditions. Each blue dot in the figure corresponds to an accuracy test result. The top horizontal line represents the highest accuracy achieved in the test set. The bottom horizontal line represents the lowest accuracy. The figure shows that the accuracy rate reaches its peak at a threshold value of 7. We believe that the reason for this result is that the fault characteristics are most prominent at this threshold. When the threshold is low, too much information in the image is eliminated. On the other hand, when the threshold is high, excessive noise information is retained. According to Eq. 13, the threshold \(T\) is set to 7. When the number Npx of pixels with a value of 255 in the original SDP image is between 7 and 10 (inclusive), the pixel value of the enhanced image is set to 255. When the number is less than 7, the average of the grayscale values of pixels that are not 255 is taken from 10 images. During the averaging process, white pixel values are not included. This image fusion method effectively reduces the influence of random points and enhances the visibility of features, particularly when the threshold is set to 7.

Fig. 10
figure 10

Accuracy of diagnosis with different thresholds

In the feature extraction stage, it is essential to extract the enhanced Canberra distance from the image. This feature involves calculating the Canberra distance, maximum singular value, and contrast of the image. In this paper, for the feature extraction method of SDP images, the Canberra distance measures the similarity between the original grayscale image and the enhanced image.

The Canberra distance is calculated between 10 sets of SDP images and enhanced images under the same working conditions. The mean of these 10 Canberra distances is taken as the feature. Table 2 presents the 10 repetitions of the Canberra distance calculated for each working condition. The table shows that the Canberra distance varies within a certain range under different working conditions. Specifically, the original Canberra distance of outer ring fault 2 is the smallest. Under normal status, the value of the original Canberra distance is the maximum. And it does not overlap with other working conditions. Based on Fig. 9 and Table 2, this result indicates that as the area of the pattern decreases, the Canberra distance of the image also decreases. The results also indicate that Canberra distance can achieve classification of fault types to a certain extent. However, the maximum value of ball fault 1 falls within the range of ball fault 2. This single feature fails to adequately classify the working conditions. Consequently, the original Canberra distance is enhanced by incorporating the maximum singular value and contrast.

Table 2 The original Canberra distance between the original image and the enhanced image

According to Eq. 16, we first need to calculate the ratio between the maximum singular value and the Canberra distance. We use the singular value decomposition method to decompose the enhanced image. This process generates multiple singular values. The maximum singular value is an important feature of a data matrix. Therefore, among these multiple singular values, we select the maximum singular value as the feature of the enhanced image. Then, the ratio of the maximum singular value to the Canberra distance is calculated. At the same time, it is necessary to set a threshold \(k\). When the threshold is set to 10, the ratio stays at a suitable size. Ten sets of each operating condition are randomly selected from the dataset, resulting in Table 3. In the table, we calculate the average maximum and minimum values of the 10 sets of features. From the table, we can discover that different working conditions have different ranges of features. This can provide reference standards for classification in the fault diagnosis stage.

Table 3 The ratio of maximum singular value to Canberra distance for ten operating conditions

The final stage of feature extraction involves calculating the contrast of the enhanced image. Contrast represents the texture and brightness characteristics of an image. For each working condition, calculate the contrast of the enhanced image using the principle of gray-level co-occurrence matrix and Eq. 13. Ten sets of contrast data were calculated for each operating condition, as shown in Table 4. From the results in the table, we can see that the contrast of normal status is the smallest, while the contrast of outer ring fault 2 is the highest. This result can be attributed to the smaller pattern of normal status, resulting in lower contrast. The pattern of outer ring fault 2 is larger and darker, resulting in higher contrast.

Table 4 Contrast of enhanced images for 10 operating conditions

The integration of three features according to Eq. 11 yields an enhanced Canberra distance. Figure 11 illustrates a scatter diagram representing three features in a three-dimensional space. The x-axis corresponds to the Canberra distance. The y-axis corresponds to the maximum singular value. The z-axis represents the contrast of the image. Each working condition is visually represented by distinct colors in the three-dimensional space. In the \(xy\) plane, it is evident that the Canberra distance is inversely proportional to the maximum singular value. Hence, dividing the maximum singular value by the Canberra distance reduces the feature’s order of magnitude and dimension. The figure indicates that the working conditions represented by different colors are concentrated in specific areas. Thus, the diagnosis of various working conditions can be achieved by demarcating specific areas for each condition.

Fig. 11
figure 11

The representation of Canberra distance, maximum singular value, and contrast in three-dimensional space

During the fault diagnosis stage, upper and lower limits are established for the characteristics of each operating condition. The type of malfunction is determined based on whether the improved Canberra distance falls within the predefined range. In the feature extraction section, 10 sets of improved Canberra distances can be generated for each working condition. The maximum and minimum values of the ratio of the maximum singular value to the Canberra distance can be obtained from Table 3. Similarly, Table 4 provides the maximum and minimum values of contrast. These feature ranges are organized in Table 5. The table represents the characteristic range of each fault type. Due to different images, the feature range of different fault types also varies. The image differences are converted into numerical differences for quantification. Therefore, by judging the characteristic values, the diagnosis of fault types can be achieved. These feature ranges will be used during the fault diagnosis phase. Firstly, it is necessary to perform image feature enhancement on signals of unknown fault types to enhance fault features. Then, the improved Canberra distance is obtained through feature extraction. Finally, the fault type is determined by comparing it with the feature range in Table 5.

Table 5 Contrast of enhanced images for 10 operating conditions

To assess the efficacy of this method, we conducted a series of accuracy experiments. In the random experiments, we selected the latter half of the dataset. Additionally, we randomly selected fault types to validate the classification performance of our method in this paper. When a feature falls within the range of Table 6, we check whether the diagnosed fault type matches the actual type. The number of accurate judgments and the total number of judgments are recorded to calculate the overall test accuracy. Table 6 presents the accuracy rates from 10 tests conducted after image enhancement, each consisting of 500 judgments. The highest accuracy rate among the 10 test groups is 99.60%, with an average accuracy rate exceeding 99%. The experimental results show that this method can classify various fault types effectively.

Table 6 Accuracy after image enhancement

Comparative experiment 1: With original Canberra distance

In order to verify the effectiveness of feature diagnosis, the original Canberra distance is compared with the improved accuracy. The original Canberra distance only uses one indicator of the original Canberra distance to test the accuracy. Run 10 times to obtain the accuracy of each time as shown in Table 7. The average accuracy of the original Canberra distance is 92.02%, the maximum accuracy is 92.79%, and the minimum accuracy is 91.18%. The average accuracy is 7.04% lower than that in this paper. However, only Canberra distance is used in the enhanced image, which also achieves more than 90% accuracy.

Table 7 Accuracy after image enhancement

The results in Table 7 indicate that the original Canberra distance has a certain degree of classification performance. The accuracy can be improved by using the maximum singular value and contrast. Compared to the original Canberra distance, there is an improvement in accuracy.

Comparative experiment 2: With non-enhancement condition

In order to verify the effectiveness of image enhancement, the effect without image enhancement is tested here. The difference before and after enhancement is whether 10 images are fused into 1 image. There is no image enhancement method to directly convert the signal into an image. The accuracy obtained from the test is shown in Table 8. The average accuracy of the enhanced method is 84.50%. Compared with the method after image enhancement, the average accuracy difference is 14.56%. The effect of enhancement is obvious. This shows that the fault feature of the image is indeed enhanced.

Table 8 Comparison with non-enhanced images

The results in Table 8 show that this method can highlight the features of the image. The proposed method enhances the accuracy of testing. The effect of the diagnosis is poor under the condition of no enhancement. This indicates that the fault features are not prominently visible in the image. By using the method of image feature enhancement, fault features can be detected more easily.

Comparative experiment 3: Under different noise conditions

In order to verify the effect of this method in noise environment, white noise with different signal-to-noise ratios is added to the original signal to test the accuracy. The accuracy of this method under different signal-to-noise ratios is shown in Table 9. With the increase of signal-to-noise ratio, the accuracy of the experiment also improves. From the table, we can see that the accuracy is 31.92% when the signal-to-noise ratio is 10 dB. When the signal-to-noise ratio is 20 dB, the accuracy is 79.70%. The signal-to-noise ratio has a significant impact on accuracy at 10 dB and 20 dB. When the signal-to-noise ratio is greater than 30 dB, the accuracy remains above 90%. In addition, with the increase of signal-to-noise ratio, the detection accuracy shows an upward trend. At a signal-to-noise ratio of 100 dB, the average detection accuracy is 94.74%.

Table 9 Accuracy under different signal-to-noise ratios

Through the table, we can obtain the following results: the accuracy of the method is significantly affected by the signal-to-noise ratio at 10 dB. However, the method shows higher diagnostic efficiency under noise conditions above 20 dB. The reason behind this result is that the white noise below 20-dB masks the characteristics of most signals, resulting in the improvement of Canberra distance being unable to effectively achieve fault classification. Therefore, there are limitations to this method in a noise environment below 20 dB.

Comparative experiment 4: With different image enhancement methods

In order to thoroughly evaluate the influence of pixel value variations on enhanced image feature extraction, we utilized the maximum and minimum pixel values as control groups for extracting fault features. The fusion method for the maximum pixel value is represented by Eq. 19. On the other hand, the fusion method for the minimum pixel value is represented by Eq. 20. Compared to the original method, Eq. 19 selects the maximum pixel value from 10 images for the enhanced image pixel values. On the other hand, Eq. 20 selects the minimum pixel value from 10 images for the enhanced image’s pixel values.

$${G}_{max}(x,y)=\left\{\begin{array}{c}\begin{array}{cc}\underset{i=\text{1,2},\dots px}{max}({I}_{i}(x,y))& {N}_{px}<T\end{array}\\ \begin{array}{cc}255& { N}_{px}\ge T\end{array}\end{array}\right.$$
(19)
$${G}_{min}(x,y)=\left\{\begin{array}{c}\begin{array}{cc}\underset{i=\text{1,2},\dots px}{min}({I}_{i}(x,y))& {N}_{px}<T\end{array}\\ \begin{array}{cc}255& { N}_{px}\ge T\end{array}\end{array}\right.$$
(20)

The control group utilized the maximum and minimum grayscale values from 10 images as the respective grayscale values for the enhanced images. The improved Canberra distance extracted through the maximum fusion method is shown in Fig. 12a. The improved Canberra distance obtained through the minimum value fusion method is displayed in Fig. 12b. Figure 12c exhibits the improved Canberra distance acquired via the enhanced weighted average method. The horizontal axis represents the number of calculation groups, with 10 sets calculated for each method. The vertical axis signifies the enhanced Canberra distance. In the blue box of Fig. 12a, the features of outer ring fault 1 and ball fault 2 intersect. This indicates that the method cannot accurately classify these two types of faults. The same phenomenon is also observed in Fig. 12b. When features become entangled, it becomes challenging to accurately distinguish the two working conditions. This article effectively addresses this phenomenon in the employed improved weighted average method. It effectively disentangles the entanglement between the two working conditions.

Fig. 12
figure 12

Improved Canberra distance features for different enhancement methods

Comparative experiment 5: With different datasets

The comparative experiment is conducted using data obtained from Southeast University (https://www.github.com\\/cathysiyu/Mechanical-datasets). The working conditions consist of normal status, inner ring fault, outer ring fault, and ball fault. The dataset is obtained from the Drivetrain Dynamics Simulator. We selected the motor vibration signals from the dataset. The motor vibration signals for the four fault types are plotted as SDP images, as shown in Fig. 13.

Fig. 13
figure 13

SDP images from the bearing dataset of Central South University

The differences between the SDP images for each working condition are visually apparent. The SDP image for the normal status exhibits a distinct outline. The SDP image for the inner ring fault shows a more pronounced contrast. The SDP images for the outer ring fault display a block-based distribution. Ball faults also show a block distribution. However, compared to the SDP image of the outer ring fault, the image contour of the ball fault is more distinct.

To validate the effectiveness of the proposed classification method, various feature classification methods were incorporated in this experiment. Simultaneously, we introduce varying levels of white noise to the signal in order to conduct accuracy experiments. This is done to demonstrate the noise resistance ability of the proposed method in this paper. The experimental results are illustrated in Fig. 14.

Fig. 14
figure 14

Classification accuracy under different methods and signal-to-noise ratios

The images demonstrate that the current method in this dataset consistently achieves high classification performance. It also maintains high classification accuracy in noisy environments. The current method maintains an accuracy of over 96% even when the signal-to-noise ratio exceeds 40 dB. Moreover, the proposed method surpasses the performance of both the support vector machine (SVM) and K-nearest neighbors (KNN) methods in terms of classification.

Limitations and implications of research findings in the current research context

The research content of this article has significant implications in the current research context. Currently, many fields have applied SDP signal processing methods. The content of this article can be to enhance image features, which is very meaningful for fault diagnosis using signals. The extraction of fault features from SDP images is a simple and effective method. An advantage of SDP images is their independence from mechanical power or configuration. Furthermore, they are applicable to various signal types. However, this conversion method does have limitations. Firstly, the image features may not fully capture the signal changes. Additionally, features can be lost during the process of data-to-image conversion. The conversion of data into images serves as a means to increase data dimensionality. The image format complicates the calculation process. This presents challenges for image processing and feature extraction. Achieving signal feature representation with reduced computational time and feature count is a potential challenge. Another potential research avenue involves employing image features to predict mechanical lifespan.

Conclusions

In this paper, a method of image enhancement based on image fusion is proposed. This method is applied to the fault diagnosis of mechanical vibration signals, and the enhancement of the synchronized dot pattern image features of vibration signal conversion is realized. In addition, the improved Canberra distance is used to extract image fault features. In this paper, vibration signal processing, feature extraction and fault classification are realized. At the same time, different experiments are carried out using the data of rolling bearings. In the comparison of original images, the effectiveness of the image enhancement method in this paper is verified. In the comparison test with the original Canberra distance, the effectiveness of the improved Canberra distance using the maximum singular value and contrast is verified. Finally, the robustness of the image enhancement method and feature extraction ability under the influence of different intensities of noise is analyzed. Therefore, the image enhancement method proposed in this paper has high reliability and classification accuracy in fault diagnosis using the synchronized dot pattern.

Availability of data and materials

It will be made available on request.

Abbreviations

STFT:

Short-time Fourier transform

SDP:

Symmetrized dot pattern

VMD:

Variational mode decomposition

SVM:

Support vector machine

KNN:

K-nearest neighbors

IMF:

Intrinsic mode function

References

  1. Liu ZH, Lu BL, Wei HL et al (2020) A stacked auto-encoder based partial adversarial domain adaptation model for intelligent fault diagnosis of rotating machines. IEEE Trans Industr Inf 17(10):6798–6809

    Article  Google Scholar 

  2. Zhang L, Zhang F, Qin Z et al (2022) Piezoelectric energy harvester for rolling bearings with capability of self-powered condition monitoring. Energy 238:121770

    Article  Google Scholar 

  3. Zhang Y, Cao J, Zhu H et al (2019) Design, modeling and experimental verification of circular Halbach electromagnetic energy harvesting from bearing motion. Energy Convers Manage 180:811–821

    Article  Google Scholar 

  4. He M, He D (2020) A new hybrid deep signal processing approach for bearing fault diagnosis using vibration signals. Neurocomputing 396:542–555

    Article  Google Scholar 

  5. Wang T, Qiao M, Zhang M et al (2020) Data-driven prognostic method based on self-supervised learning approaches for fault detection. J Intell Manuf 31:1611–1619

    Article  Google Scholar 

  6. Altan A, Karasu S, Bekiros S (2019) Digital currency forecasting with chaotic meta-heuristic bio-inspired signal processing techniques. Chaos Solitons Fractals 126:325–336

    Article  MathSciNet  Google Scholar 

  7. Altan A, Karasu S, Zio E (2021) A new hybrid model for wind speed forecasting combining long short-term memory neural network, decomposition methods and grey wolf optimizer. Appl Soft Comput 100:106996

    Article  Google Scholar 

  8. Wu P, Lou S, Zhang X et al (2020) Data-driven fault diagnosis using deep canonical variate analysis and fisher discriminant analysis. IEEE Trans Industr Inf 17(5):3324–3334

    Article  Google Scholar 

  9. Li C, Li S, Zhang A et al (2021) Meta-learning for few-shot bearing fault diagnosis under complex working conditions. Neurocomputing 439:197–211

    Article  Google Scholar 

  10. Liu C, Zhang L, Niu J et al (2020) Intelligent prognostics of machining tools based on adaptive variational mode decomposition and deep learning method with attention mechanism. Neurocomputing 417:239–254

    Article  Google Scholar 

  11. Özçelik YB, Altan A (2023) Overcoming nonlinear dynamics in diabetic retinopathy classification: a robust AI-based model with chaotic swarm intelligence optimization and recurrent long short-term memory. Fractal and Fractional 7(8):598

    Article  Google Scholar 

  12. Bai Y, Yang J, Wang J et al (2021) Image representation of vibration signals and its application in intelligent compound fault diagnosis in railway vehicle wheelset-axlebox assemblies. Mech Syst Signal Process 152:107421

    Article  Google Scholar 

  13. Chang L, Liu Z, Shen Y et al (2020) Novel multistate fault diagnosis and location method for key components of high-speed trains. IEEE Trans Industr Electron 68(4):3537–3547

    Article  Google Scholar 

  14. Kim Y, Na K, Youn BD (2022) A health-adaptive time-scale representation (HTSR) embedded convolutional neural network for gearbox fault diagnostics. Mech Syst Signal Process 167:108575

    Article  Google Scholar 

  15. Wu J, Tang T, Chen M et al (2020) A study on adaptation lightweight architecture based deep learning models for bearing fault diagnosis under varying working conditions. Expert Syst Appl 160:113710

    Article  Google Scholar 

  16. Jablonski A, Dziedziech K (2022) Intelligent spectrogram-a tool for analysis of complex non-stationary signals. Mech Syst Signal Process 167:108554

    Article  Google Scholar 

  17. Wodecki J, Kruczek P, Bartkowiak A et al (2019) Novel method of informative frequency band selection for vibration signal using nonnegative matrix factorization of spectrogram matrix. Mech Syst Signal Process 130:585–596

    Article  Google Scholar 

  18. Reinhold I, Sandsten M (2022) The multitaper reassigned spectrogram for oscillating transients with Gaussian envelopes. Signal Process 198:108570

    Article  Google Scholar 

  19. Peng C, Zou J, Liang M et al (2020) A novel linear CCD variable velocity two-dimensional imaging approach based on adaptive image segmentation. IEEE/ASME Trans Mechatron 25(2):762–769

    Article  Google Scholar 

  20. Long Z, Zhang X, He M et al (2021) Motor fault diagnosis based on scale invariant image features. IEEE Trans Industr Inf 18(3):1605–1617

    Article  Google Scholar 

  21. Wang XB, Luo L, Tang L et al (2021) Automatic representation and detection of fault bearings in in-wheel motors under variable load conditions. Adv Eng Inform 49:101321

    Article  Google Scholar 

  22. Tang H, Liao Z, Chen P et al (2020) A novel convolutional neural network for low-speed structural fault diagnosis under different operating condition and its understanding via visualization. IEEE Trans Instrum Meas 70:1–11

    Google Scholar 

  23. Li C, Mo L, Yan R (2021) Fault diagnosis of rolling bearing based on WHVG and GCN. IEEE Trans Instrum Meas 70:1–11

    Google Scholar 

  24. Fang H, Deng J, Zhao B et al (2021) LEFE-Net: a lightweight efficient feature extraction network with strong robustness for bearing fault diagnosis. IEEE Trans Instrum Meas 70:1–11

    Google Scholar 

  25. Wang T, Liu Z, Lu G et al (2020) Temporal-spatio graph based spectrum analysis for bearing fault detection and diagnosis. IEEE Trans Industr Electron 68(3):2598–2607

    Article  Google Scholar 

  26. Nath AG, Udmale SS, Raghuwanshi D et al (2021) Improved structural rotor fault diagnosis using multi-sensor fuzzy recurrence plots and classifier fusion. IEEE Sens J 21(19):21705–21717

    Article  Google Scholar 

  27. Sun Y, Li S, Wang X (2021) Bearing fault diagnosis based on EMD and improved Chebyshev distance in SDP image. Measurement 176:109100

    Article  Google Scholar 

  28. Wang H, Wu F, Zhang L (2021) Application of variational mode decomposition optimized with improved whale optimization algorithm in bearing failure diagnosis. Alex Eng J 60(5):4689–4699

    Article  Google Scholar 

  29. Chen P, Zhao X, Zhu Q (2020) A novel classification method based on ICGOA-KELM for fault diagnosis of rolling bearing. Appl Intell 50:2833–2847

    Article  Google Scholar 

  30. Sun Y, Li S, Wang Y et al (2021) Fault diagnosis of rolling bearing based on empirical mode decomposition and improved Manhattan distance in symmetrized dot pattern image. Mech Syst Signal Process 159:107817

    Article  Google Scholar 

Download references

Acknowledgements

This work is supported by National Natural Science Foundation of China (No. 61603149) and Shandong Province Natural Science Foundation (No. ZR2023MF102).

Funding

This work is supported by National Natural Science Foundation of China (No. 61603149) and Shandong Province Natural Science Foundation (No. ZR2023MF102).

Author information

Authors and Affiliations

Authors

Contributions

JGP is responsible for writing and rewriting the article, revising the article, simulating the program. YJS is responsible for providing ideas and formulating research directions. WW is responsible for building the framework and writing of the first draft. All authors have read and approved the manuscript.

Corresponding author

Correspondence to Yongjian Sun.

Ethics declarations

Competing interests

The authors declare that they have no competing interests.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Peng, J., Wang, W. & Sun, Y. Bearing fault diagnosis based on enhanced Canberra distance feature in SDP image. J. Eng. Appl. Sci. 71, 198 (2024). https://doi.org/10.1186/s44147-024-00535-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s44147-024-00535-2

Keywords