# INTRODUCTION he railway is the great artery of the national economy. It's safe and reliable rapid development is of great significance to speed up the process of national modernization and promote economy and society. In recent years, with the gradual improvement of high-speed rail traction system, there has been wide usage of high-speed rail traction transformer [1]. Therefore, it not only makes the high-speed rail traction transformer failure is coming out one after other, but also poses a hazard to the system. Therefore, how to monitor the traction transformer operation and how to diagnose its fault in time, these problems have become important in research of traction transformer fault diagnosis. Accurate and real-time detection of the fault state of traction transformer, high-speed rail traction system for the efficient and safe operation is of great significance. In recent years, with the development of datadriven [2] fault diagnosis technology, fault diagnosis of high-speed rail traction system has provided a lot of valuable reference ideas. Based on the expert experience and the model-based fault diagnosis method, the fault diagnosis method based on data can avoid the uncertainties of system modeling, the shortcomings of unknown interference and the difficulty of obtaining expert experience. It can improve the fault diagnosis method the ability to detect faults, and is convenient in practical application. Based on Fast ICA [3] algorithm, the fault diagnosis of high-speed traction transformer derives from the data-driven fault diagnosis method. This paper presents a method of fault diagnosis using the V/V wiring [4] model of the high-speed traction transformer and the Fast ICA algorithm [5]. The simulation results show that the algorithm has high resolution and precision, It is of practical value for fault detection of high-speed traction transformer. # II. HIGH-SPEED IRON TRACTION TRANSFORMER & FAULT DIAGNOSIS V/V type wiring is the two single-phase transformer to V-shaped connection, two transformers secondary winding, each with one end connected to the traction substation two-phase bus. The other end connects to the rail back to the return line. By connecting two single-phase transformers to the three-phase power system in a V-way, each traction substation can be powered by a two-phase voltage of a three-phase system. Turn three-phase voltage of the power system into traction power supply of two-phase voltage power supply. The so-called fault diagnosis is the operation of the equipment and abnormal circumstances to make judgments, which consists of fault detection, fault separation and fault identification. Specifically, it refers to fault separation and fault identification. With the success of China's high-speed railway has been completed, traction transformer fault diagnosis made a higher demand, that is quick detection of transformer latent fault, and identification of the type of failure. Therefore, it is of great practical significance to study a complete fault diagnosis system of traction transformer, which can deal with the latent fault and reduce the economic loss. Fault diagnosis of traction transformer is a nonlinear mapping process from fault information to fault type, which cannot be described by the precise mathematical model, and it is difficult to obtain a large number of fault samples when the transformer fails. Therefore, the traditional fault diagnosis method has limitations in application. system in a V-way, each traction substation can be powered by a two-phase voltage of a three-phase system. Turn three-phase voltage of the power system into traction power supply of two-phase voltage power supply. ICA algorithm has been a new statistical signal processing method developed in recent years. Compared with the traditional methods, ICA algorithm has the following advantages: it is not necessary to assume that the measured data satisfy the Gaussian distribution when modeling. The independent composition of the ICA algorithm can not only remove the relevance of the variables, but also describe the process characteristics with high-level statistical information. The number of independent components to be processed by ICA algorithm is less than that of principal component analysis. It chooses the Newton iteration method as the optimization algorithm for fast speed, no need of selecting the step parameters, easy to use and so on. Given this, this paper mainly uses Fast ICA algorithm to capture and process the voltage signal of the traction transformer and establish the monitoring statistic to diagnose the failure of the traction transformer. Fast ICA algorithm [6], also known as fixed-point (Fixed-Point) algorithm. Fast ICA is an efficient algorithm for independent component analysis, invented by Aapo Hyvärinen at Helsinki University of Technology, which is a fast optimization iterative algorithm. Despite ordinary neural network algorithm, it uses a batch approach, which means a large number of sample data is involved in each step of operation. Fast ICA algorithm includes several forms such as kurtosis, the maximum likelihood, the largest negentropy, and etc. This paper is based on the largest negentropy, which uses the largest negentropy as a search direction to extract the independent source, fully embodies the projection tracking (Projection Pursuit) this traditional linear transformation of the idea. Also, the algorithm uses a fixed-point iteration scheme maximizing non-Gaussianity as a measure of statistical independence, which is faster than conventional gradient descent methods for ICA. The matrix model of ICA algorithm is, E AS X + = (1) There are d components in the input signal source, T d s s s S )] ( ),... 2 ( ), 1 ( [ =(2) Which are independent of each other, the observed signal is mixed signal matrix T m x x x X )] ( ),... 2 ( ), 1 ( [ =(3) There are m of mixed signals, this mixed signal is the actual observation signal. A is the mixed coefficient matrix with m rows and d columns, E is the error matrix with m rows and n columns, where n is the number of sampling points. The estimation of the active signal is, S WAS WX S ? = = ?(4) We can see that W and A are the mutually inverse matrix. The objective of the ICA is to obtain the separation matrix W and the independent component matrix S from the observed signal X in the case where the quantities of the source signals are independent of each other. Because the Fast ICA algorithm takes the largest negentropy as a search direction, we first discuss the negentropy criterion. From the theory of information theory, we can see that the entropy of Gaussian variables is the largest in all random variables of equal variance, so we can use entropy to measure non-Gaussian, common entropy correction forms, that is negentropy. According to the central limit theorem, if a random variable X is composed of many independent random variables Si(i=1,2,3,...N), as long as Si has a finite mean and variance, the random variable X is closer to the Gaussian distribution, regardless of its distribution. In other words, Si is more non-Gaussian than X. Thus, in the separation process, the independence of the separation results can be expressed by the non-Gaussian measure of the separation result, and when the non-Gaussian measurement reaches the maximum, it indicates that the separation of the individual components is completed. Negentropy Ng(Y) is defined by ) ( ) ( ) ( Y H Y H Y N Gauss g ? =(5) Where Y Gauss is a Gaussian random variable with the same variance as Y, H() is the differential entropy of the random variable. ? ? ? d p p Y H Y Y ) ( lg ) ( ) ( ? ? =(6) In the above equation, ) (? Y p is the probability density function. According to the information theory, the random variable of Gaussian distribution has the differential entropy in the random variable with the same variance. When Y has a Gaussian distribution, N g (Y). The higher the non-Gaussian property of Y, the smaller the differential entropy, the value of N g (Y), so N g (Y) can be used as a measure of the non-Gaussian property of the random variable Y. Since the probability density distribution function that needs to be known for calculating the differential entropy according to equation ( 6) is obviously impractical, so the following approximate formula Global Journal of Researches in Engineering ( ) Volume XVII Issue VIII Version I 2 Year 2017 F )]} ( [ )] ( [ { ) ( Guass g Y g E Y g E Y N ? =(7) Where E[?] is the mean operation, g(?) is a nonlinear function, take it as ) tanh( ) ( 1 y y g = ) 2 / exp( ) ( 2 2 y y y g ? = 3 3 ) ( y y g =(8) # ICA optimization algorithm consists of two parts Fault Diagnosis of High Speed Iron Traction Transformer for V/V Type Wiring based on Fast Independent Component Analysis (7) Step 1: Preprocessing of Observed Data At first, the signal is de-centered. Removal of the center is to make the mean of each vector zero. The variable itself is subtracted from the mean. After normalization, the center of the data is divided by its own standard deviation to make each variable equal weight for preventing a variable in the process of accounting for a dominant position. In practical applications, according to the actual situation of operation, each variable is given a different weight to get more targeted results. Step 2: Whitening The first step is to whiten the data, which can remove the correlation between the observed signals and reduce the computational complexity, thereby makes the convergence of the algorithm better. Let the random variable x the k th sample value be x(k), the covariance matrix is: )) ( ) ( ( k x k x E R T x = (9) The characteristic value of Rx is decomposed into T x U U R ? = (10) The whitening transformation matrix is: ) ( ) ( k Qx k z = (11) T U Q 2 1 ? ? = (12) From the whitening transformation matrix can be obtained: ) ( ) ( ) ( ) ( k Bs k QAs k Qx k z = = = (13) Where B is an orthogonal matrix and has a relation: I BB B k s k s BE k z k z E T T T T = = = )) ( ) ( ( )) ( ) ( ((14) It appears that the operation of ICA becomes simplified by whitening transformation. The problem of solving ICA is simplified from matrix A to solving quadrature matrix B, and the estimation of source signal is: ) ( ) ( ) ( ?k Qx B k z B k s T T = = (15) Combining formula (4), which can be obtain, T WQ B ) ( 1 ? = (16) The Fast ICA Algorithm is: Step 1: The observation data X is centered so that its mean is zero. Step 2: And whiten the data Z X ? . Step 3: Select the number of m components to be estimated, set the number of iteration 1 ? p . Step 4: Choose an initial (e.g., random) vector b p possessing a unit norm. Step 5: Set ( ) { } ( ) { } p T p T p p b Z b g E Z b Zg E b ' ? = (17) Step 6: Do the following orthogonalization Step 8: If it is not converged, go back to Step5. Step 9: Let 1 + = p p . If p is not greater than the desired number of m, go back to Step4. # IV. THEORETICAL ANALYSIS & SIMULATION First of all, to create the simulation model with MATLAB / SIMULINK (Fig. 1): Secondly, offline modeling. The mixed signal matrix X = [x (1), x (2), ... x (m)] T is taken as the signal observation point in the Fast ICA algorithm. The matrix is m rows and n columns, Where m is the number of observed signals. In this model, m is the number of observation vectors m = 5, and n is the number of sampling points, and X is the actually observed signal. Thirdly, establish historical statistics. Here in the Fast ICA fault diagnosis method, the two most widely used statistics, I 2 statistics and Squared Prediction Error (SPE) statistics, are used to monitor whether the process is normal or not. The I 2 statistic is used to measure the system change of the process, whereas The SPE is used to measure the random variation, and the I 2 and SPE statistic with reasonable control limits can detect different types of faults. Therefore, this method combines the two statistics for better monitoring effect. Combine the formula (16), The estimation of the mixed signal by independent component analysis is: (20) The error of the estimated and actual value of the mixed signal is: And then the traction transformer system for real-time data acquisition is to monitor the data, the monitoring data for pretreatment. The standardized data is decomposed by ICA to compute the corresponding I 2 and SPE statistics, and is compared with the control limits obtained in the historical statistical data. If the statistics are not exceeded, the system is running normally. If the statistics are overrun, the system is faulty. In MATLAB, the model of the high-speed traction transformer, which simulates the V/V wiring, is used to monitor the voltage signal of the input and output. The monitoring signal is a mixed signal and the observation signal. And then after collecting 500 sampling points, as shown in Fig. 3 The monitoring data were pretreated, the standardized data were ICA decomposition, Calculate the corresponding I2 and SPE statistics to compare it with the control limits obtained from historical statistics. # CONCLUSION The Fast ICA algorithm based on the maximum negative entropy criterion is used to analyze the data in the precision range, and the I and SPE monitoring statistics are calculated by Fast ICA algorithm. In the data simulation experiment, the waveform was also successfully monitored for failure. 1![Fig. 1: Correct Simulation Model](image-2.png "Fig. 1 :") ![collected historical data into the above formula, there can get the historical statistics during operation.](image-3.png "") ![below, which input and output signals have five sampling points. Add a short circuit fault to Fig.1 above, As shown in Figure Fig.2 below. In the Fig.2, the phase A and phase B of the V/V traction transformer input phase are interrupted. And then continue to sample this simulation, the same collection of 500 sampling points, sampling diagram shown in Fig.4, then use the 1000 sampling points as Fast ICA algorithm in the signal observation points.](image-4.png "") 23![Fig. 2: Fault Simulation Model](image-5.png "Fig. 2 :Fig. 3 :") 4![Fig. 4: Fault Sampling Signal](image-6.png "Fig. 4 :") 42017![2017 Global Journals Inc. (US) Global Journal of Researches in Engineering ( ) Volume XVII Issue VIII Version I Year It reveals that a failure has occurred after the 500th sampling point, as shown in Fig.5 below.](image-7.png "© 4 2017 F") © 2017 Global Journals Inc. (US) ## ACKNOWLEDGMENT This paper is supported by Project Supported by National Natural Science Foundation of China (51475246); National Natural Science Foundation of Jiangsu Province(BK20161019); University Science Research Project of Jiangsu Province (15KJB470011). * A simulation mathematical model of a traction transformer with tapped secondary windings MEvstaf'ev AYa AGYakushev Sereda * Russian Electrical Engineering 87 5 2016 * Performance of a data-driven technique to changes in wave height and its effect on beach response MJose HarshinieHorrillo-Caraballo Karunarathna DominicEShun-Qi Pan Reeve Water Science and Engineering 10 2016 * Independent component analysis: algorithms and applications AHyvarinen EOja Neural Networks 13 2000 * The Protection Setting for 110kV Line Connects to V/V Traction Transformers WeiZhang LeiLi * Advanced Materials Research 516 2012 * Fault diagnosis in dynamic systems using analytical and knowledge based redundancy a survey and some new results PFrank Automatica 26 3 1990 * Fixed-point ICA algorithm for blind separation of complex mixtures containing both circular and noncircular sources YaoJunliang RenHaipeng LiuQing The Journal of China Universities of Posts and Telecommunications 2016 23