УНИВЕРСИТЕТ ИТМО
НАУЧНО-ТЕХНИЧЕСКИИ ВЕСТНИК ИНФОРМАЦИОННЫХ ТЕХНОЛОГИИ, МЕХАНИКИ И ОПТИКИ январь-февраль 2015 Том 15 № 1 ISSN 2226-1494 http://ntv.ifmo.ru/
SCIENTIFIC AND TECHNICAL JOURNAL OF INFORMATION TECHNOLOGIES, MECHANICS AND OPTICS January-February 2015 Vol. 15 No 1 ISSN 2226-1494 http://ntv.ifmo.ru/en
COMPARISON OF VARIOUS APPROACHES TO MULTI-CHANNEL INFORMATION FUSION IN C-OTDR SYSTEMS FOR REMOTE MONITORING
OF EXTENDED OBJECTS A.V. Timofeeva
a LPP "EqualiZoom", Astana, 010000, Kazakhstan Corresponding author: [email protected] Article info
Received 24.11.2014, accepted 17.12. 2014 doi: 10.17586/2226-1494-2015-15-1-122-129 Article in English
Reference for citation: Timofeev A.V. Comparison of various approaches to multi-channel Information fusion in c-otdr systems for remote monitoring of extended objects. Scientific and Technical Journal of Information Technologies, Mechanics and Optics, 2015, no. 1, vol.15, pp. 122-129 (in English)
Abstract. The paper presents new results concerning selection of optimal information fusion formula for ensembles of C-OTDR channels. Here C-OTDR is a coherent optical time domain reflectometer. Each of these channels provides data for appropriate automatic classifier which is designed to classify the elastic vibration sources in the multiclass case. Those classifiers form a so-called classifiers ensemble. Ensembles of Lipschitz Classifiers were considered. In this case the goal of information fusion is to create an integral classificator designed for effective classification of seismoacoustic target events. The Matching Pursuit Optimization Ensemble Classifiers (MPOEC), the Linear Programming Boosting (LP-Boost) (LP-P and LP-B variants), the Multiple Kernel Learning (MKL), and Weighing of Inversely as Lipschitz Constants (WILC) approaches were compared. The WILC is a brand new approach to optimal fusion of Lipschitz Classifiers Ensembles. The basics of these methods have been briefly described along with intrinsic features. All of those methods are based on reducing the task of choosing convex hull parameters to a solution of an optimization problem. All of the mentioned approaches can be successfully used for using in the C-OTDR system data processing. Results of practical usage are presented. Keywords: C-OTDR channels, MKL, LP-Boost, Lipschitz Classifiers Ensembles.
УДК 004.942, 001.891.57
СРАВНЕНИЕ РАЗЛИЧНЫХ ПОДХОДОВ К МУЛЬТИКЛАССОВОМУ СЛИЯНИЮ ИНФОРМАЦИИ В C-OTDR СИСТЕМАХ ДЛЯ УДАЛЕННОГО МОНИТОРИНГА ПРОТЯЖЕННЫХ ОБЪЕКТОВ А.В. Тимофеев"
a ТОО «Эквализум», Астана, 010000, Казахстан, Адрес для переписки: [email protected] Информация о статье
Поступила в редакцию 24.11.2014, принята к печати 17.12. 2014 doi: 10.17586/2226-1494-2015-15-1-122-129 Язык статьи - английский
Ссылка для цитирования: Тимофеев А. В. Сравнение различных подходов к мультиклассовому слиянию информации в C-OTDR системах для удаленного мониторинга протяженных объектов // Научно-технический вестник информационных технологий, механики и оптики. 2015. Том 15. № 1. С. 122-129
Аннотация. Рассматривается проблема выбора оптимального способа слияния информации, поступающей от множества каналов системы когерентной оптической рефлектометрии во временной области (C-OTDR). Каждый из этих каналов является источником данных для автоматических классификаторов источников упругой сейсмоакусти-ческой вибрации. Эти классификаторы образуют ансамбль классификаторов. Были рассмотрены ансамбли липшице-вых классификаторов. В этом случае целью слияния информации является построение интегрального классификатора, предназначенного для эффективной классификации целевых сейсмоакустических событий. Сравнивались следующие методы: поиск совпадений на ансамбле классификаторов (MPOEC), бустинг с линейным программированием (LP-Boost) (LP-P и LP-B модификации), обучение с множественными ядерными функциями (MKL) и метод обратных констант Липшица (WILC). Выделен WILC как новый подход к оптимальному слиянию информации на ансамбле липшицевых классификаторов. Обсуждены основы работы каждого из методов с указанием специфических особенностей. Показано, что все методы основаны на сведении проблемы выбора параметров выпуклой оболочки к
оптимизационной задаче. Все рассмотренные подходы могут успешно использоваться при обработке информации в C-OTDR системах. Представлены результаты практического использования этих методов. Ключевые слова: C-OTDR каналы, MKL, LP-Boost, ансамбль липшицевых классификаторов
Introduction
Application of the C-OTDR (Coherent Optical Time Domain Reflectometer) technology to decide various problems of extended objects remote monitoring is currently being evaluated as a very promising approach [1, 2]. In particular, this technology can be effectively used to monitor oil and gas pipelines, controlling technological processes and identifying unauthorized activities in close proximity of the monitored objects. Simplistically, a C-OTDR-system consists of an infrared laser, an optical fiber and a processing unit. The laser sends the probing signals through the optical fiber which is buried in the vicinity of the monitoring object. The processing unit is designed for comprehensive processing of the backscattered signals, which are called speckle-structures. The main item of the C-OTDR technology is a comprehensive analysis of the Rayleigh backscattered radiation characteristics, which transforms into an energetically weakened pulse and propagates constantly in the direction opposite to the direction of a pulsed laser flow. The reflected signal is created by the presence of static impurities in the optical fiber body and defects in the microstructure. Signals scattered by the centers coherently and randomly interfere with each other, forming so-called speckle patterns. Speckle patterns corresponding to different sections of the optical fibers are recorded and accumulated in the data center. The slightest change of the reflectance index value of the fiber, which occurred in a particular place, radically changes the speckle pattern corresponding exactly to this place of the fiber. These changes are reliably detected by the data center. The local changes in refractive index occur under the impact of temperature or due to mechanical action on the optical fiber surface. Let us call the optical fiber buried in the soil to a depth of 50-100 cm, a fiber optical sensor (FOS). Mechanical stress on the FOS surface is caused by seismic acoustic waves. These waves are generated of the sources of elastic vibrations (SEV) which located in vicinity of laying the FOS. Upon reaching the FOS, seismoacoustic wave causes a local longitudinal microstrain on its surface. Those microstrains in turn, cause a change in the local refractive index of light in a relatively small sector of the FOS. As a result, the speckle pattern, which corresponds to this sector, changes significantly. Thus, the FOS quite accurately reflects the state of the seismoacoustic field in its vicinity. The seismoacoustic field contains information about events that occur in the surface layers of the ground near the FOS. This field is created by structural waves, which generated due to mechanical effects on the soil or as a result of a seismic activity. Walking or running man, traffic, earthworks, including hand digging are typical sources of the seismoacoustic emission (structural acoustic wave). In this case, the frequency range of the seismoacoustic waves is in the interval of 0.1 Hz to 1000 Hz. The information, which is required for correct identify the type of SEV, is concentrated in the frequency range of 0.1 Hz to 500 Hz, while 95% of the meaningful information is in even the more narrow range of 1 Hz to 350 Hz. The spectral characteristics of the target signals which lie above and below this frequency range carry information only about the individual characteristics of the SEV. The SEV, which are subjects of interest for remote C-OTDR monitoring will be called a target SEV (TSEV). For the convenience of data processing, the entire FOS length is broken to successive portions (sites) each has length around 10-15 m. The data from those sites is processed separately. These sites will be called C-OTDR channels or just channels. Width of the channel Aw depends on the probe pulse length. In practice, TSEV has its own small size and assumed point. Due to the nature of the elastic oscillation, the wave from a point source of seismoacoustic emission is usually detected simultaneously in several C-OTDR channels. At the same time, due to strongly anisotropic medium of the elastic vibrations propagation, the structure of the oscillations (speckle patterns) varies considerably between different C-OTDR channels. In each channel a time-frequency characteristics of the speckle pattern are largely reflect a time-frequency structure of the SEV, which occur in vicinity of the corresponding channel. The oscillation energy is considerably attenuated and distorted during propagation in the environment. Intensity of attenuation and distortion depend on the average absorption factor of the medium and on the distance from the oscillation point to the location of channel. Accordingly there is always one channel, wherein vibrations of the TSEV reflected substantially intense than in other channels. We will call this channel dominant. C-OTDR systems perform three major tasks in the following sequence:
1. task "D" (Detection) - detection of the TSEV;
2. task "E" (Estimation) - estimate of the location of the TSEV;
3. task "C" (Classification) - classification of detected TSEV by means of assigning it to one of D priori given classes.
In the multichannel case, the task "C" has to be solved by creation the method of effective multichannel data fusion. There are number of various approaches to effective multichannel data fusion for task "C" (classification). This paper describes results of a comparing various multichannel data fusion approaches for TSEV classification including a brand new approach which based on weighing of inversely as Lipschitz Constants (WILC) and it allows to improve the generalization ability of the classification system.
Designations and Research objective
Let us denote:
- C-OTDR channels. Ch(Kk) is k-th C-OTDR channel, where a tuple Kk = (Ak,Rk), here Ak is an absorption coefficient of k-th channel, Rk is a length of k-th channel;
- Feature. A tuple (Z_, d ) is a compact feature space where Z is a set of feature values, d is a metric of Z , data of all channels belongs to Z ;
- Set of SEV classes. A set © is a finite set of indexes of SEV-classes, |©| = D ;
- Training Set. ZT ={(Z, 6,) | i = 1,...N}, |{(z, 9,)}| = N , Z={z1;, z 2,,..., z m,}, 9,. e©, each of zk e Z,k e {1,...,m}, corresponds to Ch(Kk), and to 9,;
- True index of SEV class. A 9" e© is a true index of the SEV-class to which the samples zk belong, thus e* is an index of a target class;
- Samples to classify. A set Z = {z1,z2,...,zm}[0,1] is feature sample set; each of zk e Z,k e {1,...,m} , corresponds to Ch(Kk); in another words, we obtain the feature sample zk from k-th channel Ch(Kk);
- Lipschitz Margin Classifier. Let fk (9 | zk),k e {1,...,m};9 e ©, be a binary Lipschitz Margin classifier (LMC) [3, 4] with Lipschitz Constant (LC) Lk; fk (9 | zk): Z ^ {9,© \ 9} (concept: one against all); so, classifier fk (91 •) divides the feature space (Z, d) into two classes 9 and © \ 9;
fk (9 | zk ) = (fk (91 zk),Rk) here fk (9 | zk) e R1 is discriminate (stochastic) functions (so-called score-parameters, which shows similarity degree of a sample zk regarding to class 9e©; discriminant function fk (9 | zk) explicitly dependent on the index hypothesis to be tested 9 and implicitly on the index of the target class e*; Rk is the classification decision-making rule Rk : 9k = Arg Max (fk (9 | zk)); let us denote у -set of LMC fk (9 | zk,у) parameters, which needs to be tuned during of training process; otherwise, set у will be denoted as LMCP or LMCP у;
- Ensemble of LMC (Lipschitz Classifiers Ensembles). F(e | Z) = {fk (e | zk)|k = 1,..., m} is an ensemble of the LMC;
Fq : Z ^{9,6 \ 9} is an integral classifier on the ensemble F(0 | Z); F9 =(F (0|F(01 Z)), R); R : 9 = Arg Max (f (9 |F(9 | z))) is output of integral classifier F (9 |F(0 | Z));
- F (9|F(9 | Z)) - discriminate function on the classifiers ensemble F(9 | Z), F(e|F(e | Z))=ZPkfk (e | zk),
where 2Pk = 1, VPk ^ 0 ; coefficients {pk} are determined by various methods, which are object of our in-
kk
vestigation.
So there exist m statistical independent C-OTDR channels Ch = {Ch(Kk) | k = 1,...,m}. Each of those channels depends of external (environmental) parameters tuple Kk = (Ak, Rk) . Simply speaking, these channels transmit signals from sources of elastic vibrations (SEV) to FOS. Thus signals z k e Zk, k e{1,..., m} are outputs of C-OTDR channels Ch . The tuple Kk defines the effectiveness of channel Ch(Kk) for signal transmission.
The signals Z are contain relevant information about SEV time-frequency parameters. Every two channels Ch(Kk) and Ch(Kp) distort the SEV time-frequency parameters by differently because of external
parameters Kk and Kp are different. Accordingly we suppose every two different samples zk and z are statistically independent if k ф p . For each C-OTDR channel Ch(Kk) are used appropriate D binary classifiers fk (e,. | zk), e, e© . Each LMC fk (e,. | zk), is binary classifier, which divides the feature space (Z,d) into two classes 0,- and © \ 0 i.
So, we need to classify of the SEV type using observation Z of C-OTDR channels Ch . An obvious approach to solving this problem is to use the Lipschitz Classifiers Ensembles (F(0 | Z)). But the problem of effective multichannel data fusion arises. There are number of various approaches to multichannel data fusion.
The goal of this paper is to compare some data fusion methods effectiveness. A number of known
approaches and one a brand new method were studied. The brand new method is based on use of Lipschitz constants of LMC's.
Some Approaches to C-OTDR Multichannel Data Fusion for Multiclass Classification of TSEV
So the classification problem TSEV is reduced to the task of creating an effective multiclass classificator (MC) which is based on a LMC classifiers ensemble (Lipschitz Classifiers Ensembles). We remark that an ensemble of classifiers is a set of classifiers whose individual decisions are combined in some way (typically by weighted or unweighted voting) to classify new examples [5]. At any rate a MC learning method choice is a dominant problem. Usually the problem of learning a MC from training data is often addressed by means of kernel method (KM) [6-8]. In this case each kernel corresponds to an appropriate channel of the set Ch . For brevity we will not describe the baseline of this well-known method but we are going to pay attention to some KM modifications which are designed to work with LMC classifiers ensembles.
For the sake of simplicity, we will consider as a LMC a classic SVM [9]. By definition a SVM discriminant function f (61 zk,y) depends on the parameters a e RN (N is a power of the training set ZT ) and
b e R1. Here a is a normal vector to the hyperplane, |b| /||a|| is the perpendicular distance from the hyperplane to
the origin. Thus we have y = {a, b}, and each tuple y defines the hyperplane in the feature space.
Multiple Kernel Learning (MKL)
In contrast to baseline kernels selection ("averaging kernels" and "product kernels" [8]), MKL kernel selection is to learn a kernel combination during the training phase of the algorithm. So, the MKL objective is to optimize jointly over a linear combination of kernels q(Z(i),Z(j)) = Zm = 1 pkkk (zu,zkj) with LMCP y = {a,b}.
Here Z(i) = {zh.,z2,.,...,zmi}, Z(j> = {zij,z2j,...,zmj}, zpk = 1,Pk > 0, kk (•,•) is kernel function. MKL was
originally introduced in [6]. Let us denote Qt(Z) = (qt (Z,Z1),qt (Z,Z2),...,qt (Z,ZN)) e RN,i = 1,...,m. The final
decision has form FMKL (Z) = Arg M^X (sl=1Pk,e (Qk (Z )T a + be)). The choice of parameters MKL is made by using for each e the following scheme:
0.5 (z 1 = 1Pk, eaeTQt a) + C Z N=1 ((e., b + 21=^ Q (Z) aPe)) D (e, t) = max (0,1 - et), (Zt, e,) e ZT . In other words, in MKL case we optimize jointly the convex hull of kernels. Here for each e we have the same LMCP ye = {ae, be} for different k. LP-Boost (LP-P)
So, we will consider a case when classifiers ft (e | zk) of ensemble F (e |F(e | Z)) are not trained jointly, but coefficients {p k} are determined jointly. Here we have a situation where LMCP tuples ye are different for different k. This method is called the P-LP-Boost [10, 11], and here the final decision has the form
Flpp (Z)=Arg Max (zm = 1Pt (Qk (Z )T a,k + be,t)).
The training phase comes down to an optimal choice of parameters {p k}. This choice is performed by using standard optimization method (linear programming - LP) according to the following scheme:
vN (^,)),
under the condition Z1 = 1Pk (K (Z,Zt)a,,, + b^)-ArgMixZ1=1Pk (kk (Z,Zt)ae+ be^) + ^ >p,
m
i = 1,...,N, ZPk = 1,Pk >0,k = 1,..m. Here 4 - slack variables, v - regularization constant, which is chosen
k=1
using Cross Validation (CV). In frame of this approach not need provide the normalization of kernels kk ( ). Moreover, features for which p k = 0 need not to be computed for the final decision function. LP-Boost (LP-B)
Another version of LP approach to choice {pk} was called B-LP-Boost [11, 12]. In this case, each class has its own weight vector. So, we have (m x D) weighting matrix B. The final decision has the form
Flpb (Z) = Arg Max (zm=1 Bk (Qk (Z )T a,k + be,k)).
Choice of parameters {pk} we make in such way:
min
sb t. Z Pk,e = 1, Pk,e ^ 0.
k=1
minI -p +-
i vN VS
min |-p +—(Z £,.)|,
under the condition Z»1 B0 (K (Z, Zt) ae, t + be, t) - Z?=i B0'' (kk (Z, Z,) a0. t + b0. t) + £,. > p, i = 1,...,N, 0' * 0'', V0
,m:( Z Bk = 1,Bk > 0,k = 1,..m). As above, here ^ are slack variables, v — regularization
constant, v is chosen using CV. Here we have a linear programming problem too, but this problem is more expensive because of dimension increasing.
Matching Pursuit Optimization Ensemble Classifiers
The matching pursuit optimization approach was described in [13]. In [14] a new method that Matching Pursuit Optimization Ensemble Classifiers (MPOEC) was proposed in order to eliminate some useless or similar classifiers from the whole ensemble to improve the performance of ensembles without decreasing the diversity among classifiers. So, in frame of MPOEC, we adopt a greedy iterative approach [13] to search for an optimal combination of classifiers. We describe this approach very briefly. So, let ZT = {(Zt, 0,.)J is the training set and
we have a finite dictionary D = {dx,d2,...,dM} including M functions. The D is given in given in the Hilbert Space H. We can try to find some target function q e H , Z —— © . The sparse approximations of q can be found out in the dictionary D, which is expansion of the form qP = ZP= angn, V an e R1 , P is the number of the
n
basis functions (BF) in the expansion, {gn J c D is basis of the expansion, {anJ is the set of coefficient of this expansion. The qN is sparse approximation of function q. The approximation qN is built with usage of P basis functions from D. The approximation error has the following simple for
||Rp|| 2 =| |q - qp| |2 = Z (0, - qp (Z, ))2.
i=1
The basis function {gn J and the coefficients {a n J are selected by applying the iterative greedy method, in the interest to minimizing the RP . So, in frame of MPOEC approach, the ensemble of classifiers F(0 | Z) is regarded as a BF of the dictionary D, and each LMC fk (0 | z k )e F(0 | Z) obtains the coefficient ak by minimizing RP. As result of MPOEC procedure usage, the content of ensemble F(0 | Z) is changed. If ak ^ 0, the corresponding classifier fk (0 | zk) remains in ensemble, otherwise the fk (01 zk) is eliminated of ensemble F(0 | Z). On each of MPOEC procedure iteration t we have the various combination Ft(0 | Z) of LMC from F(0 | Z). For convenience let us denote an index of LMC inside the temporary combination Ft(01Z) (temporary ensemble) as j. On each iteration t the prediction error is calculated as following
e, = N-1 Z N= 1 (0, - qt (Z )) = N-1 Z N= 1 (0, - s,gn (z j = 1 a JfJ (01 z,,))) .
Here Zt ={z1,.,z2,.,...,zm,.J, t- number of iteration, aj is the coefficient gained inj-th iteration. On each iteration t the resides Rt refined by the formula R, = Rt-1 - atft(•), and coefficients {aj J are calculated as following formula a j ={ ft, Rt-^||f || 2, j = 1, . | Ft ( ) . The MPOEC procedure iterations are repeated until prediction error et is more than prior given threshold I > 0. So, the MPOEC method selects LMC according to diversity between a pair of LMC to increase the diversity of the ensemble F (•). On the other hand, the increase in the
diversity will not reduce the accuracy of individual classifiers in the optimization process of MPOEC approach. MKL Weighing of Inversely as Lipschitz Constants (WILC-MKL)
Let us consider the brand new modification of the MKL that differ from classical MKL by method choice of linear combination parameters. The motivation of this approach is using some intrinsic properties of LMC. The fact is that value of Lipschitz Constant significantly determines of the LMC properties. Simply speaking, the Lipschitz classifier decision function has to a small Lipschitz constant. This feature comes from well-known regularization principle, which recommends avoid using discriminative functions with a high variation. So, LMC's with small LC are more preferable for providing of stable classification process. In other words, classifiers with small LC provide the greater generalization ability of classification system: such difficulties have
lower complexity to avoid overfitting. Hence, in formula o
f F(0|F(01 Z)) LMC's with small LC must get
weight coefficients with bigger value. Let us call this approach to modification of MKL as Weighing of Inversely as to value of the Lipschitz Constant (WILC) or WILC-MKL. In frame of WILC-MKL approach to LMC-ensemble F(0 | Z) we have the following discriminative function:
F„lc (Z) = Arg Max (zm=1 °e,*pk,e ( (Z )T ae + b0 )).
Here °e,k = L-\k (zm=1 Ltjpj,e)k = 1,...m , Oe,tpk,e > 0,Zm=1 Oe,tpM = 1, Le,t is Lipschtz Constant °f
discrimination function Kk (Z)T ae + be. Thus, using WILC-MKL, we make attempt to improve the
generalization ability of MKL by considering information about variation characteristics of classifiers discrimination functions. As it was shown in series of practical experiments, usage of WILC-MKL allows considerably improve the performance of LMC-ensemble in some practical cases.
Results of practical usage
All of above described methods were used for multichannel TSEV classification in C-OTDR system of railways monitoring. This system was successfully installed on the railways test area (RTA) of Kazakhstan Railways Company (JSC NC "KTZ") in august of 2014, and this system continues to operate. The RTA is located at a distance of 10 km from Astana City. The FOS which has been installed on RTA has length around 2000 m, depth of the FOS laying is 50 cm approximately, and FOS offset from rails is 5 m. Parameters of the C-OTDR system:
- duration of the probe pulse is 50-200 ns;
- period of probe pulse belongs to the range 50-300 ps;
- laser wavelength is 1550 nm.
In this case, the main problem is to fusion of multichannel data to classify the TSEV with maximum accuracy. As was said above, for each C-OTDR channel Ch(Kk) are used appropriate D binary classifiers
fk (e, | z k), e, e© . Each LMC fk (e, | z k) is binary classifier, which divides the feature space (Z, d ) into two classes e, and © \ e,. Each LMC fk ( ) was trained independently, and each LMC uses the same set of features in the space (Z, d). The (Z, d) is the ordinary GMM-vector space [15]. We describe the procedure for calculation of the GMM-vectors very briefly. On feature extraction phase for each speckle pattern obtained in the probing period T for each of the channel are built Linear-Frequency Spaced Filterbank Cepstrum Coefficients (LFCC). In our case these features are based on 10 linear filter-banks (from 0.1 to 500 Hz) derived cepstra. Thus, 10 static and 10 first-order delta coefficients were used, giving the feature order m = 20. Further, approximation of the probability distribution function of the feature vectors (LFCC) by semi-parametric multivariate probability distribution model, so-called Gaussian Mixture Models (GMM), was carried. Presently, the GMM is one of the principal methods of modeling broadband acoustic emission sources (including TSEV) for their robust identification. The GMM of TSEV feature vectors distribution is a weighted sum of J components densities [15] and
given by the equation P (x ) = w ^ (x), where x is a random m-vector, w s =(ws1,...wsJ )e RJ,
Bs(x) = (( 1 (x),..Bj (x)) e RJ, VBsl(x) = ((2n)m/2 |S„|"2fexp(-2(x-p J S-/ (x-p„)),
Xs = {((,, p,, ) |, = 1, J}. In general, diagonal covariance matrices "Lsi are used to limit the model size. The model parameters Xs characterize a SEV in the form of a probabilistic density function. During training, those parameters are determined by the well-known expectation maximization (EM) algorithm [15]. In the described experiments value J was equal to 1024. Thus, for identification of TSEV class, each TSEV is modeled by a GMM-vector and is referred to as his model parameters X e Z . The classic SVM with Bhattacharyya-kernel [3] was used as the LMC.
Priori defined target classes of TSEV, which collectively makes up a finite set © . For example, in case of railways monitoring the array © consists of the following TSEV classes: "train", "hand digging the soil", "chiseling ground scrap", "pedestrian", "group of pedestrian", "passenger car", "heavy equipment excavator", "easy excavation equipment". Five alternative approaches for multichannel data fusion were compared on stage of TSEV classification. In particular, MKL, LP-p, LP-B, MPOEC, and WILC-MKL approaches were used. The results of using these methods as parts of the C-OTDR system are presented in Table.
In the process of using the method WILC-MKL values of Lipschitz Constants were evaluated numerically for each LMC from ensemble F(0 | Z). The volumes of training sets were equal for each of various data fusion approaches, but those volumes were different for various TSEV types. Presented results prove that the WILC-MKL and LP (p and B) are more effective with respect to MKL and MPOEC approaches. At the same time, WILC-MKL is more effective compared to MKL, but the LP-B is the best approach for a fusion of multichannel data in C-OTDR monitoring systems. It is important; the LP-B approach requires more computing resources than the WILC-MKL approach, wherein the accuracies of those methods are close. That is why the WILC-MLK approach is preferable from the practical point of view.
Method Type of SEV Accuracy Volume of training set
MKL "hand digging the soil" 76% 60
"chiselling ground scrap" 79% 60
"pedestrian" 78% 80
"group of pedestrians " 79% 30
"passenger car" 79% 50
"train" 100% 150
"heavy equipment excavator" 81% 20
"easy excavation equipment" 83% 20
"shrew digging the ground" 81% 30
LP-ß "hand digging the soil" 81% 60
"chiselling ground scrap" 83% 60
"pedestrian" 81% 80
"group of pedestrians " 83% 30
"passenger car" 80% 50
"train" 100% 150
"heavy equipment excavator" 85% 20
"easy excavation equipment" 86% 20
"shrew digging the ground" 84% 30
LP-B "hand digging the soil" 82% 60
"chiselling ground scrap" 85% 60
"pedestrian" 79% 80
"group of pedestrians " 84% 30
"passenger car" 81% 50
"train" 100% 150
"heavy equipment excavator" 86% 20
"easy excavation equipment" 88% 20
"shrew digging the ground" 88% 30
MPOEC "hand digging the soil" 72% 60
"chiselling ground scrap" 79% 60
"pedestrian" 75% 80
"group of pedestrians " 80% 30
"passenger car" 79% 50
"train" 100% 150
"heavy equipment excavator" 81% 20
"easy excavation equipment" 82% 20
"shrew digging the ground" 83% 30
WILC-MKL "hand digging the soil" 81% 60
"chiselling ground scrap" 82% 60
"pedestrian" 78% 80
"group of pedestrians " 83% 30
"passenger car" 84% 50
"train" 100% 150
"heavy equipment excavator" 84% 20
"easy excavation equipment" 87% 20
"shrew digging the ground" 86% 30
Table. The practical detection results
Conclusions
This paper describes results of comparison of various multichannel data fusion approaches for TSEV classification including MKL, LP-p, LP-B, MPOEC and WILC-MKL. The practical usage of these approaches proves better effectiveness of LP-B approach to fusion of multichannel data for classification of TSEV type. A brand new approach, WILC-MKL, was suggested for multichannel data fusion. This approach is simple to use and performs well in a C-OTDR classification subsystem.
References
1. Choi K.N., Juarez J.C., Taylor H.F. Distributed fiber-optic pressure/seismic sensor for low-cost monitoring // Proceedings of SPIE - The International Society for Optical Engineering. 2003. V. 5090. P. 134-141. doi: 10.1117/12.484911
2. Timofeev A.V., Egorov D.V. Multichannel classification of target signals by means of an SVM ensemble in C-OTDR systems for remote monitoring of extended objects // Proc. MVML-2014. Prague, 2014. V. 1. P. 22-32.
3. Timofeev A.V. The guaranteed estimation of the Lipschitz classifier accuracy: confidence set approach // Journal of the Korean Statistical Society. 2012. V. 41. N 1. P. 105-114. doi: 10.1016/j.jkss.2011.07.005
4. Von Luxburg U., Bousquet O. Distance-based classification with Lipschitz functions // Journal of Machine Learning Research. 2004. V. 5. P. 669-695.
5. Murty M.N., Devi V.S. Pattern Recognition: an Algorithmic Approach (Undergraduate Topics in Computer Science). London: Springer, 2011. 263 p.
6. Hofmann T., Sholkopf B., Smola A.J. Kernel methods in machine learning // Annals of Statistics. 2008. V. 36. N 3. P. 1171-1220. doi: 10.1214/009053607000000677
7. Lanckriet G.R.G., Cristianini N., Bartlett P., Et Ghaoui L., Jordan M.I. Learning the kernel matrix with semidefinite programming // Journal of Machine Learning Research. 2004. V. 5. P. 27-72.
8. Bach F.R., Lanckriet G.R.G., Jordan M.I. Multiple kernel learning, conic duality, and the SMO algorithm // Proc. 21st Int. Conf. on Machine Learning (ICML 2004). Banff, Canada, 2004. P. 41-48.
9. Hears M.A., Dumais S.T., Osman E., Platt J., Scholkopf B. Support vector machines // IEEE Intelligent Systems. 1998. V. 13. N 4. P. 18-28. doi: 10.1109/5254.708428
10. Demiriz A., Bennett K.P., Shawe-Taylor J. Linear programming boosting via column generation // Machine Learning. 2002. V. 46. N 1-3. P. 225-254. doi: 10.1023/A:1012470815092
11. Gehler P., Nowozin S. On feature combination for multiclass object classification // Proc. ICCV. 2009. P. 47-50.
12. Weston J., Watkins C. Support vector machines for multi-class pattern recognition // Proc. 7th European Symposium on Artificial Neural Networks. 1999. P. 219-224.
13. Mallat S.G., Zhang Z. Matching pursuit with time-frequency dictionaries // IEEE Transaction on Signal Processing. 1993. V. 41. N 12. P. 3397-3415. doi: 10.1109/78.258082
14. Mao S., Jiao L.C., Xiong L., Gou S. Greedy optimization classifiers ensemble based on diversity // Pattern Recognition. 2011. V. 44. N 6. P. 1245-1261.
15. Blimes J.A. A gentle tutorial of the EM algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models // Technical Report TR-97-021. Berkley, California: International Computer Science Institute, 1998. P. 97-021.
Audrey V. Timofeev -
Тимофеев Андрей Владимирович
Dr. Habil. Ing., CSO, LPP "EqualiZoom", Astana, 010000, Kazakhstan, [email protected]
доктор технических наук, научный директор, ТОО «Эквализум», Астана, 010000, Казахстан, [email protected]