• No results found

Efficient Ensemble Learning With Support Vector Machines http://esat.kuleuven.be/stadius/ensemblesvm/

N/A
N/A
Protected

Academic year: 2021

Share "Efficient Ensemble Learning With Support Vector Machines http://esat.kuleuven.be/stadius/ensemblesvm/"

Copied!
1
0
0

Bezig met laden.... (Bekijk nu de volledige tekst)

Hele tekst

(1)

Efficient Ensemble Learning With Support Vector Machines

http://esat.kuleuven.be/stadius/ensemblesvm/

Marc Claesen† marc.claesen@esat.kuleuven.be

Frank De Smet‡,⋆ frank.desmet@cm.be

Johan A.K. Suykens† johan.suykens@esat.kuleuven.be

Bart De Moor† bart.demoor@esat.kuleuven.be †KU Leuven, Department of Electrical Engineering, ESAT – STADIUS/iMinds Medical IT

†Kasteelpark Arenberg 10 box 2446, 3001 Leuven, Belgium

‡KU Leuven, Department of Public Health and Primary Care, Environment and Health

‡Kapucijnenvoer 35 blok d box 7001, 3000 Leuven, Belgium

⋆ National Alliance of Christian Mutualities, Medical Management Department Keywords: ensemble learning, support vector machine, classification, free software

We present EnsembleSVM, a free software package con-taining efficient routines to perform ensemble classifi-cation with support vector machine (SVM) base mod-els (Claesen et al., 2014). The EnsembleSVM soft-ware is implemented in C++11 and is licensed under the GNU LGPL. The software is freely available at http://esat.kuleuven.be/stadius/ensemblesvm/. Ensembles of SVM models have merits in several seg-ments of machine learning, including large-scale learn-ing, semi-supervised learning and online learning. In large-scale learning, using ensemble approaches allows base models to be trained on small subsamples, which drastically decreases training time when using nonlin-ear kernels. Experiments show that nonlinnonlin-ear ensem-bles can maintain high predictive accuracy while being much faster to train than traditional SVMs. In a su-pervised context, ensembles are particularly useful for large-scale problems where the predictive performance of linear models is unsatisfactory and standard ker-nel SVMs take too long to train. This is the case in low-dimensional learning problems with many avail-able training instances.

EnsembleSVM provides a set of tools and a full pro-gramming API. Base models are trained using the LIBSVM software (Chang & Lin, 2011). A range of aggregation schemes is provided, both linear and non-linear. Duplicate storage and kernel evaluation of support vectors which are shared between constituent models is avoided, resulting in a smaller memory foot-print and faster prediction than implementations in-volving wrappers. Both training and prediction are parallelized in the tools provided by the library.

Acknowledgments

MC is a PhD student at KU Leuven. JS is a professor at KU Leuven. BDM is a full professor at KU Leuven. Research supported by:

• Research Council KU Leuven: GOA/10/09 MaNet, KUL PFV/10/016 SymBioSys, PhD/Postdoc grants,

• Industrial Research fund (IOF): IOF/HB/13/027 Logic Insulin,

• Flemish Government: FWO: projects: G.0871.12N (Neural circuits); PhD/Postdoc grants; IWT: TBM-Logic Insulin(100793), TBM Rectal Cancer(100783), TBM IETA(130256); PhD/Postdoc grants; Hercules Stichting: Her-cules 3: PacBio RS, HerHer-cules 1: The C1 single-cell auto prep system, BioMark HD System and IFC controllers (Fluidigm) for single-cell analyses; iMinds Medical Information Technologies SBO 2014; VLK Stichting E. van der Schueren: rectal cancer,

• EU: ERC AdG A-DATADRIVE-B.

References

Chang, C.-C., & Lin, C.-J. (2011). LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology, 2, 27:1–27:27. Claesen, M., Smet, F. D., Suykens, J. A., & Moor,

B. D. (2014). EnsembleSVM: A library for ensemble learning using support vector machines. Journal of Machine Learning Research, 15, 141–145.

Referenties

GERELATEERDE DOCUMENTEN

This research is funded by a PhD grant of the Insti- tute for the Promotion of Innovation through Science and Technology in Flanders (IWT-Vlaanderen). This research work was carried

EnsembleSVM is a free software package containing efficient routines to perform ensemble learning with support vector machine (SVM) base models.. It currently offers ensemble

The contribution of this work involves providing smaller solutions which use M 0 < M PVs for FS-LSSVM, obtaining highly sparse models with guarantees of low complexity (L 0 -norm

The proposed approaches i.e L 0 reduced FS-LSSVM and Window reduced FS- LSSVM method introduce more sparsity in comparison to FS-LSSVM and SVM methods without significant trade-off

EnsembleSVM is a free software package containing efficient routines to perform ensemble learning with support vector machine (SVM) base models.. It currently offers ensemble

Compared to the hinge loss SVM, the major advan- tage of the proposed method is that the pin-SVM is less sensitive to noise, especially the feature noise around the decision

The second example shows the results of the proposed additive model compared with other survival models on the German Breast Cancer Study Group (gbsg) data [32].. This dataset

Algorithm 3.1 Three-layered SVM algorithm Initialise all SVMs within appropriate layers Train all hidden SVMs on perturbed dataset for all training epochs do.. Recompute main SVM