Prediction accuracy and stability of regression with optimal scaling transformations Kooij, A.J. van der

Prediction accuracy and stability of regression with optimal scaling transformations

Kooij, A.J. van der

Citation

Kooij, A. J. van der. (2007, June 27). Prediction accuracy and stability of regression with optimal scaling transformations. Leiden. Retrieved from https://hdl.handle.net/1887/12096

Version: Corrected Publisher’s Version

License: Licence agreement concerning inclusion of doctoral thesis in the Institutional Repository of the University of Leiden

Downloaded from: https://hdl.handle.net/1887/12096

Note: To cite this publication please use the final published version (if applicable).

Prediction Accuracy and Stability of Regression with Optimal Scaling Transformations

Van der Kooij, Anita Jolande,

Prediction Accuracy and Stability of Regression with Optimal Scaling Transformations.

Dissertation Leiden University — With ref. — With Summary in Dutch.

Subject headings: nonlinear regression; CATREG; optimal scaling; transformations; local minima; prediction accuracy; regularization; .632 Bootstrap; Ridge regression; Lasso; Elastic Net

ISBN 978-90-9021936-3

© 2007 Anita J. van der Kooij

Printed by Mostert en van Onderen, Leiden

Prediction Accuracy and Stability of Regression with Optimal Scaling Transformations

Dissertation to obtain the degree of Doctor at Leiden University,

by authority of the Rector Magnificus Prof. mr. P.F. van der Heijden, by decision of the Doctorate Board,

to be defended on Wednesday 27 June 2007 at 16:15

by

Anita J. van der Kooij

born in Boskoop in 1961

DOCTORAL COMMITTEE

Supervisor: Prof. dr. J.J. Meulman

Referee: Prof. J.H. Friedman, Ph.D., Stanford University, USA

Other members: Prof. dr. W.J. Heiser, Prof. dr. M.H. van IJzendoorn, Dr. ing. P.H.C. Eilers, Prof. dr. R.D. Gill

Appendix A CATREG Algorithm

Appendix B CATREG sections from SPSS Categories® 11.0

Appendix C Notation

References

Summary in Dutch (Samenvatting)

Curriculum vitae

Overview of Applications

CATREG for Diamonds data

Prediction accuracy for Ozone data

Effect of number of observations on prediction accuracy for Demographic data

Linear Lasso for Diabetes data

Linear and nonlinear Ridge, Lasso, and Elastic Net for Prostate cancer data

Nonlinear Lasso and dummies-Lasso for Breast cancer data

Nonlinear Lasso and .632 bootstrap for Bulimia Nervosa data
