• No results found

On data mining in context : cases, fusion and evaluation Putten, P.W.H. van der

N/A
N/A
Protected

Academic year: 2021

Share "On data mining in context : cases, fusion and evaluation Putten, P.W.H. van der"

Copied!
2
0
0

Bezig met laden.... (Bekijk nu de volledige tekst)

Hele tekst

(1)

On data mining in context : cases, fusion and evaluation

Putten, P.W.H. van der

Citation

Putten, P. W. H. van der. (2010, January 19). On data mining in context : cases, fusion and evaluation. Retrieved from https://hdl.handle.net/1887/14600

Version: Not Applicable (or Unknown)

License: Leiden University Non-exclusive license Downloaded from: https://hdl.handle.net/1887/14600

Note: To cite this publication please use the final published version (if applicable).

(2)

Stellingen

behorende bij het proefschrift

‘On Data Mining in Context: Cases, Fusion and Evaluation’

Peter van der Putten

1. Steps beyond the modeling phase in the data mining process can have an important impact on the quality of the end result; research problems can be identified and methods developed t h a t cover these steps, and these methods can generali ze over multiple problems and problem domains (this thesis).

2. The fact that predictive models built on a data set enriched with data fusion can outperform models built just on the original data set, is a paradox (this thesis).

3. Feature construction can decrease as well as increase both bias and variance (this thesis).

4. AIRS is a relative of k-nearest neighbor and an even closer rela tive of competitive learners such as NeuralGaz and Self Organizing Neural Networks (this thesis).

5. Bias variance analysis is a useful tool for diagnosing models beyond standard error estimation, but more research is needed to increase its adoption.

6. Though there are very few examples, it is possible to identify research problems and develop generally applicable methods for the deployment step in the data mining process.

7. The ecologica l study of the data miner in its natural habita t is an interesting new research method for data mining research (adopted from J.J. Gibson, The Ecological Approach to Visua l Perception, 1979).

8. Wit h in the context of data mining, the distinction between statistics, machine learning and neural networks is more a matter of cultural nature than a case of differences between the techniques, and the introduction of data mining as a research community with its own research groups, conferences and journals has contributed to the integration of these fields.

9. The data mining research community can play an active role in reducing the adverse impact of data mining on privacy.

10. In science, the emphasis is placed on validating theories; the creative process to come up wit h new theories in the first place is often ascribed to ingenuity, epiphany, divine intervention or sheer random luck. In art in contrast, the emphasis is on methods to steer the creative process and less on methodologica l valida tion of quality of the end result. In this sense both fields may have a lot to learn from each other.

Referenties

GERELATEERDE DOCUMENTEN

Figure 1 shows four models that could be discovered using existing process mining techniques.. If we apply the α- algorithm [3] to event log L, we obtain model N 1

Publisher’s PDF, also known as Version of Record (includes final page, issue and volume numbers) Please check the document version of this publication:.. • A submitted manuscript is

Tijdens de opgraving werd een terrein met een oppervlakte van ongeveer 230 m² vlakdekkend onderzocht op een diepte van 0,30 m onder het straatniveau. Het vlak

© Omaha System Support, maart 2021 versie 5 Medicatie AIB Adviseren / Instrueren / Begeleiden Discipline Stimuleren: • Therapietrouw Medicatie AIB Adviseren

Net als bij andere mensen die zorg en ondersteuning nodig hebben, zijn mantelzorgers belangrijk voor het langer thuis kunnen wonen door mensen met dementie.. Hoewel mantelzorgers

We consider this family of invariants for the class of those ρ which are the projection operators describing stabilizer codes and give a complete translation of these invariants

Moreover, we solidify our data sources and algorithms in a gene prioritization software, which is characterized as a novel kernel-based approach to combine text mining data

RESVM con- structs an ensemble model using a bagging strategy in which the positive and unlabeled sets are resampled to obtain base model training sets.. By re- sampling both P and U