Wouter Duivesteijn
Hi, I'm Wouter, and I'm incurably curious.
I am an Assistant Professor in Data Mining at the
Technische Universiteit Eindhoven.
My research revolves around Exceptional Model Mining (EMM): a local pattern
mining method where we seek subsets of the dataset that are
interesting, which they are if they satisfy two conditions.
On the one hand, they must be interpretable: we must be able to succinctly describe the definition of a subgroup, so that the knowledge that they represent becomes actionable. On the other hand, they must be exceptional: they must display some kind of behavior that sets them apart from the overall population. The scientific challenges revolve around how to efficiently search for subgroups, and how to express exceptional behavior such that the subgroups we find are meaningful.
CV (last updated: September 08, 2023)
Work-life rebalancing
Every Friday, I am on parental leave. So, my working
week will be Monday-Thursday; please keep this in mind when
getting in touch. You may of course send me emails at any time you want,
as long as we agree that I will not be reading them while I'm out of the
office.
Latest publications
- R.M. Schouten, G.W.J.M. Stevens, S.A.F.M. van Dorsselaer, E.L. Duinhof,
K. Monshouwer, M. Pechenizkiy, W. Duivesteijn: Analyzing the interplay
between societal trends and socio-demographic variables with local
pattern mining: Discovering exceptional trends in adolescent alcohol use
in the Netherlands.
Accepted for presentation at BNAIC/BeNeLearn 2024, to appear.
- N.T.J. van den Berg, B.O. Broekgaarden, D.P.A. Mahieu, J.G.M.J. Martens,
J.M. Niederle, R.M. Schouten, W. Duivesteijn: Generating MNAR Missingness
in Image Data, with Additional Evaluation of MisGAN.
Accepted for presentation at BNAIC/BeNeLearn 2024, to appear.
- R.M. Schouten, W. Duivesteijn, P.J. Räsänen, J. Paul,
M. Pechenizkiy: Exceptional Subitizing Range: Exploring Mathematical
Abilities of Finnish Primary School Children with Piecewise Linear
Regression. In: Proceedings of the European Conference on Machine
Learning and Principles and Practice of Knowledge Discovery in Databases
(ECML PKDD 2024), Part X, pp. 66-82, 2024.
- I. Vloothuis, W. Duivesteijn: RMI-RRG: a Soft Protocol to Postulate
Monotonicity Constraints for Tabular Datasets. In: Proceedings of
the 22nd International Symposium on Intelligent Data Analysis
(IDA 2024), Part I, pp. 16-27, 2024.
- P.J.A.M. Mulders, E.R. van den Heuvel, P. Reidsma, W. Duivesteijn:
Introducting exceptional growth mining - Analyzing the impact of soil
characteristics on on-farm crop growth and yield variability.
In: PLOS ONE 19(1):a0296684, 2024.
You can find my full publication list
sorted by category here, and
sorted by year here.
How to Cheat the Page Limit
I am one of the Proceedings Chairs of ECMLPKDD 2024. I am reprising this role
from previous editions: for the 2022 edition, we wrote
a report on our findings, updating our 2020 paper
on How
to Cheat the Page Limit.
Sound of Science #16
Flemish comedian and science fanatic Lieven Scheire presents a podcast at TU/e,
called Sound of
Science. In this podcast, he discusses the last scientific discoveries
and the role of technology in society with researchers and students. In
Episode 16,
he and I talked about my work (in Dutch). NLP researchers are invited to
use this episode as a training set to let their algorithms distinguish a
Vlaams from a Rotterdams accent in the Dutch language.
Gelfilter dataset
In addition to a new data mining method, our IDA 2020 paper also introduces a
dataset, publicly available for research
purposes free of charge. It is a run to failure time series library,
where the primary task is to predict the Remaining Useful Life of a gel
filter in a chemical plant. You can find more details and the dataset itself
here.
Contact information
Miscellaneous