Oliver Hennhöfer

Research staff at Hochschule Karlsruhe, Intelligent Systems Research Group.

I work on statistical methods and software for machine learning systems that need to stay reliable under uncertainty.

About

I am interested in machine learning systems that remain useful when data are scarce, noisy, non-stationary, or arrive sequentially. My work sits at the intersection of statistics and machine learning, with a focus on anomaly detection, conformal inference, uncertainty quantification, and online false discovery rate control.

I also build research software for reproducible experiments and applied workflows.

HKA ISRG profile

Research

My research focuses on statistically principled machine learning systems for uncertain and dynamic environments.

  • Uncertainty quantification. Conformal inference methods that make predictions, scores, and decisions statistically interpretable under stated assumptions.
  • Conformal anomaly detection. Methods that convert anomaly scores into calibrated p-values and support principled outlier detection with error control.
  • Low-data and shifted settings. Weighted and adaptive conformal procedures for data scarcity, local non-stationarity, and distribution shift.
  • Online error control. Online FDR and FWER procedures for sequential testing when decisions must be made before the full dataset exists.

Notes

From Scores to Decisions

15 June 2026

Anomaly detectors often stop at a score. The score may rank observations in a useful way, but it does not say what level of evidence should trigger an action, nor how often that action is expected to be wrong under a stated reference condition.

Conformal inference is useful here because it gives a way to turn a score into a calibrated quantity. In anomaly detection, that often means converting a raw anomaly score into a p-value relative to calibration data. The resulting value is not magic, and it still depends on the assumptions behind the calibration set, but it gives the downstream decision a statistical meaning.

The Software Boundary

For research software, I like the boundary where a library accepts the model's scores and returns calibrated evidence. That keeps the anomaly detector itself flexible: it can be a classical method, a neural model, or a domain-specific scoring rule. The calibration layer then handles the part that most ad hoc thresholds leave implicit.

This is the design direction behind nonconform: keep the scoring model separate, expose the calibration assumptions clearly, and make the final decision rule inspectable. In practical workflows, this tends to be more maintainable than treating every detector as a sealed system with its own thresholding convention.

Sequential Settings

The same issue appears when tests arrive over time. A single threshold can look reasonable in isolation and still behave poorly when repeated decisions accumulate. Online FDR procedures address that by controlling the error rate across a stream of hypotheses, adapting the available testing budget as evidence arrives.

The common thread is simple: scores are useful, but decisions need accounting. Good APIs should make that accounting visible.

Projects

  • nonconform

    Python, conformal inference, anomaly detection, FDR control

    Conformal anomaly detection and change-point detection. The library turns anomaly scores into calibrated p-values and supports FDR-controlled decisions through split, cross, and weighted conformal strategies.

    GitHub | Docs | PyPI

  • online-fdr

    Python, online FDR, FWER, sequential testing

    Sequential multiple-testing procedures for online false discovery rate and family-wise error rate control, including LORD, SAFFRON, ADDIS, alpha-spending, and batch baselines.

    GitHub | Docs

  • oddball

    Python, datasets, benchmarks

    A lightweight collection of anomaly detection datasets for reproducible experiments.

    GitHub

  • aberrant

    Python, streaming, anomaly detection

    Online anomaly detection algorithms for streaming settings.

    GitHub

Publications

  1. TalkTag: Fine-Grained Morphosyntactic Error Annotation for Transcribed Speech

    Shamira Venturini, Oliver Hennhöfer, Steffen Kinkel, Jannik Strötgen. Preprint, arXiv:2606.01820, 2026.

    arXiv

  2. Conformal Anomaly Detection in Python: Moving Beyond Heuristic Thresholds with 'nonconform'

    Oliver Hennhöfer, Maximilian Kirsch, Christine Preisach. Preprint, arXiv:2605.13642, 2026.

    arXiv

  3. Between Resolution Collapse and Variance Inflation: Weighted Conformal Anomaly Detection in Low-Data Regimes

    Oliver Hennhöfer, Christine Preisach. Preprint, arXiv:2603.23205, 2026.

    arXiv

  4. Interdisciplinary Harmonies: A Story-Driven Course on AI and Music to Increase Interest in Computer Science

    Kai Marquardt, Qiongdan Shang, Oliver Hennhöfer, Lucia Happe. ECSEE 2025, pp. 145-153.

    DOI

  5. Leave-One-Out-, Bootstrap- and Cross-Conformal Anomaly Detectors

    Oliver Hennhöfer, Christine Preisach. IEEE ICKG 2024, pp. 110-119.

    DOI | arXiv

  6. Multidimensional Exploratory Spatial Data Analysis

    Oliver Hennhöfer, Julian Bruns, Peter Ullrich, Andreas Heiß, Galibjon Sharipov, Dimitrios Paraforos. GI_Forum 2021, pp. 136-151.

    DOI

For a bibliography index, see dblp.

Teaching

  • Machine Learning: Unsupervised Learning, WS/SS 2025
  • Environmental Monitoring, SS 2026, SS 2025, SS 2024, Bachelor
  • Data Science, WS/SS 2024, Master
  • Software Engineering, WS/SS 2023, Bachelor

Contact

Email: oliver.hennhoefer@h-ka.de