Aligning Offline Evaluation with Online Performance in Information Retrieval

Supervisors: Avishek Anand

Relevance labeling using Large Language Models for Offline-Online evaluation alignment in Information Retrieval Systems

By Jan Piotrowski

Evaluating Metric Sensitivity to Offline–Online Alignment in Information Retrieval

By Satsuki Udagawa

How does the choice of query influence the alignment between offline and online evaluation of ranking systems?

By Stephan Popov

Evaluating Prompting Strategies for Reliable LLM-Based User Simulation in Information Retrieval

By Ziliang Zhang

Exploring the landscape of budgeted search in program synthesis

Supervisors: Sebastijan Dumančić

Exploring the landscape of budgeted search in program synthesis

By Alexander Jelev

Exploring the landscape of budgeted search in program synthesis

By Chris Preda

Exploring the landscape of budgeted search in program synthesis

By Jelle Römer

Exploring the landscape of budgeted search in program synthesis

By Una Jaćimović

FUN with Algorithms: (Formal) Methods for Solving Singles

Supervisors: Dr. Anna L.D. Latour

Smashing Hitori: An analysis of the strenghts and weaknesses of constraint programming paradigm Pumpkin

By Lesley Smits

Solving Hitori puzzles using Satisfiability Modulo Theories

By Robin Rietdijk

Solving Hitori: Applying Answer Set Programming to Hitori

By Sappho de Nooij

Eating Soup with a Fork: Solving Hitori with Integer Linear Programming

By Sophieke van Luenen

Evaluating Logic Programming in PROLOG for solving Hitori puzzles

By Tom Friederich

Flow decomposition for viral genome analysis

Supervisors: Jasmijn Baaijens, Petr Kellnhofer

Weight sets for paths in minimum flow decomposition

By Agnese Ēlerte

Subpath constraints in ILP-based Minimum Flow Decomposition to improve strain-aware viral genome assembly

By Jona Bedaux

Topology and computational hardness of contig variation graphs in Minimum Flow Decomposition

By Matej Kliment

Graph pre-processing strategies for minimum flow decomposition in haplotypeaware genome assembly

By Senne Drent

Human vs. AI: How good are humans at recognizing different kinds of speech compared to state-of-the-art AI-based automatic speech recognisers?

Supervisors: Odette Scharenborg

Human vs AI: Comparing Transcription Performance in Dutch Older Adults' Speech

By Ansen Weng

Human vs AI: Recognising Teenage Speech

By Garv Singh

Comparing Human Listeners and Dutch ASR on Transcribing Child Speech

By Ilse Huisman

How good are humans at recognizing Flemish speech compared to AI-based automatic speech recognizers?

By Rares Popa

Investigating Contextual Variations in Explaining Plausible Narratives of Social Intention

Supervisors: Hayley Hung, Vitaliy Popov, Arthur Mercier

Investigating Narratives of Social Intention in Restaurant Interactions

By Aleksander Sak

Investigating Contextual Variations in Explaining Plausible Narratives of Social Intention in Driving

By Jang Hun Oh

Investigating Contextual Variations in Explaining Plausible Narratives of Social Intention in Hospitals

By Jyotiradityaa Jaiman

Investigating Contextual Variations in Aviation Social Intention Recognition

By Omer Arslan

LLM Agents for Cryptographic CTF Challenges

Supervisors: Zeki Erkin

Can Large Language Models reason? Investigating Open-Source Cryptographic Reasoning

By Aleksandra Taneva

Measuring LLM Tool-Use Efficiency in Cryptographic Capture-the-Flag Competitions

By Bogdan-Mihai Iordache

Do Agent Architectures Matter for Crypto CTFs?

By Querijn Voet

The Impact of Context Window Constraints on ReAct Agents in Cryptographic CTF Challenges

By Yusuf Köse

Out of distribution generalization of neural networks

Supervisors: Wendelin Böhmer

Impact of Dissimilarity Loss on Out of Distribution Generalization

By Alexandru Cristian Cazacu

When Do Deep Ensembles Improve Robustness to Spurious Correlations?

By Jaouad Hidayat

Evaluation of Similarity Loss on Out of Distribution generalization of Neural Networks

By Johan Bakker

Masking your problems away

By Quinten Nouwens

The Illusion of Ability: The Poisoned Promise of LLM Performance

Supervisors: Maliheh Izadi, Ali Al-Kaswan, Jonathan Katzy

The Illusion of Ability: The Poisoned Promise of LLM Performance. An Evaluation of the Min-K% Prob membership inference attack

By Cosmin Andrei Vasilescu

Evaluating SURP MIA performance on code samples

By Ísak Bieltvedt Jónsson

An Exploratory Study Into Polarized Augment Calibration for Membership Inference in Code LLMs

By Roham Koohestani

Towards Sustainable Continuous Integration: Quantifying the Energy Impact of Test Optimisation and prioritization, batch testing, and SATs

Supervisors: Carolin Brandt, Xutong Liu

The Energy Impact of Batch Testing in Continuous Integration

By Máté Oszkó

Quantification and Comparison of the Energy Impact of Static Analysis Security

By Rafael Petouris Rodríguez de Paterna

Energy Implications of Test Prioritisations in the Continuous Integration Process

By Rohan Cyr Lambert

How Configuration Choices Shape Environmental Impact of Static Analysis Tools

By Sophie van der Linden

Created by Jordi Smit