Contributions to a system for open reproducible publication research

Supervisors: Diomidis Spinellis, Georgios Gousios

Towards More Effective Querying of Medical Literature in Alexandria3k

By Bas Verlooy

Topic Classification of Publications

By Dayoung Lim

Use of LLMs to Improve Affiliation Disambiguation in Alexandria3k

By Dibyendu Gupta

Author Name Disambiguation Using Large Language Models

By Jelle van Lieshout

Finding your digital sibling: which other GitHub projects are similar to yours?

Supervisors: Sebastian Proksch, Shujun Huang

Finding similar repositories based on the available documentation

By Alexandru Catalin Turcu

Analyzing Similar Build Configurations Across Different GitHub Projects

By Calin Manoli

Contribution of source code identifiers to GitHub project similarity

By Juul Crienen

Discovering Digital Siblings: Quantifying Inter-Repository Similarity Through GitHub Dependency Structures

By Mateusz Rębacz

Finding your digital sibling: Grouping GitHub projects that share certain attributes based on interactions and activities

By Rowan de Bruin

Helping a Hoarder: Cleaning up the Maven Central Repository

Supervisors: Sebastian Proksch, Shujun Huang

Effects of Artifact Age on Maven Dependency Resolution

By Gints Kuļikovskis

Navigating Repositories: Assessing the Impact of External Repositories on Packages in Maven Central

By Jelle Sandifort

Can we extract a relevant, available, and self-contained core of the Maven ecosystem?

By Mathijs van der Schoot

An analysis of Java release practices on GitHub

By Vivian Roest

How Much Data is Enough? Learning Curves for Machine Learning

Supervisors: Tom Viering, Taylan Turan

Prevalence of non-monotonicity in learning curves

By Dinu Gafton

Deciphering Learning Curve Characteristics via K-Means Clustering of Curve Model Parameters

By Enes Arda Ozgur

How do data imbalances affect the learning curve using nearest mean model

By Jia Jie Feng

Learning Curve Extrapolation using Machine Learning

By Pratham Johari

Can patterns be identified amongst learning curves after the application of the K-Means algorithm using point and statistical vectors?

By Pravesha S.P. Ramsundersingh

Inferring user and project information from Scratch repositories

Supervisors: Fenia Aivaloglou, Sole Pera

Clustering Scratch projects by code complexity traits and project traits

By Brent Meeusen

Unraveling Sentiment Threads: An Analysis of Comment Sentiment and User Participation in Scratch Project Creation

By Gert-Jan Schaap

What are the types of projects that Scratch users create?

By Wojciech Marczuk

Landmarks in planning

Supervisors: Sebastijan Dumančić, Issa Hanou

Using landmarks as Intermediary Goals or as a Pseudo-Heuristic

By Bart van Maris

Extending SymbolicPlanners with forward propagation landmark extraction

By Ka Fui Yang

Re-evaluating the Full Landmark Extraction Algorithm

By Noah Tjoen

The effect of ordered landmarks on plan length in forward search

By Paul Tervoort

Exploring effectivity of Hm landmark extraction on modern planning domains

By Pauline Hengst

Learning reduced-order mappings between functions

Supervisors: David Tax, Mahdi Naderibeni

An Investigation of Generalization on the Viscosity Parameter

By Amund Kiste

An Investigation of Suitable Inputs and Outputs

By Bo Bakker

On Robustness of Reduced Order Mappings between Function Spaces Against Noise

By Pablo Lacombe

Malicious Parties in Multi-Server Federated Learning

Supervisors: Lydia Y. Chen, Jiyue Huang

Exploring the Impact of Single-Character Attacks in Federated Learning Language Classification

By Jan van der Meulen

Evaluating differential privacy on language processing federated learning

By Quinten van Opstal

Robustness Against Untargeted Attacks of Multi-Server Federated Learning for Image Classification

By Todor Mladenovic

Optimization problems and benchmarking for targeted sequencing of viral metagenomes

Supervisors: Jasmijn Baaijens, Jasper van Bemmelen

A Mismatch Relaxation to the Primer Selection Process of an Amplicon Sequencing Algorithm

By Dean Polimac

Benchmarking AmpliDiff for monkeypox, HIV-1 and Influenza-A

By Kevin den Boon

Fragmenting Genome Sequences by Coding Regions to Improve Performance of the AmpliDiff Algorithm for Large Genomes

By Samuel Karskens

Heuristic-Based Primer Set Minimization for PCR

By Thys Kok

Procedural Tree Generation

Supervisors: Elmar Eisemann, Petr Kellnhofer, Lucas Uzolas

Inverse Modelling of 2D Trees Using Graph Convolutional Networks

By Erfan Mozafari

Procedural Tree Generation, How efficiently predict branching structures from foliage

By Sam Taklimi

Compressing 3D Trees For Faster Rendering

By Sebastian Manda

Isolating a Tree's Skeleton Using a 3D reconstruction

By Shashwat Sahay

Synthetic data generation for the optimization of strains using generative models

Supervisors: Thomas Abeel, Paul van Lent

Optimizing strains in Metabolic Engineering: comparative analysis of β-Conditional Variational Auto-encoder and Probabilistic PCA for synthetic data generation

By Doruk Kirbeyi

Synthetic data generation for the optimization of strains in metabolic engineering using generative adversarial networks

By Marcin Jarosz

Synthetic data generation for the optimization of strains in metabolic engineering using latent space representations derived from a Conditional Variational Autoencoder

By Neil Alwani

Understanding and Modeling Human Behavior in Quitting Smoking

Supervisors: Willem-Paul Brinkman, Nele Albers

Examining the Efficacy of Persuasive eHealth Applications in Facilitating Smoking Cessation: An Analysis of Competency Based Activities

By Aaron Maguire

Assessing Behavioral Changes through eHealth in Smoking Cessation

By Jonathan van Oudheusden

Use Reinforcement Learning to Choose Activities for Preparing to Quit Smoking

By Meng Zhang

Towards Effective Smoking Cessation: Understanding the Needs of Daily Smokers from eHealth Chatbot Interactions

By Vlad-Gabriel Iftimescu

We need to learn how to teach Machine Learning

Supervisor: Gosia Migut

Navigating the Pedagogical Landscape: An Exploration of Machine Learning Teaching Methods

By Andreea Zlei

A comparative analysis of coding approaches in machine learning among computer science students and non-computer science students

By Grgur Dujmovic

Long-Term Memory Retention of Educational Content

By Ismail Music

The impact of Goal-Oriented Visualization on Academic Performance

By Kriss Tesink

Created by Jordi Smit