Evaluating Prototype Explanations in Machine Learning

Prototype-based post-hoc explanations aim to make model predictions interpretable by presenting representative examples (prototypes) that illustrate how the model arrives at its decisions. Their evaluation often relies on quantitative metrics such as fidelity (how closely prototypes approximate the model’s decision function), coverage (how much of the input space they represent), stability (whether explanations remain consistent under small perturbations), and diversity (whether the prototypes capture distinct, non-redundant behaviours rather than near-duplicates).
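A minimal sketch of how these four metrics could be computed for a given prototype set follows; the nearest-prototype assignment, the toy model, and the `radius` and `noise_scale` parameters are illustrative assumptions rather than a fixed evaluation protocol.

```python
# Sketch: evaluating a prototype set with fidelity, coverage, stability, diversity.
# The prototype choice (class centroids) and all parameters are placeholders.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.datasets import make_classification

def nearest_prototype(X, prototypes):
    """Index of and distance to the closest prototype (Euclidean) for each row of X."""
    dists = np.linalg.norm(X[:, None, :] - prototypes[None, :, :], axis=-1)
    return dists.argmin(axis=1), dists.min(axis=1)

def fidelity(model, X, prototypes, proto_labels):
    """Fraction of points where the nearest prototype's label matches the model's prediction."""
    idx, _ = nearest_prototype(X, prototypes)
    return np.mean(proto_labels[idx] == model.predict(X))

def coverage(X, prototypes, radius):
    """Fraction of points lying within `radius` of some prototype."""
    _, d = nearest_prototype(X, prototypes)
    return np.mean(d <= radius)

def stability(X, prototypes, noise_scale=0.05, seed=0):
    """Fraction of points whose assigned prototype survives a small Gaussian perturbation."""
    rng = np.random.default_rng(seed)
    idx, _ = nearest_prototype(X, prototypes)
    idx_pert, _ = nearest_prototype(X + rng.normal(scale=noise_scale, size=X.shape), prototypes)
    return np.mean(idx == idx_pert)

def diversity(prototypes):
    """Mean pairwise distance between prototypes (higher = more diverse)."""
    d = np.linalg.norm(prototypes[:, None, :] - prototypes[None, :, :], axis=-1)
    n = len(prototypes)
    return d.sum() / (n * (n - 1))

# Toy usage: prototypes chosen naively as per-class centroids of the training data.
X, y = make_classification(n_samples=500, n_features=5, random_state=0)
model = RandomForestClassifier(random_state=0).fit(X, y)
prototypes = np.vstack([X[y == c].mean(axis=0) for c in np.unique(y)])
proto_labels = model.predict(prototypes)
print(fidelity(model, X, prototypes, proto_labels),
      coverage(X, prototypes, radius=2.0),
      stability(X, prototypes),
      diversity(prototypes))
```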

Fairness and Robustness in Risk Detection Models

Risk detection models (such as IBM’s Granite Guardian) are increasingly used to flag harmful prompts and responses in large language model pipelines. These systems are trained on human and synthetic data to identify risks across multiple dimensions, but their reliability and fairness are not guaranteed. They may over-flag certain groups, miss subtle harms, or fail in other unanticipated ways.
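One simple way to probe over-flagging is to compare flag rates across annotated groups. The sketch below assumes a detector exposed as a `detector(prompt) -> bool` callable and a small evaluation set with group labels; the `toy_detector` and example data are placeholders, not Granite Guardian's actual interface.

```python
# Sketch: per-group flag rates for a risk detector (illustrative data and detector).
from collections import defaultdict

def flag_rates_by_group(detector, examples):
    """examples: iterable of (prompt, group) pairs.
    Returns the fraction of prompts flagged as risky per group."""
    counts, flagged = defaultdict(int), defaultdict(int)
    for prompt, group in examples:
        counts[group] += 1
        flagged[group] += int(detector(prompt))
    return {g: flagged[g] / counts[g] for g in counts}

# Toy usage with a placeholder keyword-based "detector".
def toy_detector(prompt):
    return "attack" in prompt.lower()

examples = [
    ("How do I bake bread?", "group_a"),
    ("Describe a phishing attack.", "group_a"),
    ("Plan a birthday party.", "group_b"),
    ("Explain a DDoS attack step by step.", "group_b"),
]
rates = flag_rates_by_group(toy_detector, examples)
disparity = max(rates.values()) - min(rates.values())
print(rates, disparity)
```

The max-minus-min gap used here is only one of several possible disparity measures; false-positive and false-negative rates per group would need labelled harms as well.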

Debugging Classifications with Counterfactual Explanations

This project investigates how post-hoc counterfactual explanations can be used to debug opaque models such as deep neural networks by revealing which feature changes most influence predictions. In applications like anomaly detection, counterfactuals help clarify why certain cases are flagged as abnormal and expose when models rely on spurious correlations or biased patterns. By using counterfactuals as a debugging tool, developers can trace a flagged prediction back to the specific feature changes that would alter it.
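As a rough illustration of the idea, the sketch below runs a greedy, single-feature counterfactual search against a toy classifier; the search strategy, step size, and dataset are assumptions for illustration rather than a specific counterfactual method from the literature.

```python
# Sketch: greedy counterfactual search that perturbs one feature at a time
# until the predicted class flips; the resulting per-feature changes indicate
# which features most influenced the original prediction.
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.datasets import make_classification

def greedy_counterfactual(model, x, step=0.25, max_iters=50):
    """Return a counterfactual for x and the per-feature changes that produced it."""
    x_cf = x.copy()
    original = model.predict(x.reshape(1, -1))[0]
    for _ in range(max_iters):
        if model.predict(x_cf.reshape(1, -1))[0] != original:
            break  # prediction has flipped
        best_feat, best_delta = None, 0.0
        best_prob = model.predict_proba(x_cf.reshape(1, -1))[0, original]
        for j in range(len(x_cf)):
            for delta in (-step, step):
                trial = x_cf.copy()
                trial[j] += delta
                p = model.predict_proba(trial.reshape(1, -1))[0, original]
                if p < best_prob:  # move that most reduces confidence in the original class
                    best_feat, best_delta, best_prob = j, delta, p
        if best_feat is None:  # no single-feature move helps; stop
            break
        x_cf[best_feat] += best_delta
    return x_cf, x_cf - x

# Toy usage: the largest entries of `changes` point to the most influential features.
X, y = make_classification(n_samples=300, n_features=6, random_state=1)
model = GradientBoostingClassifier(random_state=1).fit(X, y)
x_cf, changes = greedy_counterfactual(model, X[0])
print(model.predict(X[:1])[0], model.predict(x_cf.reshape(1, -1))[0], np.round(changes, 2))
```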

Intersectional Fairness in Machine Learning

This project focuses on the rich field of algorithmic fairness, where the goal is to ensure that predictions are not biased against subgroups of the population whilst maximising predictive performance. One key challenge arises when multiple protected attributes are considered together: subgroups defined by their intersections (for example, sex and age band combined) can experience bias that remains invisible when each attribute is audited in isolation, as in the auditing sketch below.
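The sketch below shows one simple intersectional audit: computing the positive-prediction rate for every intersection of the protected attributes. The column names, toy data, and the max-minus-min gap measure are illustrative assumptions, not a prescribed fairness criterion.

```python
# Sketch: positive-prediction rates over intersections of protected attributes.
import pandas as pd

def intersectional_rates(df, protected_cols, pred_col="prediction"):
    """Positive-prediction rate and size for every intersection of the protected attributes."""
    grouped = df.groupby(protected_cols)[pred_col]
    return pd.DataFrame({"rate": grouped.mean(), "n": grouped.size()})

# Toy usage with two protected attributes; the gap between the highest and
# lowest subgroup rate is one simple measure of intersectional disparity.
df = pd.DataFrame({
    "sex":        ["F", "F", "M", "M", "F", "M", "F", "M"],
    "age_band":   ["<30", "30+", "<30", "30+", "<30", "<30", "30+", "30+"],
    "prediction": [1, 0, 1, 1, 0, 1, 0, 1],
})
rates = intersectional_rates(df, ["sex", "age_band"])
print(rates)
print("max subgroup gap:", rates["rate"].max() - rates["rate"].min())
```

Small intersections can have very few samples, so reporting the subgroup size `n` alongside each rate matters in practice.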