How Best To Optimise Machine Learning Hyperparameters?

When designing and training a neural network model the hyperparameters include the SGD step size, mini-batch size, gradient decay policy, choice of regularisation etc. Selecting values for these hyperparameters is a key step in obtaining a useful model. While selection is commonly based on heuristics and trial and error, there is also much interest in … Read more

Training A Transformer LLM To Play Tic Tac Toe

Transformer neural nets have transformed natural language processing, but they can also be applied to other sequences not just sequences of words. In particular, they can be applied to game playing, which consists of a sequence of moves and where the task is to predict a good next move. In this project you will investigate … Read more