Reinforcement learning agents often behave conservatively or inconsistently at the start of training, leading to the so-called cold-start problem. This project will simulate a simple traffic scenario and train a multi-agent RL-based autonomous vehicle controller. Students will then explore strategies to reduce cold-start effects, such as imitation learning for initialization or reusing past experiences, and evaluate how these methods improve early-stage performance.