Human Feedback

πŸ“œ My MSc Thesis: Aligning Language Models with …

Recently, I successfully defended my MSc Degree in Computer Science and Engineering Thesis on Aligning Language Models with Human Feedback without Reinforcement Learning. This research was supervised by AndrΓ© F.T. Martins, Head of Research at Unbabel, and Sweta Agrawal, Research Scientist at Google, …