Human Feedback

📜 My MSc Thesis: Aligning Language Models with …

I recently defended my MSc thesis in Computer Science and Engineering, on Aligning Language Models with Human Feedback without Reinforcement Learning. The research was supervised by André F.T. Martins, Head of Research at Unbabel, and Sweta Agrawal, Research Scientist at Google, …
