Amirreza’s Page

Hi! I'm Amirreza Velae. I'm currently in my final year as an undergraduate student at Sharif University of Technology, majoring in Electrical Engineering with a minor in Mathematics. I'm fascinated by everything related to intelligence and robotics, especially reinforcement learning and optimization. Outside of academics, you’ll usually find me playing chess or soccer, or following chess tournaments. I'm also a big fan of movies and novels, though I don’t get to enjoy them as much these days since I’m quite busy figuring out how an imaginary gambler should play against some fictional bandit machines. For a more formal introduction, please see the section below.

Feel free to reach out if you have a research opportunity, a basement full of spare GPUs, happen to be a time traveler, or just want to ask me about my favorite band and share yours. I also love Persian rugs, so if you have one to show off, that would be great too!

Formal Bio

My name is Amirreza Velae, and I am an undergraduate student in Electrical Engineering and Mathematics at Sharif University of Technology. My academic interests center on the intersection of intelligence and computation, with a focus on modeling and realizing intelligence in machines. My primary research interest is reinforcement learning, which I view as a promising framework for advancing toward general intelligence. I am especially drawn to the theoretical foundations of deep reinforcement learning, bandit algorithms, and statistical learning theory.

For my B.Sc. thesis, I investigated the convergence properties of Trust Region Policy Optimization (TRPO) under the supervision of Prof. Hamed Shah-Mansouri. In the summer of 2024, I conducted remote research at the Max Planck Institute for Intelligent Systems with Amin Charusaie. Additionally, I collaborated with Prof. Mohammad Aliannejadi on debiasing ranking algorithms, focusing on methods to mitigate gender bias in language models. In the summer of 2025, I am working under the supervision of Arash Bahari Kordabadi and Prof. Sadegh Soudjani at the Max Planck Institute for Software Systems, where my research centers on the development of second-order reinforcement learning algorithms for linear-quadratic systems.

🎓 Education

Allameh Jafari High School (NODET) - Graduated in 2021
Sharif University of Technology - Expected graduation in 2026

🔍 Research Interests

Reinforcement Learning
Optimization

💡 Selected Projects

Debias Ranking with BackPack Language Model - Advisor: Prof. Mohammad Aliannejadi - Preprint
Optimization in Trust Region Policy Optimization (B.Sc. Thesis) - Advisor: Prof. Hamed Shah-Mansouri - Ongoing
Second-order Methods for Reinforcement Learning - Advisors: Arash Bahari Kordabadi & Sadegh Soudjani - Ongoing

"Instead of trying to produce a program to simulate the adult mind, why not rather try to produce one which simulates the child's? If this were then subjected to an appropriate course of education one would obtain the adult brain."
— Alan Turing