Amirrea’s Page

Hi! I'm Amirreza Velae. I'm currently in my final year as an undergraduate student at Sharif University of Technology, majoring in Electrical Engineering with a minor in Mathematics. I'm fascinated by everything related to intelligence and robotics, specially reinforcement learning and optimization. Outside of academics, you’ll usually find me playing chess or soccer, or following chess tournaments. I'm also a big fan of movies and novels, though I don’t get to enjoy them as much these days since I’m quite busy figuring out how an imaginary gambler should play against some fictional bandit machines. For a more formal introduction, please see the section below.

Feel free to reach out if you have a research opportunity, a basement full of spare GPUs, happen to be a time traveler, or just want to ask me about my favorite music band and share yours. I also love persian rugs, so if you have one to show off, that would be great too!

Formal Amirreza


My name is Amirreza Velae, and I am an undergraduate student in Electrical Engineering and Mathematics at Sharif University of Technology. My academic interests center on the intersection of intelligence and computation, with a focus on modeling and realizing intelligence in machines. My primary research interest is reinforcement learning, which I view as a promising framework for advancing toward general intelligence. I am especially drawn to the theoretical foundations of deep reinforcement learning, bandit algorithms, and statistical learning theory.

For my B.Sc. thesis, I investigated the convergence properties of Trust Region Policy Optimization (TRPO) under the supervision of Prof. Hamed Shah-Mansouri. In the summer of 2024, I conducted remote research at the Max Planck Institute for Intelligent Systems with Amin Charusaie, where I explored incorporating human feedback into neural networks using Bayesian layers. Additionally, I collaborated with Prof. Mohammad Aliannejadi and Prof. Lall on debiasing ranking algorithms, focusing on methods to mitigate gender bias in language models. In the summer of 2025, I am working under the supervision of Arash Bahari Kordabadi and Prof. Sadegh Soudjani at the Max Planck Institute for Software Systems, where my research centers on the development of second-order reinforcement learning algorithms for linear-quadratic systems.

🎓 Education

  • Allameh Jafari High School (NODET) - Graduated in 2021
  • Sharif University of Technology - Expected graduation in 2026

🔍 Research Interests

  • Reinforcement Learning
  • Optimization

💡 Selected Projects

"The exploiter never tells the exploited how he's exploiting them."
Jean-Luc Godard