Santeri Heiskanen

Ph.D. Student at Robot learning lab, Aalto University

I am a PhD student at the Robot learning lab at Aalto University, under the supervision of Prof. Joni Pajarinen. I am also part of the AI-DOC doctoral education pilot, hosted by FCAI. My current research focuses on developing reinforcement learning and generative modelling techniques for complex, combinatorial search spaces, which one may face in many practical problems, such as neural architecture search, molecule generation or scheduling. More broadly, my interests lie at the intersection of sequential decision making, multi-objective optimisation, and their application to practical, real-world problems.

Previously, I received an M.Sc. degree in computer science from Tampere University in 2024. My M.Sc. thesis studied the effect of soft-information sharing between policies in multi-objective reinforcement learning, and it was supervised by Prof. Ville Kyrki from the Intelligent Robotics group at Aalto University. During my studies, I was also fortunate to complete 2 internships at Huawei Finland’s research center, and work in a startup specialising in the green energy transition.

News

Feb 06, 2026 “Pareto-Conditioned Diffusion Models for Offline Multi-Objective Optimization” was accepted to ICLR2026 as an oral presentation. See you in Brazil :brazil:
Nov 12, 2024 Gave a guest lecture about multi-objective reinforcement learning during course “ELEC-E8125 reinforcement learning” at Aalto University. You can checkout the slides from here
Nov 01, 2024 Got accepted into AI-DOC doctoral education pilot program.
Oct 10, 2024 I graduated with M.Sc. (tech) from Tampere University. My thesis, “Generalizing Pareto optimal policies in multi-objective reinforcement learning: An empirical study of hypernetworks” studied the use of hypernetworks in multi-objective reinforcement learning. You can checkout the slides for the study from here.
Aug 22, 2024 Joined Robot learning lab at Aalto University as a PhD student.

Selected publications

  1. pcd_iclr2026.png
    Pareto-Conditioned Diffusion Models for Offline Multi-Objective Optimization
    Jatan Shrestha*, Santeri Heiskanen*, Kari Hepola, and 3 more authors
    In The Fourteenth International Conference on Learning Representations, 2026