Generalizing Pareto optimal policies in
Multi-objective Reinforcement learning
Empirical study of hypernetworks
Santeri Heiskanen
Supervisors: Prof. Ville Kyrki1,
Dr. Atanu Mazumdar 1,
Prof. Joni Kämäräinen2
1Aalto University, 2Tampere University
20-07-2024
References
[1] D. M. Roijers, P. Vamplew, S. Whiteson, and R. Dazeley,
“A survey of multi-objective sequential decision-making,”
Journal of Artificial Intelligence Research, vol. 48, 2013.
[2] C. F. Hayes et al., “A practical guide
to multi-objective reinforcement learning and planning,”
Autonomous Agents and Multi-Agent Systems, vol. 36, no. 1, 2022.
[3] X. Chen, A. Ghadirzadeh, M. Björkman, and P. Jensfelt,
“Meta-learning for multi-objective reinforcement learning,”
in 2019 IEEE/RSJ international conference on intelligent robots and systems (IROS),2019.
[4] V. K. Chauhan, J. Zhou, P. Lu, S. Molaei, and D. A. Clifton,
“A brief review of hypernetworks in deep learning,”
Artificial Intelligence Review, vol. 57, no. 6, 2024.
[5] H. Lu, D. Herman, and Y. Yu, “Multi-objective reinforcement learning:
Convexity, stationarity and pareto optimality,” in The
eleventh international conference on learning representations,
2023.
[6] E. Sarafian, S. Keynan, and S. Kraus, “Recomposing the reinforcement
learning building blocks with hypernetworks,” in Proceedings
of the 38th international conference on machine learning,
2021.
[7] J. Xu, Y. Tian, P. Ma, D. Rus, S. Sueda, and W. Matusik,
“Prediction-guided multi-objective reinforcement learning for
continuous robot control,” in Proceedings of the 37th
international conference on machine learning, 2020.
[8] A. B. Alegre Lucas N, D. M. Roijers, A. Nowé, and B. C. da Silva,
“Sample-efficient multi-objective learning via generalized policy
improvement prioritization,” in AAMAS ’23: Proceedings of the
2023 international conference on autonomous agents and multiagent
systems, 2023.
[9] L. M. Zintgraf, T. V. Kanters, D. M. Roijers, F. Oliehoek, and P. Beau,
“Quality assessment of MORL algorithms: A utility-based
approach,” in Benelearn 2015: Proceedings of the 24th annual
machine learning conference of belgium and the netherlands,
2015.
[10] E. Zitzler, J. Knowles, and L. Thiele, “Quality assessment of pareto
set approximations,” in Multiobjective optimization:
Interactive and evolutionary approaches, Springer, 2008.
[11] H.Ishibuchi, H. Masuda, Y. Tanigaki, and Y. Nojima, “Modified
distance calculation in generational distance and inverted generational
distance,” in Evolutionary multi-criterion optimization, 2015.