KL-Regularized RLHF with Multiple Reference Models: Exact Solutions and Sample Complexity
- Gholamali Aminian
- Amir R Asadi
- et al.
- 2025
- NeurIPS 2025
Youssef Mroueh is a Principal Research Scientist in IBM since April 2015. He received his PhD in computer science in February 2015 from MIT, CSAIL, where he was advised by Professor Tomaso Poggio.
In 2011, he obtained his engineering diploma from Ecole Polytechnique Paris France, and a master of science in Applied Maths from Ecole des Mines de Paris.
He is interested in Deep Learning, Machine Learning, Optimal transport, multimodal learning, Statistical Learning Theory, Computer Vision and Artificial Intelligence.