I am a Research Scientist in IBM Dublin Research Lab in the Exascale Systems Group and a former Research Associate in the Distributed Computing group of the Innovative Computing Laboratory.
I defended my PhD on fault tolerance for MPI communication libraries under the supervision of Franck Cappello and Joffroy Beauquier in 2006. I worked on fault tolerance protocols for MPI libraries through the MPICH-V and OpenMPI projects and their runtime environments. I also contributed to the DAGuE project on scalability and performance issues of runtime environment for large scale systems under the supervision of Jack Dongarra.
My areas of interest include Fault tolerance, Scalability, Middleware for large scale HPC systems.