Task scheduling by reinforcement learning

Internship proposal.

Title: task scheduling by reinforcement learning
Supervisors: Philippe Preux, Nathan Grinsztajn, Olivier Beaumont, Emmanuel Jeannot.
Duration: 5 to 6 months
When: Spring-Summer 2021
Where: Scool, Inria Lille, Villeneuve d'Ascq, France, if health conditions permit; remotely otherwise. Could also be in Bordeaux.
Keywords: reinforcement learning, task scheduling, high performance computing
Context:
In high performance computing (HPC), a computation is split into a set of tasks. The execution of a task usually depends on the results computed by other tasks. The set of tasks and their dependencies can be organized as a directed acyclic graph (DAG). Given such a DAG and a set of computational resources, the goal is to execute the set of tasks as fast as possible. To execute a DAG, a scheduler is in charge of choosing the next task to perform and the resource to allocate to it.
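To make the scheduling problem concrete, here is a minimal sketch of a greedy list scheduler. It is an illustration only, not the approach studied in this internship. It assumes identical resources, known task durations, and no communication cost; the function name list_schedule and the "longest ready task first" rule are hypothetical choices. A learned policy would replace that hand-written rule.

```python
def list_schedule(durations, deps, n_workers):
    """Greedy list scheduling of a task DAG on identical workers.

    durations: dict mapping task -> execution time
    deps:      dict mapping task -> set of tasks that must finish first
    Returns (makespan, schedule), where schedule is a list of
    (task, worker, start, finish) tuples in decision order.
    """
    preds = {t: set(deps.get(t, ())) for t in durations}
    finish = {}                    # task -> finish time, once scheduled
    worker_free = [0] * n_workers  # time at which each worker becomes idle
    schedule = []
    while len(finish) < len(durations):
        # A task is ready when all of its predecessors have been scheduled.
        ready = [t for t in durations
                 if t not in finish and all(p in finish for p in preds[t])]
        if not ready:
            raise ValueError("cycle or unknown dependency in the task graph")
        # Assumed heuristic: pick the longest ready task (ties broken by name).
        task = min(ready, key=lambda t: (-durations[t], t))
        # Allocate it to the worker that becomes idle first.
        worker = min(range(n_workers), key=lambda w: worker_free[w])
        # The task cannot start before its inputs have been computed.
        data_ready = max((finish[p] for p in preds[task]), default=0)
        start = max(worker_free[worker], data_ready)
        finish[task] = start + durations[task]
        worker_free[worker] = finish[task]
        schedule.append((task, worker, start, finish[task]))
    return max(finish.values(), default=0), schedule


# Toy DAG: c needs a; d needs b and c.
durations = {"a": 2, "b": 3, "c": 1, "d": 2}
deps = {"c": {"a"}, "d": {"b", "c"}}
makespan, schedule = list_schedule(durations, deps, n_workers=2)
print(makespan)  # 5
```

The two decisions made inside the loop (which ready task to run, and on which resource) are exactly the choices a scheduler has to make; the makespan returned is the quantity to minimize.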
What:
The goal of this internship is to further investigate the approach we proposed in 2020.
Building on what we have done so far, there is already a series of experiments to perform to better assess our approach; we have to go beyond the RL algorithm we experimented with (A2C). More generally, we expect the intern to propose new ideas and test them. The goal of our work is to obtain competitive schedulers that can be used in a real HPC task scheduling environment.
Who:
This internship is tailored as the final project of a master's degree in computer science. We expect a strong background in reinforcement learning. Knowledge of either combinatorial optimization or HPC is a plus.
The intern must read English without difficulty and be able to work in English: give a scientific presentation and, more generally, interact in English. (We do not expect the intern to speak French.)
Bibliography:
- N. Grinsztajn, O. Beaumont, E. Jeannot, and Ph. Preux, Geometric deep reinforcement learning for dynamic DAG scheduling, Proc. ADPRL 2020.
Working environment: this internship is proposed as part of a collaboration between three Inria research groups: HiePACS, Scool, and Tadaam. Scool is a well-known research group in reinforcement learning and bandits, located in Lille. HiePACS and Tadaam are well-known research groups in HPC, located in Bordeaux.
This internship can be the first step to a PhD on a related topic.
