Internship proposal.
- Title: task scheduling by reinforcement learning
- Supervisors: Philippe Preux, Nathan Grinsztajn, Olivier Beaumont, Emmanuel Jeannot.
- Duration: 5 to 6 months
- When: Spring-Summer 2021
- Where: ScooL, Inria Lille, Villeneuve d'Ascq, France, if sanitary conditions permit it, remotely otherwise. Could also be in Bordeaux.
- Keywords: reinforcement learning, task scheduling, high performance computing
- Context:
In high performance computing (HPC), a task is split into a set of sub-tasks. The excutio of tasks usually depend on the result computed by other tasks. The set of tasks and their dependencies can be organized as a directed acyclic graph (DAG). Given such a DAG and a set of computational resources, the goal is to execute the set of tasks as fast as possible. To execute a DAG, a scheduler is in charge of properly choosing the next task to perform, and the resource to allocate to the task.
Scheduling tasks have been studied for decades now. There exists various approaches and algorithms for that. Reinforcement learning naturally fits to solve this sort of problems. However, until very recently, reinforcement learning has failed to provide a competitive approach. In 2020, we have been able to obtain very encouraging results using RL to solve such scheduling problems.
- What:
The goal of this internship is to dig further the approach we have proposed in 2020.
Based on what we did yet, there is already a series of experiments to perform to better assess our approach; we have to go beyond the RL algorithm we experimented with (A2C). More generally, we expect that the intern will propose new ideas and test them. The goal of our work is really to obtain competitive schedulers that can be used in a real task scheduling HPC environment.
- Who:
This internship is tailored as the final project of a master degree in computer science. We expect a strong background in reinforcement learning. Knowledge in either combinatorial optimization, or in HPC is a plus.
The intern will read English w/o any difficulty and (s)he is able to work in English, make a scientific presentation in English, interact in English more generally. (We do not expect that the intern can speak French.)
- Bibliography:
- Working environment: this internship is proposed as part of a collaboration between 3 Inria research groups, HiePACS, Scool, and Tadaam. Scool is a well-known research group in reinforcement learning and bandits, located in Lille. Hiepacs and Tadaam are well-known research groups in HPC, located in Bordeaux.
- This internship can be the first step to a PhD on a related topic.
Back to homepage.