Uniavisen
Københavns Universitet
Uafhængig af ledelsen

Ph.d.-forsvar

PhD defense by Tobias Sommer Thune

Ph.d.-forsvar — On 1 July  2020, at 15:30, Tobias Sommer Thune will defend his PhD thesis. The PhD defense will take place virtually via Zoom and you can join via this link: https://ucph-ku.zoom.us/j/61659609588

Info

Date & Time:

Place:
https://ucph-ku.zoom.us/j/61659609588

Hosted by:
Datalogisk Institut

Cost:
Free

Title

Exploiting Easiness and Overcoming Delays in Online Learning

Abstract

In machine learning we work towards building algorithms that can solve complex tasks by learning how to solve them, rather than knowing how to solve them by design. Online learning is the subfield focusing on simultaneous execution and learning — that is learning while a task is “live” or online. Imagine a medical trial, where we want to identify the best drug for some illness. Instead of setting aside a portion of patients for testing, we might be able to cure more people by considering all patients as an online task and optimise the total number we cure. An algorithm in this scenarios must balance on one hand being adventurous and exploring the options in order to sufficiently gather knowledge of the task, with choosing what seems to be the best option in order to be performant on the other.

Using the theoretical framework of “multi-armed bandits”, we explore two variations of online learning scenarios:

We construct an algorithm capable of performing better if the task has a certain structure making it easier. This is possible for two kinds of structure simultaneously without having knowledge about the setting, and while remaining robust to harder settings.

Secondly we explore how to deal with the feedback from the algorithm’ s actions being delayed. We expand prior approaches to the case where the delay might vary in time. Here we develop a new technique of skipping feedback if it is excessively delayed and prove a conjecture of the potential performance for this algorithm. In addition we show that in such problems our algorithms perform much better than what was previously thought possible, and design examples of tasks where this is the case.

Assessment Committee

Christian Igel, Professor (DIKU) (Head of committee)
Alina Beygelzinner, Senior Research Scientist
Yishay Mansour, Professor
Moderator at this defense: Sadegh Talebi, assistant professor

Principal supervisor

Yevgeny Seldin, Associate professor

 

For an electronic copy of the thesis, please contact phdadmin@di.ku.dk

Seneste