I'm currently a PhD student at UC Berkeley being supervised by Stuart Russell. I research mathematics relevant to ensuring that future artifical intelligences who may be much smarter than us behave in a safe way. Discussions of problems in this area are available here, here, and here.
I did my undergrad at the Australian National University, studying the theory of reinforcement learning, mathematics, and theoretical physics. I did my honours year (similar to a research master's degree lasting one year) under Marcus Hutter; you can read my thesis "Resource-bounded Complexity-based Priors for Agents" here.
I'm interested in effective altruism, how we can use our limited resources to do the most good in the world. I also sometimes bet on things, for reasons described by Bryan Caplan and Immanuel Kant.
- Self-Modification of Policy and Utility Function in Rational Agents. arxiv
With Tom Everitt (lead author), Mayank Daswani, and Marcus Hutter.
Presented at AGI 2016, winner of the Kurzweil prize for best paper.
Discusses agents that can modify their source code and predict the result of these modifications, and how to define them so that they don't make modifications that stop them from optimising what we originally told them to optimise.
- Loss Bounds and Time Complexity for Speed Priors. jmlr
With Jan Leike and Marcus Hutter.
Presented at AISTATS 2016.
A discussion of 'speed priors', that is to say priors over infinite sequences of bits that penalise complex strings, where complexity is measured by the length of programs that produce a string, and the time those programs take to run. Builds off Jürgen Schmidhuber's original paper defining his Speed Prior.
- Thesis: Resource-bounded Complexity-based Priors for Agents. pdf
Supervised by Marcus Hutter in 2015.
My honours thesis about speed priors, used both in sequence prediction and in an RL setting. The most interesting results are contained in the AISTATS paper above.