User Tools

Site Tools


home

Pedro A. Ortega

Research Scientist
Google DeepMind

This is Ludwig Boltzmann (no joke). Thanks Jordi!

About

I am a Research Scientist in the AI Safety Group at DeepMind. My research focuses on the formal principles of intelligent systems covering aspects such as learning, planning and decision making in both machines and biological organisms. My approach lies at the intersection between machine learning, computational neuroscience, theoretical economics, and physics.

Research Interests

Free Energy Functional (it's also meant to be a puzzling pic)

Most of my work centers on information-theoretic and statistical mechanical approaches to learning and control, leading to contributions in bounded rationality models and recasting adaptive control as a causal inference problem. I have also worked on causal induction, and on game- and decision-theoretic models in computational neuroscience.

To get a sense of my work, please refer to:

I was part of the former Tuebingen group on Sensorimotor Learning and Decision Making, where some of the now widespread ideas on Thompson sampling, causality, and thermodynamic methods for reinforcement learning have originated. Check out its members' sites:

Travel Schedule

Date Where Title
6-8 Jan 2017 Pacific Grove, California Beneficial AI 2017
25 Jan 2017 FHI, Oxford -
3 Mar 2017 FHI, Oxford -
17-26 Mar 2017 Philadelphia -
12 May 12 2017 University of Columbia Talk at Information Processing and Behavioral Variability
15 June 2017 FHI, Oxford -
28-31 July 2017 University of Cambridge Talk at Decision Theory & the Future of AI
7-27 August 2017 Philadelphia -

News (on Twitter)

Timeline of @AdaptiveAgents

AdaptiveAgents avatar

RT @nattyover: I wrote about a potentially major advance in the theoretical understanding of deep learning: https://t.co/jrRGubzuBs
About 2 hours, 23 mins ago by: Pedro A. Ortega (@AdaptiveAgents)

AdaptiveAgents avatar

RT @theRSAorg: What does artificial intelligence mean in practical terms? https://t.co/7HJJ28IpPH https://t.co/pkTMWtWNc7
About 2 hours, 23 mins ago by: Pedro A. Ortega (@AdaptiveAgents)

AdaptiveAgents avatar

TensorFlow Agents: Efficient Batched RL in TensorFlow https://t.co/caDafQ3Hz5
About 2 hours, 46 mins ago by: Pedro A. Ortega (@AdaptiveAgents)

AdaptiveAgents avatar

Causal illusions in children when the outcome is frequent https://t.co/xsHLx0pcvy https://t.co/2Mcs1pn1uf
About 8 hours, 48 mins ago by: Pedro A. Ortega (@AdaptiveAgents)

AdaptiveAgents avatar

Interactive Thompson Sampling for Multi-Objective Multi-Armed Bandits https://t.co/EbqaVUzlG0
About 1 day, 2 hours ago by: Pedro A. Ortega (@AdaptiveAgents)

Contact

Pedro A. Ortega
Google UK
DeepMind 6th Floor
6 Pancras Square
London N1C 4AG
U.K.

home.txt ยท Last modified: 2017/07/23 14:13 by peortega

Page Tools