During the day I'm a Research Engineer at DeepMind where I work on challenging open problems in machine learning and artificial intelligence. My current focus sits at the intersection of Reinforcement Learning, Multi-Agent, Game Theory and Continuous Control.
I grew up in China, moved to France in 2011 to study and I'm now living and working in London.
dm_control: Software and Tasks for Continuous Control
Saran Tunyasuvunakool, Alistair Muldal, Yotam Doron, Siqi Liu, Steven Bohez, Josh Merel, Tom Erez, Timothy Lillicrap, Nicolas Heess, Yuval Tassa
Journal | paper | codeV-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
H Francis Song, Abbas Abdolmaleki, Jost Tobias Springenberg, Aidan Clark, Hubert Soyer, Jack W Rae, Seb Noury, Arun Ahuja, Siqi Liu, Dhruva Tirumala, Nicolas Heess, Dan Belov, Martin Riedmiller, Matthew M Botvinick
ICLR 2020 | paperA Generalized Training Approach for Multiagent Learning
Paul Muller, Shayegan Omidshafiei, Mark Rowland, Karl Tuyls, Julien Perolat, Siqi Liu, Daniel Hennes, Luke Marris, Marc Lanctot, Edward Hughes, Zhe Wang, Guy Lever, Nicolas Heess, Thore Graepel, Remi Munos
ICLR 2020 | paperEmergent Coordination through Competition
Siqi Liu, Guy Lever, Josh Merel, Saran Tunyasuvunakool, Nicolas Heess, Thore Graepel
ICLR 2019 | paper | website | codeHierarchical Visuomotor Control of Humanoids
Josh Merel, Arun Ahuja, Vu Pham, Saran Tunyasuvunakool, Siqi Liu, Dhruva Tirumala, Nicolas Heess, Greg Wayne
ICLR 2019 | paper | demoReinforcement Learning Agents acquire Flocking and Symbiotic Behaviour in Simulated Ecosystems
Peter Sunehag, Guy Lever, Siqi Liu, Josh Merel, Nicolas Heess, Joel Z Leibo, Edward Hughes, Tom Eccles, Thore Graepel
ALIFE 2019 | paperThe Body is not a Given: Joint Agent Policy Learning and Morphology Evolution
Dylan Banarse, Yoram Bachrach, Siqi Liu, Chrisantha Fernando, Nicolas Heess, Pushmeet Kohli, Guy Lever, Thore Graepel
AAMAS 2019 | paperObservational Learning by Reinforcement Learning
Diana Borsa, Nicolas Heess, Bilal Piot, Siqi Liu, Leonard Hasenclever, Remi Munos, Olivier Pietquin
AAMAS 2019 | paper