Bio. I am a senior Research Scientist at DeepMind, focusing on Deep Reinforcement Learning. I pursued my Ph.D. at University of Technology Sydney (UTS), advised by Prof. Yi Yang. I received my Bachelor's degree from Zhejiang University in 2013, under the supervision of Prof. Yueting Zhuang and Prof. Fei Wu.
I spent two fabulous internships in Alphabet/Google's AI groups. In Spring 2017, I was a research intern at DeepMind, working with Dr. Hado van Hasselt and Prof. David Silver. And in Spring 2016, I was a research intern at Google Brain, working with Dr. Pierre Sermanet and Dr. George Toderici.
More details can be found in my CV.
Meta-Gradient Reinforcement Learning with an Objective Discovered Online
Zhongwen Xu, Hado van Hasselt, Matteo Hessel, Junhyuk Oh, Satinder Singh, David Silver
Balancing Constraints and Rewards with Meta-Gradient D4PG
Dan A Calian, Daniel J Mankowitz, Tom Zahavy, Zhongwen Xu, Junhyuk Oh, Nir Levine, Timothy Mann
Discovering Reinforcement Learning Algorithms
Junhyuk Oh, Matteo Hessel, Wojciech M. Czarnecki, Zhongwen Xu, Hado van Hasselt, Satinder Singh, David Silver
A Self-Tuning Actor-Critic Algorithm
Tom Zahavy, Zhongwen Xu, Vivek Veeriah, Matteo Hessel, Junhyuk Oh, Hado van Hasselt, David Silver, Satinder Singh
What Can Learned Intrinsic Rewards Capture?
Zeyu Zheng, Junhyuk Oh, Matteo Hessel, Zhongwen Xu, Manuel Kroiss, Hado van Hasselt, David Silver, Satinder Singh
Discovery of Useful Questions as Auxiliary Tasks
Vivek Veeriah, Matteo Hessel, Zhongwen Xu, Richard Lewis, Janarthanan Rajendran, Junhyuk Oh, Hado van Hasselt, David Silver, Satinder Singh
Watching a Small Portion could be as Good as Watching All: Towards Efficient Video Classification
Hehe Fan, Zhongwen Xu, Linchao Zhu, Chenggang Yan, Jianjun Ge, and Yi Yang
Natural Value Approximators: Learning When to Trust Past Estimates.
Zhongwen Xu, Joseph Modayil, Hado van Hasselt, Andre Barreto, David Silver and Tom Schaul
NIPS 2017 (Spotlight)
Hierarchical Recurrent Neural Encoder for Video Representation with Application to Captioning
Pingbo Pan, Zhongwen Xu, Yi Yang, Fei Wu and Yueting Zhuang
CVPR 2016 [PDF]
Robust Semi-supervised Learning through Label Aggregation
Yan Yan, Zhongwen Xu, Ivor W. Tsang, Guodong Long and Yi Yang
AAAI 2016 [PDF]
A Discriminative CNN Video Representation for Event Detection
Zhongwen Xu, Yi Yang and Alexander G. Hauptmann
CVPR 2015 [PDF]
Content-Based Video Search over 1 Million Videos with 1 Core in 1 Second
Shoou-I Yu, Lu Jiang, Zhongwen Xu, Yi Yang and Alexander G. Hauptmann
Event Detection Using Multi-level Relevance Labels and Multiple Features
Zhongwen Xu, Ivor W. Tsang, Yi Yang, Zhigang Ma and Alexander G. Hauptmann
CVPR 2014 [PDF]
Feature Weighting via Optimal Thresholding for Video Analysis
Zhongwen Xu, Yi Yang, Ivor W. Tsang, Nicu Sebe and Alexander G. Hauptmann
ICCV 2013 [PDF]
How Related Exemplars Help Complex Event Detection in Web Videos?
Yi Yang, Zhigang Ma, Zhongwen Xu, Shuicheng Yan and Alexander G. Hauptmann
ICCV 2013 [PDF]
Complex Event Detection via Multi-source Video Attributes
Zhigang Ma, Yi Yang, Zhongwen Xu, Shuicheng Yan, Nicu Sebe and Alexander G. Hauptmann
CVPR 2013 [PDF]
We Are Not Equally Negative: Fine-grained Labeling for Multimedia Event Detection
Zhigang Ma, Yi Yang, Zhongwen Xu, Nicu Sebe and Alexander G. Hauptmann
ACM Multimedia 2013 [PDF]
Meta-gradient updates for training return functions for reinforcement learning systems
Zhongwen Xu, Hado van Hasselt, David Silver
US Patent App. 16/417,536
Training action selection neural networks using a differentiable credit function
Zhongwen Xu, Hado van Hasselt, Joseph Modayil, Andre Barreto, David Silver
US Patent App. 16/615,042
ILSVRC (ImageNet) 2014 Classification with Provided Data Only
Zhongwen Xu and Yi Yang
[Ranking: Google, VGG, MSRA, Howard, DeepVision, NUS, TTIC, XYZ (ours)] [link]
Theme from Karen Simonyan.