Home   Projects   CV   Misc    

TAP-Vid: A Benchmark for Tracking Any Point in a Video

Carl Doersch, Ankush Gupta, Larisa Markeeva, Adrià Recasens, Lucas Smaira, Yusuf Aytar, João Carreira, Andrew Zisserman, Yi Yang in NeurIPS Datasets and Benchmarks 2022 [arXiv] [github]

Input-level Inductive Biases for 3D Reconstruction

Wang Yifan, Carl Doersch, Relja Arandjelovic, Joao Carreira, Andrew Zisserman in CVPR 2022 [arXiv]

Kubric: A Scalable Dataset Generator

Klaus Greff, Francois Belletti, Lucas Beyer, Carl Doersch, Yilun Du, Daniel Duckworth, David J. Fleet, Dan Gnanapragasam, Florian Golemo, Charles Herrmann, Thomas Kipf, Abhijit Kundu, Dmitry Lagun, Issam Laradji, Hsueh-Ti (Derek) Liu, Henning Meyer, Yishu Miao, Derek Nowrouzezahrai, Cengiz Oztireli, Etienne Pot, Noha Radwan, Daniel Rebain, Sara Sabour, Mehdi S. M. Sajjadi, Matan Sela, Vincent Sitzmann, Austin Stone, Deqing Sun, Suhani Vora, Ziyu Wang, Tianhao Wu, Kwang Moo Yi, Fangcheng Zhong, Andrea Tagliasacchi in CVPR 2022 [arXiv] [github]

Perceiver IO: A General Architecture for Structured Inputs & Outputs

Andrew Jaegle, Sebastian Borgeaud, Jean-Baptiste Alayrac, Carl Doersch, Catalin Ionescu, David Ding, Skanda Koppula, Daniel Zoran, Andrew Brock, Evan Shelhamer, Olivier Henaff, Matthew M. Botvinick, Andrew Zisserman, Oriol Vinyals, Joao Carreira in ICLR 2022 [arXiv]

Inferring a Continuous Distribution of Atom Coordinates from Cryo-EM Images using VAEs

Dan Rosenbaum, Marta Garnelo, Michal Zielinski, Charlie Beattie, Ellen Clancy, Andrea Huber, Pushmeet Kohli, Andrew W. Senior, John Jumper, Carl Doersch, S. M. Ali Eslami, Olaf Ronneberger, Jonas Adler in NeurIPS 2021 workshop on Machine Learning in Structural Biology [arXiv]

CrossTransformers: spatially-aware few-shot transfer

Carl Doersch, Ankush Gupta, Andrew Zisserman in NeurIPS 2020 [arXiv]

Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning

Jean-Bastien Grill, Florian Strub, Florent Altché, Corentin Tallec, Pierre H. Richemond, Elena Buchatskaya, Carl Doersch, Bernardo Avila Pires, Zhaohan Daniel Guo, Mohammad Gheshlaghi Azar, Bilal Piot, Koray Kavukcuoglu, Rémi Munos, Michal Valko in NeurIPS 2020 (Oral) [arXiv]

Data-Efficient Image Recognition with Contrastive Predictive Coding

Olivier J. Hénaff, Aravind Srinivas, Jeffrey De Fauw, Ali Razavi, Carl Doersch, S. M. Ali Eslami, Aaron van den Oord in ICML 2020 [arXiv]

Sim2real transfer learning for 3D human pose estimation: motion to the rescue

Carl Doersch, Andrew Zisserman in NeurIPS 2019 [arXiv]

Exploiting temporal context for 3D human pose estimation in the wild

Anurag Arnab, Carl Doersch, Andrew Zisserman in CVPR 2019 [arXiv]

Video Action Transformer Network

Rohit Girdhar, João Carreira, Carl Doersch, Andrew Zisserman in CVPR 2019 [arXiv]

A Better Baseline for AVA

Rohit Girdhar, João Carreira, Carl Doersch, Andrew Zisserman in CVPR 2018 ActivityNet Workshop [arXiv]

Kickstarting Deep Reinforcement Learning

Simon Schmitt, Jony Hudson, Augustin Zidek, Simon Osindero, Carl Doersch, Wojciech Czarnecki, Joel Leibo, Heinrich Kuttler, Andrew Zisserman, Karen Simonyan, Ali Eslami in NIPS 2018 Reinforcement Learning Workshop [arXiv]

Learning Visual Question Answering by Bootstrapping Hard Attention

Mateusz Malinowski, Carl Doersch, Adam Santoro, Peter Battaglia in ECCV 2018 [arXiv]

The Visual QA Devil in the Details: The Impact of Early Fusion and Batch Norm on CLEVR

Mateusz Malinowski, Carl Doersch in ECCV 2018 Workshop on Shortcomings in Vision and Language [arXiv]

Multi-task Self-Supervised Visual Learning

Carl Doersch and Andrew Zisserman in ICCV 2017 [arXiv] [Show BibTex]

Supervision Beyond Manual Annotations for Learning Visual Representations

Carl Doersch.
Carnegie Mellon Thesis Dissertation [pdf] [Show BibTex]

Tutorial on Variational Autoencoders

Carl Doersch.
Arxiv Tech Report, June 2016 [arXiv] [Show BibTex]

An Uncertain Future: Forecasting from Static Images using Variational Autoencoders

Jacob Walker, Carl Doersch, Abhinav Gupta, and Martial Hebert.
in ECCV 2016 [webpage] [arXiv] [Show BibTex]

Data-dependent Initializations of Convolutional Neural Networks

Philipp Krähenbühl, Carl Doersch, Jeff Donahue, and Trevor Darrell.
ICLR, 2016 [arxiv]

Unsupervised Visual Representation Learning by Context Prediction

Carl Doersch, Abhinav Gupta, and Alexei A. Efros.
in ICCV 2015 (oral) [webpage] [arXiv] [Show BibTex]

Context as Supervisory Signal: Discovering Objects with Predictable Context

Carl Doersch, Abhinav Gupta, and Alexei A. Efros.
In ECCV 2014 [Show BibTex]

Mid-Level Visual Element Discovery as Discriminative Mode Seeking

Carl Doersch, Abhinav Gupta, and Alexei A. Efros.
In NIPS 2013 [Show BibTex]

What Makes Paris Look like Paris?

Carl Doersch, Saurabh Singh, Abhinav Gupta, Josef Sivic, and Alexei A. Efros.
In SIGGRAPH 2012 (oral)
Republished on the cover of the CACM magazine Dec. 2015 [Show BibTex]

Bounding the Probability of Error for High Precision Optical Character Recognition

Gary B. Huang, Andrew Kae, Carl Doersch, and Erik Learned-Miller.
In JMLR 2012 [pdf] [Show BibTex]

Improving state-of-the-art OCR through high-precision document-specific modeling.

Andrew Kae, Gary B. Huang, Carl Doersch, and Erik Learned-Miller.
In CVPR 2010 [pdf] [Show BibTex]