Michael Dorkenwald

I am a PhD student in the QUVA lab at the University of Amsterdam supervised by Yuki Asano and Cees Snoek. I am also part of the ELLIS PhD program in cooperation with Qualcomm.

Before, I received my master's degree in physics from Heidelberg University during which I was part of the research group from Björn Ommer. There, I was working on understanding human and object dynamics within generative frameworks primarily for video synthesis. I had the opportunity for a research visit at Kosta Derpanis's lab in Toronto. Furthermore, I completed an internship in the AWS Rekognition team where I worked on self-supervised video representation learning.

Email  /  Google Scholar  /  Twitter  /  Github  /  LinkedIn

profile photo
News
Research

My research focuses on the intersection of self-supervised learning, video understanding, and vision-language models.

TVBench: Redesigning Video-Language Evaluation
Daniel Cores*, Michael Dorkenwald*, Manuel Mucientes, Cees G. M. Snoek, Yuki M. Asano
ArXiv 2024

ArXiv  /  Code  /  Hugging Face

GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models
M. Jehanzeb Mirza, Mengjie Zhao, Zhuoyuan Mao, Sivan Doveh, Wei Lin, Paul Gavrikov, Michael Dorkenwald, Shiqi Yang, Saurav Jha, Hiromi Wakaki, Yuki Mitsufuji, Horst Possegger, Rogerio Feris, Leonid Karlinsky, James Glass
ArXiv 2024

ArXiv  /  Project Page  /  Code

SIGMA: Sinkhorn-Guided Masked Video Modeling
Mohammadreza Salehi*, Michael Dorkenwald*, Fida Mohammad Thoker*, Efstratios Gavves, Cees Snoek, Yuki M. Asano
Accepted at ECCV 2024

ArXiv  /  Project Page  /  Code

PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs
Michael Dorkenwald, Nimrod Barazani, Cees Snoek*, Yuki M. Asano*
CVPR 2024

ArXiv  /  Project Page  /  Code
Image

SCVRL: Shuffled Contrastive Video Representation Learning
Michael Dorkenwald, Fanyi Xiao, Biagio Brattoli, Joseph Tighe, Davide Modolo
CVPR 2022 I3D-IVU workshop

Image

iPOKE: Poking a Still Image for Controlled Stochastic Video Synthesis
Andreas Blattmann, Timo Milbich, Michael Dorkenwald, Björn Ommer
ICCV 2021

ArXiv  /  Project Page  /  Code

Stochastic Image-to-Video Synthesis using cINNs
Michael Dorkenwald, Timo Milbich, Andreas Blattmann, Robin Rombach, Konstantinos G. Derpanis, Björn Ommer
CVPR 2021

ArXiv  /  Project Page  /  Code
Image

Behavior-Driven Synthesis of Human Dynamics
Andreas Blattmann, Timo Milbich, Michael Dorkenwald, Björn Ommer
CVPR 2021

ArXiv  /  Project Page  /  Code
Image

Understanding Object Dynamics for Interactive Image-to-Video Synthesis
Andreas Blattmann, Timo Milbich, Michael Dorkenwald, Björn Ommer
CVPR 2021

ArXiv  /  Project Page  /  Code
Image

Unsupervised behaviour analysis and magnification (uBAM) using deep learning
Biagio Brattoli*, Uta Buechler*, Michael Dorkenwald, Philipp Reiser, Lineard Filli, Fritjof Helmchen, Anna-Sophia Wahl, Björn Ommer
Nature Machine Intelligence

Article  /  Project Page  /  Code
Image

Unsupervised Magnification of Posture Deviations across Subjects
Michael Dorkenwald*, Uta Büchler*, Björn Ommer
CVPR 2020