Click to Move: Controlling Video Generation with Sparse Motion

Pierfrancesco Ardino; Marco de Nadai; Bruno Lepri; Elisa Ricci; Stéphane Lathuilière

Conference paper, Year: 2021

Pierfrancesco Ardino

Role: Author

Università degli Studi di Trento = University of Trento

Marco de Nadai

Role: Author

Fondazione Bruno Kessler [Trento, Italy]

Bruno Lepri

Role: Author

Fondazione Bruno Kessler [Trento, Italy]

Elisa Ricci

Role: Author

Università degli Studi di Trento = University of Trento

Fondazione Bruno Kessler [Trento, Italy]

Stéphane Lathuilière

Role: Author
PersonId : 1058528
IdHAL : stephane-lathuiliere

Multimédia

Département Images, Données, Signal

Institut Polytechnique de Paris

Abstract

This paper introduces Click to Move (C2M), a novel framework for video generation in which the user controls the motion of the synthesized video through mouse clicks that specify simple trajectories for the key objects in the scene. Our model receives as input an initial frame, its corresponding segmentation map, and the sparse motion vectors encoding the input provided by the user. It outputs a plausible video sequence that starts from the given frame and whose motion is consistent with the user input. Notably, our proposed deep architecture incorporates a Graph Convolution Network (GCN) that models the movements of all the objects in the scene in a holistic manner and effectively combines the sparse user motion information with image features. Experimental results show that C2M outperforms existing methods on two publicly available datasets, thus demonstrating the effectiveness of our GCN framework at modelling object interactions. The source code is publicly available at https://github.com/PierfrancescoArdino/C2M.
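To make the three inputs named in the abstract more concrete, the following minimal Python sketch shows one possible way to turn mouse clicks into sparse, per-object motion vectors using a segmentation map. It is an illustration only and is not taken from the C2M source code: the `Click` class, the `sparse_motion_from_clicks` function, and the choice of one displacement per object id are all assumptions made for this example.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass(frozen=True)
class Click:
    """Hypothetical user gesture: press at `start`, release at `end`, as (x, y) pixels."""
    start: Tuple[int, int]
    end: Tuple[int, int]


def sparse_motion_from_clicks(
    segmentation: List[List[int]],
    clicks: List[Click],
) -> Dict[int, Optional[Tuple[int, int]]]:
    """Map each object id in the segmentation map to a displacement (dx, dy).

    Objects the user did not click keep the value None, which is what makes
    the motion input sparse. This encoding is an assumption for illustration.
    """
    # Every object id present in the map starts with no user-specified motion.
    motion: Dict[int, Optional[Tuple[int, int]]] = {
        obj_id: None for row in segmentation for obj_id in row
    }
    for click in clicks:
        x, y = click.start
        obj_id = segmentation[y][x]  # rows are indexed by y, columns by x
        motion[obj_id] = (click.end[0] - x, click.end[1] - y)
    return motion


# Toy 3x4 segmentation map: 0 is background, 1 and 2 are two objects.
segmentation = [
    [0, 1, 1, 0],
    [0, 1, 1, 2],
    [0, 0, 0, 2],
]
# The user presses on object 1 at (x=1, y=0) and releases at (x=3, y=1).
clicks = [Click(start=(1, 0), end=(3, 1))]
motion = sparse_motion_from_clicks(segmentation, clicks)
print(motion)
```

In this toy case only object 1 receives a displacement, while the background and object 2 remain unspecified. According to the abstract, inferring plausible motion for all objects in the scene from such sparse input is the role of the GCN in the actual model.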

Subject areas

Computer Vision and Pattern Recognition [cs.CV]

https://telecom-paris.hal.science/hal-03335971

Submitted on: Monday, September 6, 2021, 16:29:16

Last modified on: Tuesday, March 19, 2024, 09:34:06

Dates and versions

hal-03335971, version 1 (2021-09-06)

Identifiers

HAL Id: hal-03335971, version 1
arXiv: 2108.08815

Cite

Pierfrancesco Ardino, Marco de Nadai, Bruno Lepri, Elisa Ricci, Stéphane Lathuilière. Click to Move: Controlling Video Generation with Sparse Motion. International Conference on Computer Vision, Oct 2021, Online, France. ⟨hal-03335971⟩

Collections

INSTITUT-TELECOM LTCI IDS MM IP_PARIS
