Sato Lab./Sugano Lab.
Image Cropping under Design Constraints
Image cropping is essential in image editing for obtaining a compositionally enhanced image. In display media, image cropping is a …
Takumi Nishiyasu, Wataru Shimoda, Yoichi Sato
PDF · Code
Proposal-based Temporal Action Localization with Point-level Supervision
Point-level supervised temporal action localization (PTAL) aims at recognizing and localizing actions in untrimmed videos where only a …
Yuan Yin, Yifei Huang, Ryosuke Furuta, Yoichi Sato
PDF
DeCo: Decomposition and Reconstruction for Compositional Temporal Grounding via Coarse-to-Fine Contrastive Ranking
Understanding dense action in videos is a fundamental challenge towards the generalization of vision models. Several works show that …
Lijin Yang, Quan Kong, Hsuan-Kung Yang, Wadim Kehl, Yoichi Sato, Norimasa Kobori
PDF
FineBio: A Fine-Grained Video Dataset of Biological Experiments with Hierarchical Annotations
Takuma Yagi, Misaki Ohashi, Yifei Huang, Ryosuke Furuta, Shungo Adachi, Toutai Mitsuyama, Yoichi Sato
Structural Multiplane Image: Bridging Neural View Synthesis and 3D Reconstruction
The Multiplane Image (MPI), containing a set of fronto-parallel RGBA layers, is an effective and efficient representation for view …
Mingfang Zhang, Jinglu Wang, Xiao Li, Yifei Huang, Yoichi Sato
PDF
Technical Report for EgoTracks in Ego4D Challenge 2023
Mingfang Zhang, Yuan Yin, Yifei Huang, Yoichi Sato
Weakly Supervised Temporal Sentence Grounding With Uncertainty-Guided Self-Training
The task of weakly supervised temporal sentence grounding aims at finding the corresponding temporal moments of a language description …
Yifei Huang, Lijin Yang, Yoichi Sato
PDF
Fine-grained Affordance Annotation for Egocentric Hand-Object Interaction Videos
Object affordance is an important concept in hand-object interaction, providing information on action possibilities based on human …
Zecheng Yu, Yifei Huang, Ryosuke Furuta, Takuma Yagi, Yusuke Gotsu, Yoichi Sato
PDF
Learning Video-independent Eye Contact Segmentation from In-the-Wild Videos
Human eye contact is a form of non-verbal communication and can have a great influence on social behavior. Since the location and size …
Tianyi Wu, Yusuke Sugano
PDF · Code
Background Mixup Data Augmentation for Hand and Object-in-Contact Detection
Detecting the positions of human hands and objects-in-contact (hand-object detection) in each video frame is vital for understanding …
Koya Tango, Takehiko Ohkawa, Ryosuke Furuta, Yoichi Sato
PDF