Sato Lab./Sugano Lab.
Sato Lab./Sugano Lab.
Y. Sato Lab.
Sugano Lab.
News
Publications
Contact
Datasets
Internal Wiki
English
日本語
Recent Publications
» List of All Publications
ActionVOS: Action as Prompts for Video Object Segmentation
Delving into the realm of egocentric vision, the advancement of referring video object segmentation (RVOS) stands as pivotal in …
Liangyang Ouyang
,
Ruicong Liu
,
Yifei Huang
,
Ryosuke Furuta
,
Yoichi Sato
PDF
Cite
Code
Masked Video and Body-worn IMU Autoencoder for Egocentric Action Recognition
Compared with visual signals, Inertial Measurement Units (IMUs) placed on human limbs can capture accurate motion signals while being …
Mingfang Zhang
,
Yifei Huang
,
Ruicong Liu
,
Yoichi Sato
PDF
Cite
Single-to-Dual-View Adaptation for Egocentric 3D Hand Pose Estimation
The pursuit of accurate 3D hand pose estimation stands as a keystone for understanding human activity in the realm of egocentric …
Ruicong Liu
,
Takehiko Ohkawa
,
Mingfang Zhang
,
Yoichi Sato
PDF
Cite
Code
Exo2EgoDVC: Dense Video Captioning of Egocentric Procedural Activities Using Web Instructional Videos
We propose a novel benchmark for cross-view knowledge transfer of dense video captioning, adapting models from web instructional videos …
Takehiko Ohkawa
,
Takuma Yagi
,
Taichi Nishimura
,
Ryosuke Furuta
,
Atsushi Hashimoto
,
Yoshitaka Ushiku
,
Yoichi Sato
PDF
Cite
Gaze Scanpath Transformer: Predicting Visual Search Target by Spatiotemporal Semantic Modeling of Gaze Scanpath
We introduce a new method called the Gaze Scanpath Transformer for predicting a search target category during a visual search task. …
Takumi Nishiyasu
,
Yoichi Sato
PDF
Cite
Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
We present Ego-Exo4D, a diverse, large-scale multimodal multiview video dataset and benchmark challenge. Ego-Exo4D centers around …
Kristen Grauman
,
Andrew Westbury
,
Lorenzo Torresani
,
Kris Kitani
,
Jitendra Malik
,
Triantafyllos Afouras
,
Kumar Ashutosh
,
Vijay Baiyya
,
Siddhant Bansal
,
Bikram Boote
,
Eugene Byrne
,
Zach Chavis
,
Joya Chen
,
Feng Cheng
,
Fu-Jen Chu
,
Sean Crane
,
Avijit Dasgupta
,
Jing Dong
,
Maria Escobar
,
Cristhian Forigua
,
Abrham Gebreselasie
,
Sanjay Haresh
,
Jing Huang
,
Md Mohaiminul Islam
,
Suyog Jain
,
Rawal Khirodkar
,
Devansh Kukreja
,
Kevin J Liang
,
Jia-Wei Liu
,
Sagnik Majumder
,
Yongsen Mao
,
Miguel Martin
,
Effrosyni Mavroudi
,
Tushar Nagarajan
,
Francesco Ragusa
,
Santhosh Kumar Ramakrishnan
,
Luigi Seminara
,
Arjun Somayazulu
,
Yale Song
,
Shan Su
,
Zihui Xue
,
Edward Zhang
,
Jinxu Zhang
,
Angela Castillo
,
Changan Chen
,
Xinzhu Fu
,
Ryosuke Furuta
,
Cristina Gonzalez
,
Prince Gupta
,
Jiabo Hu
,
Yifei Huang
,
Yiming Huang
,
Weslie Khoo
,
Anush Kumar
,
Robert Kuo
,
Sach Lakhavani
,
Miao Liu
,
Mi Luo
,
Zhengyi Luo
,
Brighid Meredith
,
Austin Miller
,
Oluwatumininu Oguntola
,
Xiaqing Pan
,
Penny Peng
,
Shraman Pramanick
,
Merey Ramazanova
,
Fiona Ryan
,
Wei Shan
,
Kiran Somasundaram
,
Chenan Song
,
Audrey Southerland
,
Masatoshi Tateno
,
Huiyu Wang
,
Yuchen Wang
,
Takuma Yagi
,
Mingfei Yan
,
Xitong Yang
,
Zecheng Yu
,
Shengxin Cindy Zha
,
Chen Zhao
,
Ziwei Zhao
,
Zhifan Zhu
,
Jeff Zhuo
,
Pablo Arbelaez
,
Gedas Bertasius
,
David Crandall
,
Dima Damen
,
Jakob Engel
,
Giovanni Maria Farinella
,
Antonino Furnari
,
Bernard Ghanem
,
Judy Hoffman
,
C. v. Jawahar
,
Richard Newcombe
,
Hyun Soo Park
,
James M. Rehg
,
Yoichi Sato
,
Manolis Savva
,
Jianbo Shi
,
Mike Zheng Shou
,
Michael Wray
PDF
Cite
Matching Compound Prototypes for Few-Shot Action Recognition
The task of few-shot action recognition aims to recognize novel action classes using only a small number of labeled training samples. …
Yifei Huang
,
Lijin Yang
,
Guo Chen
,
Hongjie Zhang
,
Feng Lu
,
Yoichi Sato
PDF
Cite
Code
DOI
Image-to-Text Translation for Interactive Image Recognition: A Comparative User Study with Non-Expert Users
Interactive machine learning (IML) allows users to build their custom machine learning models without expert knowledge. While most …
Wataru Kawabe
,
Yusuke Sugano
PDF
Cite
DOI
Technical Understanding from Interactive Machine Learning Experience: A Study through a Public Event for Science Museum Visitors
While AI technology is becoming increasingly prevalent in our daily lives, the comprehension of machine learning (ML) among non-experts …
Wataru Kawabe
,
Yuri Nakao
,
Akihisa Shitara
,
Yusuke Sugano
PDF
Cite
DOI
Simultaneous control of head pose and expressions in 3D facial keypoint-based GAN
In this work, we present a novel method for simultaneously controlling the head pose and the facial expressions of a given input image …
Tomoyuki Hatakeyama
,
Ryosuke Furuta
,
Yoichi Sato
PDF
Cite
DOI
See all publications
Cite
×