Sato Lab / Sugano Lab
Paper-Conference
Exo2EgoDVC: Dense Video Captioning of Egocentric Procedural Activities Using Web Instructional Videos
We propose a novel benchmark for cross-view knowledge transfer of dense video captioning, adapting models from web instructional videos …
Takehiko Ohkawa, Takuma Yagi, Taichi Nishimura, Ryosuke Furuta, Atsushi Hashimoto, Yoshitaka Ushiku, Yoichi Sato
Gaze Scanpath Transformer: Predicting Visual Search Target by Spatiotemporal Semantic Modeling of Gaze Scanpath
We introduce a new method called the Gaze Scanpath Transformer for predicting a search target category during a visual search task. …
Takumi Nishiyasu, Yoichi Sato
Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
We present Ego-Exo4D, a diverse, large-scale multimodal multiview video dataset and benchmark challenge. Ego-Exo4D centers around …
Kristen Grauman, Andrew Westbury, Lorenzo Torresani, Kris Kitani, Jitendra Malik, Triantafyllos Afouras, Kumar Ashutosh, Vijay Baiyya, Siddhant Bansal, Bikram Boote, Eugene Byrne, Zach Chavis, Joya Chen, Feng Cheng, Fu-Jen Chu, Sean Crane, Avijit Dasgupta, Jing Dong, Maria Escobar, Cristhian Forigua, Abrham Gebreselasie, Sanjay Haresh, Jing Huang, Md Mohaiminul Islam, Suyog Jain, Rawal Khirodkar, Devansh Kukreja, Kevin J Liang, Jia-Wei Liu, Sagnik Majumder, Yongsen Mao, Miguel Martin, Effrosyni Mavroudi, Tushar Nagarajan, Francesco Ragusa, Santhosh Kumar Ramakrishnan, Luigi Seminara, Arjun Somayazulu, Yale Song, Shan Su, Zihui Xue, Edward Zhang, Jinxu Zhang, Angela Castillo, Changan Chen, Xinzhu Fu, Ryosuke Furuta, Cristina Gonzalez, Prince Gupta, Jiabo Hu, Yifei Huang, Yiming Huang, Weslie Khoo, Anush Kumar, Robert Kuo, Sach Lakhavani, Miao Liu, Mi Luo, Zhengyi Luo, Brighid Meredith, Austin Miller, Oluwatumininu Oguntola, Xiaqing Pan, Penny Peng, Shraman Pramanick, Merey Ramazanova, Fiona Ryan, Wei Shan, Kiran Somasundaram, Chenan Song, Audrey Southerland, Masatoshi Tateno, Huiyu Wang, Yuchen Wang, Takuma Yagi, Mingfei Yan, Xitong Yang, Zecheng Yu, Shengxin Cindy Zha, Chen Zhao, Ziwei Zhao, Zhifan Zhu, Jeff Zhuo, Pablo Arbelaez, Gedas Bertasius, David Crandall, Dima Damen, Jakob Engel, Giovanni Maria Farinella, Antonino Furnari, Bernard Ghanem, Judy Hoffman, C. V. Jawahar, Richard Newcombe, Hyun Soo Park, James M. Rehg, Yoichi Sato, Manolis Savva, Jianbo Shi, Mike Zheng Shou, Michael Wray
Rotation-Constrained Cross-View Feature Fusion for Multi-View Appearance-based Gaze Estimation
Appearance-based gaze estimation has been actively studied in recent years. However, its generalization performance for unseen head …
Yoichiro Hisadome, Tianyi Wu, Jiawei Qin, Yusuke Sugano
Image Cropping under Design Constraints
Image cropping is essential in image editing for obtaining a compositionally enhanced image. In display media, image cropping is a …
Takumi Nishiyasu, Wataru Shimoda, Yoichi Sato
Proposal-based Temporal Action Localization with Point-level Supervision
Point-level supervised temporal action localization (PTAL) aims at recognizing and localizing actions in untrimmed videos where only a …
Yuan Yin, Yifei Huang, Ryosuke Furuta, Yoichi Sato
DeCo: Decomposition and Reconstruction for Compositional Temporal Grounding via Coarse-to-Fine Contrastive Ranking
Understanding dense action in videos is a fundamental challenge towards the generalization of vision models. Several works show that …
Lijin Yang, Quan Kong, Hsuan-Kung Yang, Wadim Kehl, Yoichi Sato, Norimasa Kobori
FineBio: A Fine-Grained Video Dataset of Biological Experiments with Hierarchical Annotations
Takuma Yagi, Misaki Ohashi, Yifei Huang, Ryosuke Furuta, Shungo Adachi, Toutai Mitsuyama, Yoichi Sato
Structural Multiplane Image: Bridging Neural View Synthesis and 3D Reconstruction
The Multiplane Image (MPI), containing a set of fronto-parallel RGBA layers, is an effective and efficient representation for view …
Mingfang Zhang, Jinglu Wang, Xiao Li, Yifei Huang, Yoichi Sato
Technical Report for EgoTracks in Ego4D Challenge 2023
Mingfang Zhang, Yuan Yin, Yifei Huang, Yoichi Sato