Sato Lab./Sugano Lab.
Sato Lab./Sugano Lab.
Y. Sato Lab.
Sugano Lab.
News
Publications
Contact
Resources
Internal Wiki
Conference
EgoExoBench: A Benchmark for First- and Third-person View Video Understanding in MLLMs
Transferring and integrating knowledge across first-person (egocentric) and third-person (exocentric) viewpoints is intrinsic to human …
Yuping He
,
Yifei Huang
,
Guo Chen
,
Baoqi Pei
,
Jilan Xu
,
Tong Lu
,
Jiangmiao Pang
PDF
Cite
Code
DOI
Unveiling Egocentric Reasoning with Spatio-Temporal CoT
Egocentric video reasoning focuses on the unseen, egocentric agent who shapes the scene, demanding inference of hidden intentions and …
Baoqi Pei
,
Yifei Huang
,
Jilan Xu
,
Yuping He
,
Guo Chen
,
Fei Wu
,
Yu Qiao
,
Jiangmiao Pang
PDF
Cite
Code
Vinci: A Real-time Smart Assistant based on Egocentric Vision-language Model for Portable Devices
We present Vinci, a vision-language system designed to provide real-time, comprehensive AI assistance on portable devices. At its core, …
Yifei Huang
,
Jilan Xu
,
Baoqi Pei
,
Yuping He
,
Guo Chen
,
Mingfang Zhang
,
Lijin Yang
,
Zheng Nie
,
Jinyao Liu
,
Guoshun Fan
,
Dechen Lin
,
Fang Fang
,
Kunpeng Li
,
Chang Yuan
,
Xinyuan Chen
,
Yaohui Wang
,
Yali Wang
,
Yu Qiao
,
Limin Wang
PDF
Cite
Code
DOI
Cite
×