127 Results
Workshop | Fri 5:25 | Impossibility of Collective Intelligence | Krikamol Muandet
Poster | Wed 7:30 | Diagnosing and Rectifying Vision Models using Language | Yuhui Zhang · Jeff Z. HaoChen · Shih-Cheng Huang · Kuan-Chieh Wang · James Y Zou · Serena Yeung
Poster | | Universal Vision-Language Dense Retrieval: Learning A Unified Representation Space for Multi-Modal Retrieval | Zhenghao Liu · Chenyan Xiong · Yuanhuiyi Lv · Zhiyuan Liu · Ge Yu
Poster | Wed 7:30 | CLIPSep: Learning Text-queried Sound Separation with Noisy Unlabeled Videos | Hao-Wen Dong · Naoya Takahashi · Yuki Mitsufuji · Julian McAuley · Taylor Berg-Kirkpatrick
Poster | | DeCap: Decoding CLIP Latents for Zero-Shot Captioning via Text-Only Training | wei li · Linchao Zhu · Longyin Wen · Yi Yang
Poster | | Is a Caption Worth a Thousand Images? A Study on Representation Learning | Shibani Santurkar · Yann Dubois · Rohan Taori · Percy Liang · Tatsunori Hashimoto
Poster | Mon 2:30 | Masked Vision and Language Modeling for Multi-modal Representation Learning | Gukyeong Kwon · Zhaowei Cai · Avinash Ravichandran · Erhan Bas · Rahul Bhotika · Stefano Soatto
Poster | Wed 7:30 | Write and Paint: Generative Vision-Language Models are Unified Modal Learners | Shizhe Diao · Wangchunshu Zhou · Xinsong Zhang · Jiawei Wang
Poster | Wed 2:30 | An Extensible Multi-modal Multi-task Object Dataset with Materials | Trevor Standley · Ruohan Gao · Dawn Chen · Jiajun Wu · Silvio Savarese
Poster | | Contrastive Audio-Visual Masked Autoencoder | Yuan Gong · Andrew Rouditchenko · Alexander Liu · David Harwath · Leonid Karlinsky · Hilde Kuehne · James R Glass
Poster | Mon 2:30 | Multimodal Federated Learning via Contrastive Representation Ensemble | Qiying Yu · Yang Liu · Yimu Wang · Ke Xu · Jingjing Liu
Workshop | | Predicting Density of States via Multi-modal Transformer | Namkyeong Lee · Heewoong Noh · Sungwon Kim · Dongmin Hyun · Gyoung S. Na · Chanyoung Park