About me

I am a M.S. / Ph.D. Integrated student of CVLAB at KAIST, under the supervision of Prof. Seungryong Kim. My current work involves a variety of computer vision tasks, including image segmentation, text-to-image/video generation, and 3D modeling.

My research interests lie at the intersection of computer vision and robotic agents, with a focus on developing accurate and semantically rich perception and generative models for advancing robot perception and world modeling. I believe that integrating these fields will make the world more convenient and safer, and I am excited to be part of this journey. If you have any questions or suggestions, please feel free to contact me!

Publications

Visual Representation Alignment for Multimodal Large Language Models

Heeji Yoon, Jaewoo Jung, Junwan Kim, Hyungyu Choi, Heeseong Shin, Sangbeom Lim, Honggyu An, Chaehyun Kim, Jisang Han, Donghyun Kim, Chanho Eom, Sunghwan Hong, Seungryong Kim
arxiv

D2USt3R: Enhancing 3D Reconstruction with 4D Pointmaps for Dynamic Scenes

Jisang Han, Honggyu An, Jaewoo Jung, Takuya Narihira, Junyoung Seo, Kazumi Fukuda, Chaehyun Kim, Sunghwan Hong, Yuki Mitsufuji, and Seungryong Kim.
arxiv

Towards open-vocabulary semantic segmentation without semantic labels

Heeseong Shin, Chaehyun Kim, Sunghwan Hong, Seokju Cho, Anurag Arnab, Paul Hongsuck Seo, and Seungryong Kim.
NeurIPS2024