Introduction
I am a PhD student at KAIST, advised by Prof. Seungryong Kim. My main research interest is general perception in video, especially Track Any Point (point tracking), and its applications to 3D/4D video understanding and reconstruction. I work closely with Joon-Young Lee and Gabriel Huang at Adobe, where I completed two internships, and before that, I collaborated with Google Research and Microsoft Research Asia (MSRA).
Before joining KAIST, I was a PhD student at Korea University and transferred to KAIST with my advisor. Prior to that, I received a B.S. degree from Yonsei University.
Education
- Korea Advanced Institute of Science and Technology (KAIST), Seoul, Korea
  - MS/PhD Integrated Student in Artificial Intelligence
  - Sep. 2024 - Present
- Korea University, Seoul, Korea
  - MS/PhD Integrated Student in Computer Science and Engineering
  - Mar. 2022 - Aug. 2024 (Transferred to KAIST with supervisor)
- Yonsei University, Seoul, Korea
  - BS in Computer Science
  - Mar. 2018 - Feb. 2022
Work Experience
- NVIDIA, Santa Clara, CA
  - Incoming Research Intern
- Adobe, San Jose, CA
  - Research Intern, June 2024 - Sep. 2024
  - Worked on 3D point tracking, aiming to understand 3D structure from 2D trajectories; submitted to CVPR 2025.
  - Mentors: Gabriel Huang, Joon-Young Lee
- Adobe, San Jose, CA
  - Research Intern, June 2023 - Sep. 2023
  - Conducted research on long-range dense tracking, leading to the publication of ‘FlowTrack: Revisiting Optical Flow for Long-Range Dense Tracking’ at CVPR 2024.
  - Mentors: Joon-Young Lee, Gabriel Huang
Publications
Seurat: From Moving Points to Depth
Seokju Cho, Jiahui Huang, Seungryong Kim, Joon-Young Lee
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025.
Exploring Temporally-Aware Features for Point Tracking
Inès Hyeonsu Kim*, Seokju Cho*, Jiahui Huang, Jung Yi, Joon-Young Lee, Seungryong Kim
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025.
[Project Page] [arXiv]
DiffFace: Diffusion-based Face Swapping with Facial Guidance
Kihong Kim*, Yunho Kim*, Seokju Cho, Junyoung Seo, Jisu Nam, Kychul Lee, Seungryong Kim, KwangHee Lee
Pattern Recognition (PR), 2025.
[Project Page] [arXiv]
Multi-Granularity Video Object Segmentation
Sangbeom Lim*, Seongchan Kim*, Seungjun An*, Seokju Cho, Paul Hongsuck Seo, Seungryong Kim
AAAI Conference on Artificial Intelligence (AAAI), 2025.
[Project Page] [arXiv]
Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels
Heeseong Shin, Chaehyun Kim, Sunghwan Hong, Seokju Cho, Anurag Arnab, Paul Hongsuck Seo, Seungryong Kim
Neural Information Processing Systems (NeurIPS), 2024.
[Project Page] [arXiv]
Local All-Pair Correspondence for Point Tracking
Seokju Cho, Jiahui Huang, Jisu Nam, Honggyu An, Seungryong Kim, and Joon-Young Lee
European Conference on Computer Vision (ECCV), 2024.
[Project Page] [arXiv]
FlowTrack: Revisiting Optical Flow for Long-Range Dense Tracking
Seokju Cho, Jiahui Huang, Seungryong Kim, and Joon-Young Lee
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024.
[arXiv]
CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation
Seokju Cho*, Heeseong Shin*, Sunghwan Hong, Anurag Arnab, Paul Hongsuck Seo, and Seungryong Kim
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Highlight (2.8% Acceptance Rate), 2024.
[Project Page] [arXiv]
Unifying Feature and Cost Aggregation with Transformers for Dense Correspondence
Sunghwan Hong*, Seokju Cho*, Seungryong Kim, and Stephen Lin
International Conference on Learning Representations (ICLR), 2024.
DäRF: Boosting Radiance Fields from Sparse Inputs with Monocular Depth Adaptation
Jiuhn Song*, Seonghoon Park*, Honggyu An*, Seokju Cho, Min-Seop Kwak, Sungjin Cho, and Seungryong Kim
Neural Information Processing Systems (NeurIPS), 2023.
[Project Page] [arXiv]
LANIT: Language-Driven Image-to-Image Translation for Unlabeled Data
Jihye Park*, Sunwoo Kim*, Soohyun Kim*, Seokju Cho, Jaejun Yoo, Youngjung Uh, and Seungryong Kim
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023.
[Project Page] [arXiv]
MIDMs: Matching Interleaved Diffusion Models for Exemplar-based Image Translation
Junyoung Seo*, Gyuseong Lee*, Seokju Cho, Jiyoung Lee, Seungryong Kim
AAAI Conference on Artificial Intelligence (AAAI), 2023.
[Project Page] [arXiv]
CATs++: Boosting Cost Aggregation with Convolutions and Transformers
Seokju Cho*, Sunghwan Hong*, and Seungryong Kim
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023 (Impact Factor: 24.314).
[Project Page] [arXiv]
Neural Matching Fields: Implicit Representation of Matching Fields for Visual Correspondence
Sunghwan Hong, Jisu Nam, Seokju Cho, Susung Hong, Sangryul Jeon, Dongbo Min, and Seungryong Kim
Neural Information Processing Systems (NeurIPS), 2022.
[Project Page] [arXiv]
Cost Aggregation with 4D Convolutional Swin Transformer for Few-Shot Segmentation
Sunghwan Hong*, Seokju Cho*, Jisu Nam, Stephen Lin, and Seungryong Kim
European Conference on Computer Vision (ECCV), 2022.
[Project Page] [arXiv]
CATs: Cost Aggregation Transformers for Visual Correspondence
Seokju Cho*, Sunghwan Hong*, Sangryul Jeon, Yunsung Lee, Kwanghoon Sohn, and Seungryong Kim
Neural Information Processing Systems (NeurIPS), 2021.
[Project Page] [arXiv] [Github]
Preprints
Pose-Diversified Augmentation with Diffusion Model for Person Re-Identification
Inès Hyeonsu Kim, JoungBin Lee, Soowon Son, Woojeong Jin, Kyusun Cho, Junyoung Seo, Min-Seop Kwak, Seokju Cho, JeongYeol Baek, Byeongwon Lee, Seungryong Kim
ArXiv Preprint, 2024.
[arXiv]
Academic Services
Reviewer
- 2024 - CVPR
- 2023 - CVPR, NeurIPS
Honors
- Google, June 2024
  - Google East Asia Student Travel Grant for CVPR
- Ministry of Science and ICT & National IT Industry Promotion Agency, Sep. 2021
  - AI Online Competition, 3rd Place Award (300M KRW prize)