Suhwan Cho

I am a Ph.D. student in the Image and Video Pattern Recognition Lab. at Yonsei University, Seoul, Korea. I was a Research Scientist Intern in the Deep Learning Group at Adobe Research, San Jose, California, USA. My research mainly focuses on video editing technologies such as video object segmentation and video inpainting.

I am always open to collaborations! Please feel free to contact me if you are interested :)

CV / GitHub / Linkedin / Google Scholar

profile photo
Selected Publication
LSHNet: Leveraging Structure‐Prior with Hierarchical Feature Update for Salient Object Detection in Optical Remote Sensing Images
Seunghoon Lee, Suhwan Cho*, Chaewon Park, Seungwook Park, Jaeyeob Kim, Sangyoun Lee
IEEE Transactions on Geoscience and Remote Sensing (TGRS)
Paper / arXiv / Code

We propose a dual-branch architecture that hierarchically incorporates both structural priors and appearance information.

Dual Prototype Attention for Unsupervised Video Object Segmentation
Suhwan Cho*, Minhyeok Lee*, Seunghoon Lee, Dogyoon Lee, Sangyoun Lee
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024)
Paper / arXiv / Code

We propose two novel prototype-based attention mechanisms to incorporate different modalities and frames via dense propagation across them.

Guided Slot Attention for Unsupervised Video Object Segmentation
Minhyeok Lee, Suhwan Cho, Dogyoon Lee, Chaewon Park, Jungho Lee, Sangyoun Lee
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024)
Paper / arXiv / Code

We propose a guided slot attention network to reinforce spatial structural information and obtain better foreground–background separation.

Leveraging Spatio-Temporal Dependency for Skeleton-Based Action Recognition
Jungho Lee, Minhyeok Lee, Suhwan Cho, Sungmin Woo, Sangyoun Lee
IEEE/CVF International Conference on Computer Vision (ICCV 2023)
Paper / arXiv / Code

We propose the inter-frame curve network to effectively leverage the spatio-temporal dependency of the human skeleton.

Treating Motion as Option to Reduce Motion Dependency in Unsupervised Video Object Segmentation
Suhwan Cho, Minhyeok Lee, Seunghoon Lee, Chaewon Park, Donghyeong Kim, Sangyoun Lee
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2023)
Paper / arXiv / Code

We propose a novel network architecture that operates regardless of motion availability, termed as a motion-as-option network. A collaborative network learning strategy is also introduced to fully exploit the property of the proposed network.

Unsupervised Video Object Segmentation via Prototype Memory Network
Minhyeok Lee, Suhwan Cho, Seunghoon Lee, Chaewon Park, Sangyoun Lee
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2023)
Paper / arXiv / Code

We propose a prototype memory network architecture to effectively extract and store appearance and motion information based on component prototypes from RGB images and optical flow maps.

Tackling Background Distraction in Video Object Segmentation
Suhwan Cho, Heansung Lee, Minhyeok Lee, Chaewon Park, Sungjun Jang, Minjung Kim, Sangyoun Lee
European Conference on Computer Vision (ECCV 2022)
Paper / arXiv / Code

We propose three novel strategies to suppress the negative influence of background distractions in semi-supervised video object segmentation.

SPSN: Superpixel Prototype Sampling Network for RGB-D Salient Object Detection
Minhyeok Lee*, Chaewon Park*, Suhwan Cho, Sangyoun Lee
European Conference on Computer Vision (ECCV 2022)
Paper / arXiv / Code

We propose a prototype sampling network that only samples prototypes corresponding to obtain robustness to inconsistencies between RGB images and depth maps.

Unsupervised Video Anomaly Detection via Normalizing Flows with Implicit Latent Features
MyeongAh Cho, Taeoh Kim, Woo Jin Kim, Suhwan Cho, Sangyoun Lee
Pattern Recognition (PR)
Paper / arXiv / Code

We propose an implicit two-path AE, a structure in which two encoders implicitly model appearance and motion features, along with a single decoder that combines them to learn normal video patterns.

Pixel-Level Bijective Matching for Video Object Segmentation
Suhwan Cho, Heansung Lee, Minjung Kim, Sungjun Jang, Sangyoun Lee
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2022)
Paper / arXiv / Code

We introduce a bijective matching mechanism to find the best matches from the query frame to the reference frame and also vice versa. A mask embedding module is also proposed to consider multiple historic masks simultaneously.