학사

정기세미나

View

How Many Labels Do We Need to Understand Pixels?
담당자 홍승훈 교수(KAIST)	세미나 일자 2023.12.01 Fri	조회수 785

[ Abstract ]

Dense prediction is a fundamental class of computer vision problems where the goal is to predict the per-pixel labels of an input image. Since any problems relating pixels to labels can fall into this class, it broadly encapsulates the majority of vision tasks, including semantic segmentation, object detection, pose estimation, and depth estimation, to name a few. Despite the remarkable progress in the past, however, training a model for dense prediction still remains challenging due to the cost of collecting per-pixel labels. A more desirable approach is to build a few-shot learner for dense prediction, yet the current solutions are limited to specific tasks such as segmentation.

[ Biography ]

Seunghoon Hong is an assistant professor at the School of Computing, KAIST. Before joining KAIST, he had been a postdoctoral fellow at the University of Michigan and visiting research faculty at Google Brain team. His research interests lie in the intersection of machine learning and computer vision, with a specific focus on learning with least supervision and deep generative models. He received the B.S. and Ph.D. degree from the Department of Computer Science and Engineering at POSTECH, Pohang, Korea in 2011 and 2017, respectively.