Overcoming Occlusion with Inverse Graphics

详细信息查看全文

关键词：Vision ; as ; inverse ; graphics ; Scene understanding ; Occlusion
刊名：Lecture Notes in Computer Science
出版年：2016
出版时间：2016
年：2016
卷：9915
期：1
页码：170-185
丛书名：Computer Vision ?ECCV 2016 Workshops
ISBN：978-3-319-49409-8
卷排序：9915

文摘

Scene understanding tasks such as the prediction of object pose, shape, appearance and illumination are hampered by the occlusions often found in images. We propose a vision-as-inverse-graphics approach to handle these occlusions by making use of a graphics renderer in combination with a robust generative model (GM). Since searching over scene factors to obtain the best match for an image is very inefficient, we make use of a recognition model (RM) trained on synthetic data to initialize the search. This paper addresses two issues: (i) We study how the inferences are affected by the degree of occlusion of the foreground object, and show that a robust GM which includes an outlier model to account for occlusions works significantly better than a non-robust model. (ii) We characterize the performance of the RM and the gains that can be made by refining the search using the GM, using a new dataset that includes background clutter and occlusions. We find that pose and shape are predicted very well by the RM, but appearance and especially illumination less so. However, accuracy on these latter two factors can be clearly improved with the generative model.

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700