李乐, 张茂军, 熊志辉, 徐玮. 基于内容理解的单幅静态街景图像深度估计[J]. 机器人, 2011, 33(2): 174-180.
LI Le, ZHANG Maojun, XIONG Zhihui, XU Wei. Depth Estimation from a Single Still Image of Street Scene Based on Content Understanding. ROBOT, 2011, 33(2): 174-180.
Abstract: A method is presented for estimating depth from a single street-scene image by understanding how objects compose the scene. First, the image is segmented into regions, and the features of each region, together with the associated features of its neighboring area, are extracted. A machine learning method then classifies each region by object type from these features, which reveals how the image is composed of objects. Next, the depth of the ground is estimated from the relationship, derived from the pinhole imaging model, between an object's image coordinates and its real-world depth. The depth of the other objects is estimated from both their position relative to the ground and the variation of certain features within the objects. Finally, the depth map of the image is produced. Experiments show that the algorithm performs better than other methods and that the estimated depth accurately reflects the location of each object in the real world.
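The ground-depth step can be illustrated with a minimal sketch of the standard pinhole relation for a flat ground plane. This is not necessarily the formula derived in the paper: it assumes a camera whose optical axis is parallel to the ground, and the names `ground_depth`, `horizon_row`, `focal_px`, and `camera_height` are hypothetical, chosen only for illustration.

```python
import math


def ground_depth(v, horizon_row, focal_px, camera_height):
    """Depth of a ground pixel under a flat-ground pinhole model.

    Assumptions (not taken from the paper): the optical axis is parallel
    to the ground, image rows increase downward, `focal_px` is the focal
    length in pixels, and `camera_height` is the camera height above the
    ground in the same unit as the returned depth.
    """
    dv = v - horizon_row  # rows below the horizon line
    if dv <= 0:
        # At or above the horizon there is no finite ground intersection.
        return math.inf
    # Similar triangles: camera_height / depth = dv / focal_px
    return focal_px * camera_height / dv


if __name__ == "__main__":
    # Ground pixels closer to the bottom of the image are nearer the camera.
    for row in (320, 400, 500):
        print(row, ground_depth(row, horizon_row=300, focal_px=800.0, camera_height=1.5))
```

Under these assumptions, depth falls off as the inverse of the pixel's distance below the horizon, which is why a horizon (vanishing line) estimate matters for this kind of ground-depth computation.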
[1] Forsyth D A,Ponce J.Computer vision:A modern approach[M].Englewood Cliffs,NJ,USA:Prentice Hall,2003.
[2] Hertzmann A,Seitz S M.Example-based photometric stereo:Shape reconstruction with general,varying BRDFs[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2005,27(8):1254-1264.
[3] 张大志,王勇涛,田金文,等.基于单目视觉系统的远距离场景重建算法研究[J].宇航学报,2008,29(1):289-294.Zhang D Z,Wang Y T,Tian J W,et al.Efficient 3D reconstruction using monocular vision[J].Journal of Astronautics,2008,29(1):289-294.
[4] Ens J,Lawrence P.An investigation of methods for determining depth from focus[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,1993,15(2):97-108.
[5] 曾祥尽,黄心汉,吴倩,等.马尔可夫随机场在显微图像散焦深度信息估计中的应用[J].机器人,2008,30(5):416-420.Zeng X J,Huang X H,Wu Q,et al.Application of MRF to depth information estimation of micro image defocus[J].Robot,2008,30(5):416-420.
[6] Saxena A,Chung S H,Ng A Y.3-D depth reconstruction from a single still image[J].International Journal of Computer Vision,2008,76(1):53-69.
[7] Hoiem D,Efros A A,Hebert M.Automatic photo popup[C]//32nd International Conference on Computer Graphics and Interactive Techniques.2005:577-584.
[8] Hoiem D,Efros A A,Hebert M.Closing the loop on scene interpretation[C]//IEEE Conference on Computer Vision and Pattern Recognition.Piscataway,NJ,USA:IEEE,2008:1-8.
[9] Hoiem D,Efros A A,Hebert M.Putting objects in perspective[C]//IEEE Conference on Computer Vision and Pattern Recognition.Piscataway,NJ,USA:IEEE,2006:2137-2144.
[10] Felzenszwalb P,Huttenlocher D.Efficient graph-based image segmentation[J].International Journal of Computer Vision,2004,59(2):167-181.
[11] Shotton J,Winn J,Rother C,et al.TextonBoost:Joint appearance,shape and context modeling for multi-class object recognition and segmentation[C]//European Conference on Computer Vision.Piscataway,NJ,USA:IEEE,2006:1-15.
[12] Chang C C,Lin C J.LIBSVM:A library for support vector machines[EB/OL].(2010-04-01)[2010-05-10].http://www.csie.ntu.tw/~cjlin/libsvm.
[13] Kass M,Witkin A,Terzopoulos D.Snakes:Active contour models[J].International Journal of Computer Vision,1988,1(4):321-331.
[14] Havasi L,Szirányi T.Extraction of horizontal vanishing line using shapes and statistical error propagation[C]//Photogrammetric Computer Vision Symposium of International Society for Photogrammetry and Remote Sensing Commission Ⅲ.2006:167-173.
[15] Zafarifar B,Weda H,de With P H N.Horizon detection based on sky-color and edge features[C]//Proceedings of the SPIE,vol.6822.Bellingham,WA,USA:SPIE,2008:682220-682220-9.
[16] Duda R,Hart P.Use of the Hough Transformation to Detect Lines and Curves in Pictures[J].Communications of the ACM,1972,15(1):11-15.