The Table below shows the state-of-the-art results of the depth prediction on the state-of-the-art algorithms.
(This is on Make3D Range Image Dataset-1.)

AlgorithmMain contributionRelative ErrorLog-10 errorQualitativeRMSE
Data baseline.Predict Average0.6980.3340%(coming soon)
Saxena, Sung, Ng, NIPS 2005. [1,2]Learn Depth: MRF0.5300.198NR16.7m
Saxena, Sun, Ng, 3dRR/PAMI, 2007. [3,4]Pointwise MRF0.4580.14923.0%NR
Saxena, Sun, Ng, 3dRR/PAMI 2007. [3,4]Superpixel MRF0.3700.18764.9%NR
Hoiem, Efros, Hebert, IJCV 2007 [5]Surface Layout1.4230.32033.1%NR
Heitz, Gould, Saxena, Koller, NIPS 2008. [6]Cascaded ModelsNRNRNR15.4m
Cherian, Morellas, Papanikolopoulos, ICRA 2009. Ground plane NR NR NR 22m
Liu, Gould, Koller, CVPR 2010. [7]Semantic Labels0.3750.148NRNR
Li, Kowdle, Saxena, Chen, NIPS 2010. [8,9,10]Feedback CascadesNRNRNR15.2m
Li, Saxena, Chen, NIPS 2011. [11]theta-MRFNRNRNR15.0m
Karsch, Liu, Kang, ECCV 2012. [12]depth transfer0.3610.148NR15.1m

NR - Not Reported.

References

  1. Learning Depth from Single Monocular Images,
    Ashutosh Saxena, Sung H. Chung, Andrew Y. Ng. In Neural Information Processing Systems (NIPS) 18, 2005. [pdf]

  2. 3-D Depth Reconstruction from a Single Still Image,
    Ashutosh Saxena, Sung H. Chung, Andrew Y. Ng. International Journal of Computer Vision (IJCV), Aug 2007. [pdf]

  3. Make3D: Learning 3-D Scene Structure from a Single Still Image,
    Ashutosh Saxena, Min Sun, Andrew Y. Ng, In IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2008. [pdf]

  4. Learning 3-D Scene Structure from a Single Still Image,
    Ashutosh Saxena, Min Sun, Andrew Y. Ng, In ICCV workshop on 3D Representation for Recognition (3dRR-07), 2007. (Best paper award.) [pdf, ppt, Talk]

  5. Recovering Surface Layout from an Image.
    D. Hoiem, A.A. Efros, and M. Hebert. IJCV 2007.

  6. Cascaded Classification Models: Combining Models for Holistic Scene Understanding.
    Geremy Heitz, Stephen Gould, Ashutosh Saxena, Daphne Koller. In Neural Information Processing Systems (NIPS), 2008. [pdf, more]

  7. Single Image Depth Estimation from Predicted Semantic Labels.
    Beyang Liu, Stephen Gould and Daphne Koller. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2010.

  8. A Generic Model to Compose Vision Modules for Holistic Scene Understanding.
    Adarsh Kowdle, Congcong Li, Ashutosh Saxena and Tsuhan Chen. In European Conference on Computer Vision Workshop on Parts and Attributes (ECCV '10), 2010.

  9. Towards Holistic Scene Understanding: Feedback Enabled Cascaded Classification Models.
    Congcong Li, Adarsh Kowdle, Ashutosh Saxena, Tsuhan Chen. In Neural Information Processing Systems (NIPS), 2010.

  10. Towards Holistic Scene Understanding: Feedback Enabled Cascaded Classification Models.
    Congcong Li, Adarsh Kowdle, Ashutosh Saxena, Tsuhan Chen. To appear in IEEE Trans Pattern Analysis and Machine Intelligence (PAMI), 2011.

  11. θ-MRF: Capturing Spatial and Semantic Structure in the Parameters for Scene Understanding, Congcong Li, Ashutosh Saxena, Tsuhan Chen. In Neural Information Processing Systems (NIPS), 2011.

  12. Depth Extraction from Video Using Non-parametric Sampling, Kevin Karsch, Ce Liu, and Sing Bing Kang. In European Conference on Computer Vision (ECCV), 2012.