{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T17:26:24Z","timestamp":1777656384490,"version":"3.51.4"},"reference-count":81,"publisher":"Association for Computing Machinery (ACM)","issue":"5","license":[{"start":{"date-parts":[[2015,11,3]],"date-time":"2015-11-03T00:00:00Z","timestamp":1446508800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Graph."],"published-print":{"date-parts":[[2015,11,3]]},"abstract":"<jats:p>\n            We present a new interactive and online approach to 3D scene understanding. Our system,\n            <jats:italic>SemanticPaint<\/jats:italic>\n            , allows users to simultaneously scan their environment whilst interactively segmenting the scene simply by reaching out and touching any desired object or surface. Our system continuously learns from these segmentations, and labels new unseen parts of the environment. Unlike offline systems where capture, labeling, and batch learning often take hours or even days to perform, our approach is fully online. This provides users with continuous live feedback of the recognition during capture, allowing to immediately correct errors in the segmentation and\/or learning\u2014a feature that has so far been unavailable to batch and offline methods. This leads to models that are tailored or\n            <jats:italic>personalized<\/jats:italic>\n            specifically to the user's environments and object classes of interest, opening up the potential for new applications in augmented reality, interior design, and human\/robot navigation. It also provides the ability to capture substantial labeled 3D datasets for training large-scale visual recognition systems.\n          <\/jats:p>","DOI":"10.1145\/2751556","type":"journal-article","created":{"date-parts":[[2015,11,5]],"date-time":"2015-11-05T16:19:01Z","timestamp":1446740341000},"page":"1-17","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":83,"title":["SemanticPaint"],"prefix":"10.1145","volume":"34","author":[{"given":"Julien","family":"Valentin","sequence":"first","affiliation":[{"name":"University of Oxford, Oxford OX, UK"}]},{"given":"Vibhav","family":"Vineet","sequence":"additional","affiliation":[{"name":"University of Oxford, Oxford OX, UK"}]},{"given":"Ming-Ming","family":"Cheng","sequence":"additional","affiliation":[{"name":"University of Oxford and Nankai University, Nankai, Tianjin, China"}]},{"given":"David","family":"Kim","sequence":"additional","affiliation":[{"name":"Microsoft Research Cambridge, Cambridge, UK"}]},{"given":"Jamie","family":"Shotton","sequence":"additional","affiliation":[{"name":"Microsoft Research Cambridge, Cambridge, UK"}]},{"given":"Pushmeet","family":"Kohli","sequence":"additional","affiliation":[{"name":"Microsoft Research Cambridge, Cambridge, UK"}]},{"given":"Matthias","family":"Nie\u00dfner","sequence":"additional","affiliation":[{"name":"Stanford University, Stanford, CA"}]},{"given":"Antonio","family":"Criminisi","sequence":"additional","affiliation":[{"name":"Microsoft Research Cambridge, Cambridge, UK"}]},{"given":"Shahram","family":"Izadi","sequence":"additional","affiliation":[{"name":"Microsoft Research Cambridge, Oxford OX, UK"}]},{"given":"Philip","family":"Torr","sequence":"additional","affiliation":[{"name":"University of Oxford, Cambridge, UK"}]}],"member":"320","published-online":{"date-parts":[[2015,11,3]]},"reference":[{"key":"e_1_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.5555\/2601307.2601318"},{"key":"e_1_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1177\/0278364912461538"},{"key":"e_1_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cviu.2007.09.014"},{"key":"e_1_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/1557019.1557041"},{"key":"e_1_2_2_5_1","doi-asserted-by":"crossref","unstructured":"U. Bonde V. Badrinarayanan and R. Cipolla. 2013. Multi scale shape index for 3D object recognition. In Scale Space and Variational Methods in Computer Vision. Springer 306--318.  U. Bonde V. Badrinarayanan and R. Cipolla. 2013. Multi scale shape index for 3D object recognition. In Scale Space and Variational Methods in Computer Vision. Springer 306--318.","DOI":"10.1007\/978-3-642-38267-3_26"},{"key":"e_1_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/34.969114"},{"key":"e_1_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1010933404324"},{"key":"e_1_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-88682-2_5"},{"key":"e_1_2_2_9_1","volume-title":"Proceedings of the IEEE International Conference on Robotics and Automation (ICRA'07)","author":"Castle R. O.","unstructured":"R. O. Castle , D. Gawley , G. Klein , and D. W. Murray . 2007. Towards simultaneous recognition, localization and mapping for hand-held and wearable cameras . In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA'07) . R. O. Castle, D. Gawley, G. Klein, and D. W. Murray. 2007. Towards simultaneous recognition, localization and mapping for hand-held and wearable cameras. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA'07)."},{"key":"e_1_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/2461912.2461940"},{"key":"e_1_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1531326.1531379"},{"key":"e_1_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/2682628"},{"key":"e_1_2_2_13_1","unstructured":"C. Couprie C. Farabet L. Najman and Y. Lecun. 2013. Indoor semantic segmentation using depth information. http:\/\/arxiv.org\/abs\/1301.3572.  C. Couprie C. Farabet L. Najman and Y. Lecun. 2013. Indoor semantic segmentation using depth information. http:\/\/arxiv.org\/abs\/1301.3572."},{"key":"e_1_2_2_14_1","doi-asserted-by":"crossref","unstructured":"A. Criminisi and J. Shotton. 2013. Decision Forests for Computer Vision and Medical Image Analysis. Springer.   A. Criminisi and J. Shotton. 2013. Decision Forests for Computer Vision and Medical Image Analysis. Springer.","DOI":"10.1007\/978-1-4471-4929-3"},{"key":"e_1_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/237170.237269"},{"key":"e_1_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2005.177"},{"key":"e_1_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/347090.347107"},{"key":"e_1_2_2_18_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR'10)","author":"Drost B.","unstructured":"B. Drost , M. Ulrich , N. Navar , and S. Ilic . 2010. Model globally, match locally: Efficient and robust 3D object recognition . In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR'10) . B. Drost, M. Ulrich, N. Navar, and S. Ilic. 2010. Model globally, match locally: Efficient and robust 3D object recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR'10)."},{"key":"e_1_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2013.202"},{"key":"e_1_2_2_20_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR'12)","author":"Geiger A.","unstructured":"A. Geiger , P. Lenz , and R. Urtasun . 2012. Are we ready for autonomous driving? The KITTI vision benchmark suite . In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR'12) . A. Geiger, P. Lenz, and R. Urtasun. 2012. Are we ready for autonomous driving? The KITTI vision benchmark suite. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR'12)."},{"key":"e_1_2_2_21_1","volume-title":"Proceedings of the European Conference on Computer Vision (ECCV'10)","author":"Gupta A.","unstructured":"A. Gupta , A. A. Efros , and M. Hebert . 2010. Blocks world revisited: Image understanding using qualitative geometry and mechanics . In Proceedings of the European Conference on Computer Vision (ECCV'10) . A. Gupta, A. A. Efros, and M. Hebert. 2010. Blocks world revisited: Image understanding using qualitative geometry and mechanics. In Proceedings of the European Conference on Computer Vision (ECCV'10)."},{"key":"e_1_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2013.20"},{"key":"e_1_2_2_23_1","volume-title":"Proceedings of the IEEE International Conference on Robotics and Automation (ICRA'14)","author":"Herbst E.","unstructured":"E. Herbst , P. Henry , and D. Fox . 2014. Toward online 3-D object segmentation and mapping . In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA'14) . E. Herbst, P. Henry, and D. Fox. 2014. Toward online 3-D object segmentation and mapping. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA'14)."},{"key":"e_1_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2007.1166"},{"key":"e_1_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/3DIMPVT.2012.12"},{"key":"e_1_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/2047196.2047270"},{"key":"e_1_2_2_27_1","unstructured":"A. Johnson. 1997. Spin-images: A representation for 3-D surface matching. Ph.D. thesis Robotics Institute Carnegie Mellon University.  A. Johnson. 1997. Spin-images: A representation for 3-D surface matching. Ph.D. thesis Robotics Institute Carnegie Mellon University."},{"key":"e_1_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2013.380"},{"key":"e_1_2_2_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/1778765.1778839"},{"key":"e_1_2_2_30_1","volume-title":"Proceedings of the IEEE International Conference on Robotics and Automation (ICRA'13)","author":"Karpathy A.","unstructured":"A. Karpathy , S. Miller , and L. Fei-Fei . 2013. Object discovery in 3D scenes via shape analysis . In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA'13) . A. Karpathy, S. Miller, and L. Fei-Fei. 2013. Object discovery in 3D scenes via shape analysis. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA'13)."},{"key":"e_1_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2013.180"},{"key":"e_1_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/2461912.2461933"},{"key":"e_1_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/2366145.2366157"},{"key":"e_1_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-008-0202-0"},{"key":"e_1_2_2_35_1","unstructured":"D. Koller and N. Friedman. 2009. Probabilistic Graphical Models: Principles and Techniques. MIT Press.   D. Koller and N. Friedman. 2009. Probabilistic Graphical Models: Principles and Techniques. MIT Press."},{"key":"e_1_2_2_36_1","volume-title":"Proceedings of the Conference on Neural Information Processing Systems (NIPS'11)","author":"Koppula H. S.","unstructured":"H. S. Koppula , A. Anand , T. Joachims , and A. Saxena . 2011. Semantic labeling of 3D point clouds for indoor scenes . In Proceedings of the Conference on Neural Information Processing Systems (NIPS'11) . H. S. Koppula, A. Anand, T. Joachims, and A. Saxena. 2011. Semantic labeling of 3D point clouds for indoor scenes. In Proceedings of the Conference on Neural Information Processing Systems (NIPS'11)."},{"key":"e_1_2_2_37_1","volume-title":"Proceedings of the Conference on Neural Information Processing Systems (NIPS'11)","author":"Kr\u00e4henb\u00fchl P.","unstructured":"P. Kr\u00e4henb\u00fchl and V. Koltun . 2011. Efficient inference in fully connected CRFs with Gaussian edge potentials . In Proceedings of the Conference on Neural Information Processing Systems (NIPS'11) . P. Kr\u00e4henb\u00fchl and V. Koltun. 2011. Efficient inference in fully connected CRFs with Gaussian edge potentials. In Proceedings of the Conference on Neural Information Processing Systems (NIPS'11)."},{"key":"e_1_2_2_38_1","volume-title":"Proceedings of the Conference on Neural Information Processing Systems (NIPS'12)","author":"Krizhevsky A.","unstructured":"A. Krizhevsky , I. Sutskever , and G. Hinton . 2012. ImageNet classification with deep convolutional neural networks . In Proceedings of the Conference on Neural Information Processing Systems (NIPS'12) . A. Krizhevsky, I. Sutskever, and G. Hinton. 2012. ImageNet classification with deep convolutional neural networks. In Proceedings of the Conference on Neural Information Processing Systems (NIPS'12)."},{"key":"e_1_2_2_39_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-011-0489-0"},{"key":"e_1_2_2_40_1","volume-title":"Proceedings of the 18th International Conference on Machine Learning (ICML'01)","author":"Lafferty J.","unstructured":"J. Lafferty , A. Mccallum , and F. C. Pereira . 2001. Conditional random fields: Probabilistic models for segmenting and labeling sequence data . In Proceedings of the 18th International Conference on Machine Learning (ICML'01) . 282--289. J. Lafferty, A. Mccallum, and F. C. Pereira. 2001. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of the 18th International Conference on Machine Learning (ICML'01). 282--289."},{"key":"e_1_2_2_41_1","volume-title":"Proceedings of the IEEE International Conference on Robotics and Automation (ICRA'11)","author":"Lai K.","unstructured":"K. Lai , L. Bo , X. Ren , and D. Fox . 2011. A large-scale hierarchical multi-view RGB-D object dataset . In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA'11) . K. Lai, L. Bo, X. Ren, and D. Fox. 2011. A large-scale hierarchical multi-view RGB-D object dataset. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA'11)."},{"key":"e_1_2_2_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2006.188"},{"key":"e_1_2_2_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/344779.344849"},{"key":"e_1_2_2_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2013.179"},{"key":"e_1_2_2_45_1","doi-asserted-by":"publisher","DOI":"10.1145\/2461912.2461969"},{"key":"e_1_2_2_46_1","doi-asserted-by":"publisher","DOI":"10.5555\/850924.851523"},{"key":"e_1_2_2_47_1","doi-asserted-by":"publisher","DOI":"10.1145\/2010324.1964982"},{"key":"e_1_2_2_48_1","doi-asserted-by":"publisher","DOI":"10.1145\/2366145.2366156"},{"key":"e_1_2_2_49_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISMAR.2011.6092378"},{"key":"e_1_2_2_50_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2011.6126513"},{"key":"e_1_2_2_51_1","doi-asserted-by":"publisher","DOI":"10.1145\/2508363.2508374"},{"key":"e_1_2_2_52_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-007-0086-4"},{"key":"e_1_2_2_53_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10514-009-9110-6"},{"key":"e_1_2_2_54_1","volume-title":"Proceedings of the International Symposium on Mixed and Augmented Reality (ISMAR'13)","author":"Pradeep V.","unstructured":"V. Pradeep , C. Rhemann , S. Izadi , C. Zach , M. Bleyer , and S. Bathiche . 2013. MonoFusion: Real-time 3D reconstruction of small scenes with a single Web camera . In Proceedings of the International Symposium on Mixed and Augmented Reality (ISMAR'13) . 83--88. V. Pradeep, C. Rhemann, S. Izadi, C. Zach, M. Bleyer, and S. Bathiche. 2013. MonoFusion: Real-time 3D reconstruction of small scenes with a single Web camera. In Proceedings of the International Symposium on Mixed and Augmented Reality (ISMAR'13). 83--88."},{"key":"e_1_2_2_55_1","doi-asserted-by":"crossref","unstructured":"F. Ramos J. Nieto and H. Durrant-Whyte. 2008. Combining object recognition and SLAM for extended map representations. In Experimental Robotics. Springer 55--64.  F. Ramos J. Nieto and H. Durrant-Whyte. 2008. Combining object recognition and SLAM for extended map representations. In Experimental Robotics. Springer 55--64.","DOI":"10.1007\/978-3-540-77457-0_6"},{"key":"e_1_2_2_56_1","volume-title":"Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR'12)","author":"Ren X.","unstructured":"X. Ren , L. Bo , and D. Fox . 2012. ROB-(D) scene labeling: Features and algorithms . In Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR'12) . X. Ren, L. Bo, and D. Fox. 2012. ROB-(D) scene labeling: Features and algorithms. In Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR'12)."},{"key":"e_1_2_2_58_1","doi-asserted-by":"publisher","DOI":"10.1145\/1015706.1015720"},{"key":"e_1_2_2_59_1","doi-asserted-by":"publisher","DOI":"10.1145\/566654.566600"},{"key":"e_1_2_2_60_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-007-0090-8"},{"key":"e_1_2_2_61_1","volume-title":"Proceedings of the IEEE International Conference on Computer Vision Workshop (ICCVW'09)","author":"Saffari A.","unstructured":"A. Saffari , C. Leistner , J. Santner , M. Godec , and H. Bischop . 2009. On-line random forests . In Proceedings of the IEEE International Conference on Computer Vision Workshop (ICCVW'09) . A. Saffari, C. Leistner, J. Santner, M. Godec, and H. Bischop. 2009. On-line random forests. In Proceedings of the IEEE International Conference on Computer Vision Workshop (ICCVW'09)."},{"key":"e_1_2_2_62_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2013.178"},{"key":"e_1_2_2_63_1","volume-title":"Proceedings of the IEEE International Conference on Robotics and Automation (ICRA'13)","author":"Sengupta S.","unstructured":"S. Sengupta , E. Greveson , A. Shahrokni , and P. H. Torr . 2013. Urban 3D semantic modelling using stereo vision . In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA'13) . S. Sengupta, E. Greveson, A. Shahrokni, and P. H. Torr. 2013. Urban 3D semantic modelling using stereo vision. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA'13)."},{"key":"e_1_2_2_64_1","doi-asserted-by":"publisher","DOI":"10.1109\/3DV.2013.12"},{"key":"e_1_2_2_65_1","doi-asserted-by":"publisher","DOI":"10.1145\/2366145.2366155"},{"key":"e_1_2_2_66_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-009-0279-0"},{"key":"e_1_2_2_67_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-88693-8_44"},{"key":"e_1_2_2_68_1","doi-asserted-by":"publisher","DOI":"10.1145\/2366145.2366199"},{"key":"e_1_2_2_69_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2011.5995316"},{"key":"e_1_2_2_70_1","doi-asserted-by":"publisher","DOI":"10.1007\/11744023_1"},{"key":"e_1_2_2_71_1","volume-title":"Proceedings of the IEEE International Conference on Computer Vision Workshop (ICCVW'11)","author":"Silberman N.","unstructured":"N. Silberman and R. Fergus . 2011. Indoor scene segmentation using a structured light sensor . In Proceedings of the IEEE International Conference on Computer Vision Workshop (ICCVW'11) . N. Silberman and R. Fergus. 2011. Indoor scene segmentation using a structured light sensor. In Proceedings of the IEEE International Conference on Computer Vision Workshop (ICCVW'11)."},{"key":"e_1_2_2_72_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-33715-4_54"},{"key":"e_1_2_2_73_1","doi-asserted-by":"publisher","DOI":"10.1145\/1141911.1141964"},{"key":"e_1_2_2_74_1","doi-asserted-by":"crossref","unstructured":"J. St\u00fcckler B. Waldvogel H. Schultz and S. Behnke. 2013. Dense real-time mapping of object-class semantics from RGB-D video. http:\/\/www.ais.uni-bonn.de\/papers\/JRTIP_2014_Stueckler_RT_SemanticSLAM.pdf.  J. St\u00fcckler B. Waldvogel H. Schultz and S. Behnke. 2013. Dense real-time mapping of object-class semantics from RGB-D video. http:\/\/www.ais.uni-bonn.de\/papers\/JRTIP_2014_Stueckler_RT_SemanticSLAM.pdf.","DOI":"10.1007\/s11554-013-0379-5"},{"key":"e_1_2_2_75_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2013.269"},{"key":"e_1_2_2_76_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPR'08)","author":"Vineet V.","unstructured":"V. Vineet and P. Narayanan . 2008. CUDA cuts: Fast graph cuts on the GPU . In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPR'08) . 1--8. V. Vineet and P. Narayanan. 2008. CUDA cuts: Fast graph cuts on the GPU. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPR'08). 1--8."},{"key":"e_1_2_2_77_1","doi-asserted-by":"publisher","DOI":"10.1145\/3147.3165"},{"key":"e_1_2_2_78_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-10590-1_32"},{"key":"e_1_2_2_79_1","volume-title":"thesis","author":"Xiao J.","unstructured":"J. Xiao . 2014. A 2D+3D rich data approach to scene understanding. Ph . D. thesis , Massachusetts Institute of Technology . J. Xiao. 2014. A 2D+3D rich data approach to scene understanding. Ph.D. thesis, Massachusetts Institute of Technology."},{"key":"e_1_2_2_80_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR'10)","author":"Xiao J.","unstructured":"J. Xiao , J. Hays , K. A. Ehinger , A. Oliva , and A. Torralba . 2010. SUN database: Large-scale scene recognition from abbey to zoo . In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR'10) . J. Xiao, J. Hays, K. A. Ehinger, A. Oliva, and A. Torralba. 2010. SUN database: Large-scale scene recognition from abbey to zoo. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR'10)."},{"key":"e_1_2_2_81_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2013.458"},{"key":"e_1_2_2_82_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR'12)","author":"Yao A.","unstructured":"A. Yao , J. Gall , C. Leistner , and L. Van Gool . 2012. Interactive object detection . In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR'12) . 3242--3249. A. Yao, J. Gall, C. Leistner, and L. Van Gool. 2012. Interactive object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR'12). 3242--3249."}],"container-title":["ACM Transactions on Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2751556","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2751556","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T05:43:07Z","timestamp":1750225387000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2751556"}},"subtitle":["Interactive 3D Labeling and Learning at your Fingertips"],"short-title":[],"issued":{"date-parts":[[2015,11,3]]},"references-count":81,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2015,11,3]]}},"alternative-id":["10.1145\/2751556"],"URL":"https:\/\/doi.org\/10.1145\/2751556","relation":{},"ISSN":["0730-0301","1557-7368"],"issn-type":[{"value":"0730-0301","type":"print"},{"value":"1557-7368","type":"electronic"}],"subject":[],"published":{"date-parts":[[2015,11,3]]},"assertion":[{"value":"2014-08-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2015-03-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2015-11-03","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}