Adapted Vocabularies for Generic Visual Categorization

Perronnin, Florent; Dance, Christopher; Csurka, Gabriela; Bressan, Marco

doi:10.1007/11744085_36

Florent Perronnin¹⁹,
Christopher Dance¹⁹,
Gabriela Csurka¹⁹ &
…
Marco Bressan¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 3954))

Included in the following conference series:

European Conference on Computer Vision

5326 Accesses
164 Citations
9 Altmetric

Abstract

Several state-of-the-art Generic Visual Categorization (GVC) systems are built around a vocabulary of visual terms and characterize images with one histogram of visual word counts. We propose a novel and practical approach to GVC based on a universal vocabulary, which describes the content of all the considered classes of images, and class vocabularies obtained through the adaptation of the universal vocabulary using class-specific data. An image is characterized by a set of histograms – one per class – where each histogram describes whether the image content is best modeled by the universal vocabulary or the corresponding class vocabulary. It is shown experimentally on three very different databases that this novel representation outperforms those approaches which characterize an image with a single histogram.

Download to read the full chapter text

Chapter PDF

Enhancing high-vocabulary image annotation with a novel attention-based pooling

Article 24 September 2024

Webly Supervised Concept Expansion for General Purpose Vision Models

Incremental Estimation of Visual Vocabulary Size for Image Retrieval

References

Amir, A., Argillander, J., Berg, M., Chang, S.-F., Franz, M., Hsu, W., Iyengar, G., Kender, J., Kennedy, L., Lin, C.-Y., Naphade, M., Natsev, A., Smith, J., Tesic, J., Wu, G., Yang, R., Zhang, D.: IBM research TRECVID-2004 video retrieval system. In: Proc. of TREC Video Retrieval Evaluation (2004)
Google Scholar
Bilmes, J.: A gentle tutorial of the EM algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models. Technical report, Department of Electrical Engineering and Computer Science, UC Berkeley (1998)
Google Scholar
Chen, Y., Wang, J.Z.: Image categorization by learning and reasoning with regions. Journal of Machine Mearning Research 5, 913–939 (2004)
MathSciNet Google Scholar
Csurka, G., Dance, C., Fan, L., Willamowski, J., Bray, C.: Visual categorization with bags of keypoints. In: Proc. of ECCV Workshop on Statistical Learning for Computer Vision (2004)
Google Scholar
Dempster, A., Laird, N., Rubin, D.: Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society 39(1), 1–38 (1977)
MathSciNet MATH Google Scholar
Deselaers, T., Keysers, D., Ney, H.: Classification error rate for quantitative evaluation of content-based image retrieval systems. In: Proc. of ICPR (2004)
Google Scholar
Farquhar, J., Szedmak, S., Meng, H., Shawe-Taylor, J.: Improving “bag-of-keypoints” image categorisation. Technical report, University of Southampton (2005)
Google Scholar
Gauvain, J.-L., Lee, C.-H.: Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains. IEEE Trans. on Speech and Audio Processing 2(2), 291–298 (1994)
Article Google Scholar
Hsu, W.H., Chang, S.-F.: Visual cue cluster construction via information bottleneck principle and kernel density estimation. In: Proc. of CIVR (2005)
Google Scholar
Leung, T., Malik, J.: Recognizing surfaces using three-dimensional textons. In: Proc. of ICCV (1999)
Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. Journal of Computer Vision 60(2), 91–110 (2004)
Article Google Scholar
Reynolds, D., Quatieri, T., Dunn, R.: Speaker verification using adapted Gaussian mixture models. Digital Signal Processing 10, 19–41 (2000)
Article Google Scholar
Salton, G., McGill, M.: Introduction to Modern Information Retrieval. McGraw-Hill, New York (1983)
MATH Google Scholar
Sivic, J.S., Zisserman, A.: Video google: A text retrieval approach to object matching in videos. In: Proc. of ICCV, vol. 2, pp. 1470–1477 (2003)
Google Scholar
Varma, M., Zisserman, A.: A statistical approach to texture classification from single images. Int. Journal of Computer Vision 62(1–2), 61–81 (2005)
Article Google Scholar
Winn, K., Criminisi, A., Minka, T.: Object categorization by learned visual dictionary. In: Proc. of ICCV (2005)
Google Scholar
Zhang, J., Marszalek, M., Lazebnik, S., Schmid, C.: Local features and kernels for classification of texture and object categories: an in-depth study. INRIA, Research report 5737 (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

Xerox Research Centre Europe, 6, chemin de Maupertuis, 38240, Meylan, France
Florent Perronnin, Christopher Dance, Gabriela Csurka & Marco Bressan

Authors

Florent Perronnin
View author publications
Search author on:PubMed Google Scholar
Christopher Dance
View author publications
Search author on:PubMed Google Scholar
Gabriela Csurka
View author publications
Search author on:PubMed Google Scholar
Marco Bressan
View author publications
Search author on:PubMed Google Scholar

Editor information

Editors and Affiliations

University of Ljubljana, Slovenia
Aleš Leonardis
Institute for Computer Graphics and Vision, TU Graz, Inffeldgasse 16, 8010, Graz, Austria
Horst Bischof
Vision-based Measurement Group, Inst. of El. Measurement and Meas. Sign. Proc. Graz, University of Technology, Austria
Axel Pinz

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Perronnin, F., Dance, C., Csurka, G., Bressan, M. (2006). Adapted Vocabularies for Generic Visual Categorization. In: Leonardis, A., Bischof, H., Pinz, A. (eds) Computer Vision – ECCV 2006. ECCV 2006. Lecture Notes in Computer Science, vol 3954. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11744085_36

Download citation

DOI: https://doi.org/10.1007/11744085_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33838-3
Online ISBN: 978-3-540-33839-0
eBook Packages: Computer ScienceComputer Science (R0)Springer Nature Proceedings Computer Science

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Publish with us

Policies and ethics