Skip to main content

Advertisement

Springer Nature Link
Log in
Menu
Find a journal Publish with us Track your research
Search
Saved research
Cart
  1. Home
  2. Computer Vision – ECCV 2006
  3. Conference paper

Adapted Vocabularies for Generic Visual Categorization

  • Conference paper
  • pp 464–475
  • Cite this conference paper
Computer Vision – ECCV 2006 (ECCV 2006)
Adapted Vocabularies for Generic Visual Categorization
  • Florent Perronnin19,
  • Christopher Dance19,
  • Gabriela Csurka19 &
  • …
  • Marco Bressan19 

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 3954))

Included in the following conference series:

  • European Conference on Computer Vision
  • 5326 Accesses

  • 164 Citations

  • 9 Altmetric

Abstract

Several state-of-the-art Generic Visual Categorization (GVC) systems are built around a vocabulary of visual terms and characterize images with one histogram of visual word counts. We propose a novel and practical approach to GVC based on a universal vocabulary, which describes the content of all the considered classes of images, and class vocabularies obtained through the adaptation of the universal vocabulary using class-specific data. An image is characterized by a set of histograms – one per class – where each histogram describes whether the image content is best modeled by the universal vocabulary or the corresponding class vocabulary. It is shown experimentally on three very different databases that this novel representation outperforms those approaches which characterize an image with a single histogram.

Download to read the full chapter text

Chapter PDF

Similar content being viewed by others

Enhancing high-vocabulary image annotation with a novel attention-based pooling

Article 24 September 2024

Webly Supervised Concept Expansion for General Purpose Vision Models

Chapter © 2022

Incremental Estimation of Visual Vocabulary Size for Image Retrieval

Chapter © 2017

Explore related subjects

Discover the latest articles, books and news in related subjects, suggested using machine learning.
  • Categorization
  • Computer Vision
  • Object Recognition
  • Object vision
  • Pattern vision
  • Visual Perception
  • Neural Network Approaches for Image Recognition

References

  1. Amir, A., Argillander, J., Berg, M., Chang, S.-F., Franz, M., Hsu, W., Iyengar, G., Kender, J., Kennedy, L., Lin, C.-Y., Naphade, M., Natsev, A., Smith, J., Tesic, J., Wu, G., Yang, R., Zhang, D.: IBM research TRECVID-2004 video retrieval system. In: Proc. of TREC Video Retrieval Evaluation (2004)

    Google Scholar 

  2. Bilmes, J.: A gentle tutorial of the EM algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models. Technical report, Department of Electrical Engineering and Computer Science, UC Berkeley (1998)

    Google Scholar 

  3. Chen, Y., Wang, J.Z.: Image categorization by learning and reasoning with regions. Journal of Machine Mearning Research 5, 913–939 (2004)

    MathSciNet  Google Scholar 

  4. Csurka, G., Dance, C., Fan, L., Willamowski, J., Bray, C.: Visual categorization with bags of keypoints. In: Proc. of ECCV Workshop on Statistical Learning for Computer Vision (2004)

    Google Scholar 

  5. Dempster, A., Laird, N., Rubin, D.: Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society 39(1), 1–38 (1977)

    MathSciNet  MATH  Google Scholar 

  6. Deselaers, T., Keysers, D., Ney, H.: Classification error rate for quantitative evaluation of content-based image retrieval systems. In: Proc. of ICPR (2004)

    Google Scholar 

  7. Farquhar, J., Szedmak, S., Meng, H., Shawe-Taylor, J.: Improving “bag-of-keypoints” image categorisation. Technical report, University of Southampton (2005)

    Google Scholar 

  8. Gauvain, J.-L., Lee, C.-H.: Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains. IEEE Trans. on Speech and Audio Processing 2(2), 291–298 (1994)

    Article  Google Scholar 

  9. Hsu, W.H., Chang, S.-F.: Visual cue cluster construction via information bottleneck principle and kernel density estimation. In: Proc. of CIVR (2005)

    Google Scholar 

  10. Leung, T., Malik, J.: Recognizing surfaces using three-dimensional textons. In: Proc. of ICCV (1999)

    Google Scholar 

  11. Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. Journal of Computer Vision 60(2), 91–110 (2004)

    Article  Google Scholar 

  12. Reynolds, D., Quatieri, T., Dunn, R.: Speaker verification using adapted Gaussian mixture models. Digital Signal Processing 10, 19–41 (2000)

    Article  Google Scholar 

  13. Salton, G., McGill, M.: Introduction to Modern Information Retrieval. McGraw-Hill, New York (1983)

    MATH  Google Scholar 

  14. Sivic, J.S., Zisserman, A.: Video google: A text retrieval approach to object matching in videos. In: Proc. of ICCV, vol. 2, pp. 1470–1477 (2003)

    Google Scholar 

  15. Varma, M., Zisserman, A.: A statistical approach to texture classification from single images. Int. Journal of Computer Vision 62(1–2), 61–81 (2005)

    Article  Google Scholar 

  16. Winn, K., Criminisi, A., Minka, T.: Object categorization by learned visual dictionary. In: Proc. of ICCV (2005)

    Google Scholar 

  17. Zhang, J., Marszalek, M., Lazebnik, S., Schmid, C.: Local features and kernels for classification of texture and object categories: an in-depth study. INRIA, Research report 5737 (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

  1. Xerox Research Centre Europe, 6, chemin de Maupertuis, 38240, Meylan, France

    Florent Perronnin, Christopher Dance, Gabriela Csurka & Marco Bressan

Authors
  1. Florent Perronnin
    View author publications

    Search author on:PubMed Google Scholar

  2. Christopher Dance
    View author publications

    Search author on:PubMed Google Scholar

  3. Gabriela Csurka
    View author publications

    Search author on:PubMed Google Scholar

  4. Marco Bressan
    View author publications

    Search author on:PubMed Google Scholar

Editor information

Editors and Affiliations

  1. University of Ljubljana, Slovenia

    Aleš Leonardis

  2. Institute for Computer Graphics and Vision, TU Graz, Inffeldgasse 16, 8010, Graz, Austria

    Horst Bischof

  3. Vision-based Measurement Group, Inst. of El. Measurement and Meas. Sign. Proc. Graz, University of Technology, Austria

    Axel Pinz

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Perronnin, F., Dance, C., Csurka, G., Bressan, M. (2006). Adapted Vocabularies for Generic Visual Categorization. In: Leonardis, A., Bischof, H., Pinz, A. (eds) Computer Vision – ECCV 2006. ECCV 2006. Lecture Notes in Computer Science, vol 3954. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11744085_36

Download citation

  • .RIS
  • .ENW
  • .BIB
  • DOI: https://doi.org/10.1007/11744085_36

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-33838-3

  • Online ISBN: 978-3-540-33839-0

  • eBook Packages: Computer ScienceComputer Science (R0)Springer Nature Proceedings Computer Science

Share this paper

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Keywords

  • Feature Vector
  • Gaussian Mixture Model
  • Visual Word
  • Speaker Recognition
  • Visual Vocabulary

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Publish with us

Policies and ethics

Search

Navigation

  • Find a journal
  • Publish with us
  • Track your research

Footer Navigation

Discover content

  • Journals A-Z
  • Books A-Z

Publish with us

  • Journal finder
  • Publish your research
  • Language editing
  • Open access publishing

Products and services

  • Our products
  • Librarians
  • Societies
  • Partners and advertisers

Our brands

  • Springer
  • Nature Portfolio
  • BMC
  • Palgrave Macmillan
  • Apress
  • Discover

Corporate Navigation

  • Your US state privacy rights
  • Accessibility statement
  • Terms and conditions
  • Privacy policy
  • Help and support
  • Legal notice
  • Cancel contracts here

162.0.217.198

Not affiliated

Springer Nature

© 2026 Springer Nature