CLARIN Virtual Language Observatory

Welcome to the VLO!

Use the search bar below to start searching through hundreds of thousands of language resources, or continue to browse everything and use facets to narrow down to your area of interest or discover new resources.

Showing all records (560,651 results)
Search results include 929,896 records, of which 369,245 are hidden because of duplicate naming.

Facets

Use the categories below to limit the search results to those matching the selected value(s).

  • Language
  • Collection
  • Resource type
  • Modality
  • File type
  • Keyword
  • Genre
  • Subject
  • Country
  • Organisation
  • Metadata provider
  • National project

Search options

Temporal Coverage

Availability

These levels provide an indication of the degree to which resources and tools are publicly accessible. Please check the specific conditions on any resource or tool that you end up using.

Search results

SmartKom Audio

(Part of Bavarian Archive for Speech Signals (BAS))
  • 447
  • 1

This corpus contains the audio recordings of all actors who use the SmartKom system; it covers the audio recordings (no video) and annotations of all three original SmartKom corpora Public, Mobile and Home. Naive users were asked to test a 'prototype' for a market study not knowing that the system was in fact controlle…

German English

aGender

(Part of Bavarian Archive for Speech Signals (BAS))
  • 3614
  • 1

The speech corpus aGender contains speech sample recordings over public telephone lines with read and (semi-)spontaneous speech. Native German speakers called a voice portal from their private phone, read text, and answered some open questions. The purpose of the corpus is the automatic detection of gender and/or age …

English German

BAS HEMPEL

(Part of Bavarian Archive for Speech Signals (BAS))
  • 3903
  • 1
  • 1

Hempels Sofa is a collection of more than 3900 spontaneous speech items recorded as extra material during the German SpeechDat-II project. Speakers were asked to report what they had been doing during the last hour: "Was haben Sie in der letzten Stunde gemacht?". This item was recorded as the last item of the recording…

English German

Nautilus Speaker Characterization

(Part of Bavarian Archive for Speech Signals (BAS))
  • 300
  • 1

NSC contains scripted, semi-spontaneous, and spontaneous human-human dialogs. In total, 300 speakers of German without noticeable accent participated and were recorded in an acoustically-isolated room. Interactions between speakers and their interlocutor are provided in separate mono files, accompanied by timestamps an…

English German

BAS RVG1_CLARIN

(Part of Bavarian Archive for Speech Signals (BAS))
  • 500
  • 1
  • 1

The corpus is a collection of more than 500 speakers of different dialect regions of Germany. The recordings were made using four different microphones (two in low and two in high quality) and consist of single digits, connected digits, phone numbers, phonetically balanced sentences, computer command phrases prompted o…

English German

Dissertation Data Dr. Veronika Neumeyer: Sibilant Production in Cochlear Implant Patients

(Part of Bavarian Archive for Speech Signals (BAS))
  • 96
  • 1
  • 1

The CI_2 corpora contain synchronous speech recordings of 48 cochlear implant users (CI) and 48 speakers without hearing impairment (control group, KG). The data were analyzed in Veronika Neumeyer's dissertation "Akustische Analysen der Sprachproduktion von CI-Trägern" (2015). CI_2_Sibilants contains recordings used fo…

English German

BAS VERIF1DE

(Part of Bavarian Archive for Speech Signals (BAS))
  • 3000
  • 1

The VERIF1DE database is a subset of the VERIDAT speaker verification database collected by T-Nova. VERIDAT contains additional items and re-recordings of missing, corrupted, or otherwise unusable files in VERIF1DE. Please refer to the file DESIGN.PDF in the documentation package of this corpus for a detailed descripti…

English German

BAS SC1

(Part of Bavarian Archive for Speech Signals (BAS))
  • 88
  • 1
  • 1

The corpus contains speech of 88 different speakers, reading the German story 'Der Nordwind und die Sonne'. Subcorpus T contains the recordings of 16 native Germans (L1). The other 72 speakers, who were born and educated in other countries (L2), are pooled in subcorpus C. Every speaker has a distinct accent. This corpu…

English German

BAS SC10

(Part of Bavarian Archive for Speech Signals (BAS))
  • 70
  • 1
  • 1

The SC10 corpus contains read and non-prompted German and mother tongue speech of 70 different speakers from 17 mother tongues (L1) in a variety of speaking styles, e.g. reading, retelling, and free talk. Starting from version 1.5 (BAS CLARIN repository version 3), the corpus is distributed as an emuDB. BAS CLARIN repos…

English German

BAS ZIPTEL

(Part of Bavarian Archive for Speech Signals (BAS))
  • 1957
  • 1
  • 1

The ZipTel telephone speech database contains recordings of people applying for a SpeechDat prompt sheet via telephone. For the SpeechDat data collection, calls for participation were published in "phone", the customer magazine of the mobile telephone provider "e-plus", and in numerous newspapers all over Germany. In t…

English German

Service provided by CLARIN

v4.13.0