ABCDEFGHIJLMNOPQRSTUVWXYZAAAB
1
This document is partially derived from compilations by Justin Salamon (https://bioacousticsdatasets.weebly.com/) and Dan Morris (https://lila.science/otherdatasets)

For a complete list of machine learning-ready bioacoustics datasets, please refer to the excellent compilation by Céline Angonin (https://bioacoustic-ai.github.io/bioacoustics-datasets/)

Questions/additions? Please contact Tessa Rhinehart (tessa.rhinehart@pitt.edu)
2
Dataset Name (link)PublicationSourceClasses (e.g. bird species, non-bird species, "bird or not", sound type)Number of bird speciesRecording typeWeak or strong labeledAudio filesLabelsDuration (hours)SitesLocationCommentsSummarized annotation file location
3
BirdVox-full-nightLostanlen 2018NYU/Cornell2[detections only]ContinuousStrong635,402626Ithaca, NY, USADerivative datasets: BirdVox-70k, BirdVox-DCASE-20k; also included at least in part in BirdVox-ANAFCC and BirdVox-14SD
4
CLO-43SDSalamon 2016NYU/Cornell4343ClipsStrong5,4285,428Ithaca, NY and New York, NYIncluded at least in part in BirdVox-ANAFCC and BirdVox-14SD
5
CLO-SWTHSalamon 2016NYU/Cornell21ClipsStrong179,111179,111Ithaca, NY and New York, NYIncluded at least in part in BirdVox-ANAFCC and BirdVox-14SD
6
CLO-WTSPSalamon 2016NYU/Cornell21ClipsStrong16,70316,703Ithaca, NY and New York, NYIncluded at least in part in BirdVox-ANAFCC and BirdVox-14SD
7
ff1010birdStowell 2018QMUL2[detections only]ClipsStrong7,6907,69021.4
8
NIPS4BplusMorfi 2019QMUL8761ClipsStrong6745,878139Central France, Southern France, Spain
9
PicidaeDatasetVidaña-Vila 2017Universitat Ramon Llull137ClipsStrong1,6691,6691.4
Source is Xeno-Canto
10
warblrb10kStowell 2018QMUL2[detections only]ClipsStrong8,0008,00022
11
PowdermillChronister 2022University of Pittsburgh48ContinuousStrong416,0526.44Rector, PA, USA
None; see /media/emu/datasets/annotated/pnre_ecy3329/annotation_Files for individual files
12
IthacaKahl 2022Cornell81ContinuousStrong50,76028530Ithaca, NY, USAUsed in BirdCLEF
/media/emu/datasets/annotated/cornell_ithaca/annotations.csv
"Species eBird Code"
13
Coffee Farms
Vega-Hidalgo 2023
Cornell, Universidad de Costa Rica, Universidad de Antioquia89ContinuousStrong346,95234Jardín, Colombia and San Ramon, Costa RicaUsed in BirdCLEF
/media/emu/datasets/annotated/cornell_coffeefarm/annotations.csv
14
Sierra NevadaClapp 2023Cornell, IBP, NPS21ContinuousStrong10010,29616.710Sequoia & Kings Canyon NP, CA, USAUsed in BirdCLEF/Kaggle
/media/emu/datasets/annotated/cornell_sierranevada/annotations.csv
15
Southwestern Amazon BasinHopping 2022Cornell132ContinuousStrong2114,798217Inkaterra Reserva Amazonica, Madre de Dios, Peru
/media/emu/datasets/annotated/cornell_amazon/annotations.csv
16
Western United StatesKahl 2022Cornell, San Jose State Research Foundation, University of Wisconsin-Madison56ContinuousStrong3320,14733Lassen and Plumas National Forests, CA, USAUsed in BirdCLEF 2021
/media/emu/datasets/annotated/cornell_western/annotations.csv
17
HawaiiNavine 2022University of Hawai'i at Hilo, Cornell27ContinuousStrong59,58351Hawai'i, USAUsed in BirdCLEF 2022
/media/emu/datasets/annotated/cornell_hawaii/annotations.csv
18
PNWWeldy 2023Oregon State University, Google DeepMind, Conservation Metrics11858ContinuousStrong14139,71711.75525California, Oregon, & WashingtonIncludes township and range identifications, 38 environmental covariates, 215 partially annotated files, and ~1200 unlabeled recordings
19
ArcticBirdSoundsChristin 2023University of Moncton, ECCC, USFWS, Aaerhug University, McGill University49ContinuousStrong12,9332015Arctic from Alaska to Greenland
20
WildTraxN/AUniversity of Alberta, many collaboratorsContinuous, ClipsStrong750,000Worldwide, focus on CanadaHas 750,000 clips with first label
21
Xeno-cantoN/AXeno-canto Foundation, Naturalis Biodiversity CenterClipsWeakN/AWorldwide
22
Macaulay LibraryN/ACornellClipsWeakN/AWorldwide
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100