Jan Winkler committed on
Commit
f0c12aa
1 Parent(s): 719aa88

update (#8)

Files changed (2)
  1. README.md +1 -1
  2. docs/data-sets.md +56 -54
README.md CHANGED
@@ -66,4 +66,4 @@ now you shoold see the running docker containers.
 
  ## awesome data overview
 
- ![](./docs/data_overview_yuri.jpg)
+ ![](./docs/data_overview_yuri.jpg)
docs/data-sets.md CHANGED
@@ -1,116 +1,118 @@
  # Audio datasets
 
-
  > We have automatic queries running that will keep updating and uploading more unlabelled data over the next weeks. So this dataset is still growing. If not stated differently, all data was sampled at a frequency of 48KHz. The document sampling_sites has more information on local observations.
 
  This audio data was collected by mentors [email protected], David Dao, [email protected] and @Marina Gatto
 
  The data was collected through two sensors:
+
  - Audiomoths
  - SongMeters
 
- ***likely_bird_songs_in_Ingles***
+ **_likely_bird_songs_in_Ingles_**
  This file contains a list of likely bird songs to occur in the area of Ingles (Northern Rio Negro) and was curated by our domain experts. It is not exclusive.
 
- ***Ingles***
- All audiomoth collections of soundscapes in Ingles.
+ **_Ingles_**
+ All audiomoth collections of soundscapes in Ingles.
  This contains the primary forest (labeled as primary) as well as in an Inn (with direct human pressure).
  The audiomoth in the primary forest was recording up to a maximum frequency of 192Khz (creating a much larger dataset) with the hope of capturing ultrasonic soundscapes (bats)
 
- ***SongMeters_Ingles_Primary***
+ **_SongMeters_Ingles_Primary_**
  All song meter recordings of the soundscapes of the primary forest of Ingles.
 
- ***SongMeter_Clusters***
+ **_SongMeter_Clusters_**
  SongMeter Clusters are unsupervised detected clusters (run by Mentor [email protected]). They contain frequently occurring patterns detected in the SongMeter recordings in the frequency range of 2-5KHz.
 
- ***Inhaa-Be***
+ **_Inhaa-Be_**
  3K min. All audiomoth collections of soundscapes in a forest close to Inhaa-Be.
- Inhaa-Be is an Indigenous Village, with a protected forest.
+ Inhaa-Be is an Indigenous Village, with a protected forest.
 
- ***ParqueDasTribos***
- 2K min. All audiomoth collections of soundscapes of Parque Das Tribos.
+ **_ParqueDasTribos_**
+ 2K min. All audiomoth collections of soundscapes of Parque Das Tribos.
  Parque das Tribos is an Indigenous Urban Village at the outskirts of Manaus. Most recordings are close to roads and human pressure.
 
- ***Xingu***
+ **_Xingu_**
  14K min. All this data was collected in a previous trip in Para State (a different state than Manaus) in October to November 2023. It is deployed in a protected territory far away from any human pressure. None of the provided global layers overlap with this area but can be requested if needed.
 
- ***XenoCanto/Amazonas***
+ **_XenoCanto/Amazonas_**
  All XenoCanto songs that are originating from “Amazonas” (can include Colombia etc). This folder is split between frogs and birds.
 
- ***XenoCanto/Greater Manaus***
+ **_XenoCanto/Greater Manaus_**
  A smaller subset of XenoCanto songs, containing all observations of the Upper Rio Negro area (overlapping with Manaus, Inhaa-Be, Ingles and Parque Das Tribos)
 
-
-
  ## primary 1,2 (including ultrasonic)
- - location:
- - In the primary forest
- - files
- - WAV
- - each 44 MB
- - Sample rate (Hz) : 384000
- - Sleep duration (s) : 240
- - Recording duration (s) : 60
 
+ - location:
+ - In the primary forest
+ - files
 
+ - WAV
+ - each 44 MB
+ - Sample rate (Hz) : 384000
+ - Sleep duration (s) : 240
+ - Recording duration (s) : 60
 
  - links:
- - primary 1: https://drive.google.com/open?id=1VCP9VDDtm6-u0lPsm6Pi_A0nBNvQjPv9&usp=drive_copy
 
- - primary 2: https://drive.google.com/open?id=1sIeJ_1GBHuU7GO5CU6WVXZycU7I5YA_B&usp=drive_copy
+ - primary 1: https://drive.google.com/open?id=1VCP9VDDtm6-u0lPsm6Pi_A0nBNvQjPv9&usp=drive_copy
 
+ - primary 2: https://drive.google.com/open?id=1sIeJ_1GBHuU7GO5CU6WVXZycU7I5YA_B&usp=drive_copy
 
  ## SoundMeters_Ingles_Primary
- - location:
- - In the primary forest
+
+ - location:
+ - In the primary forest
  - files
- - WAV
- - 10.6 MB
- - Recording duration (s) : 60
+ - WAV
+ - 10.6 MB
+ - Recording duration (s) : 60
  - link
- - https://drive.google.com/open?id=1eu0P_PrTjgVhNVZK8xycYlDd5AnJ9WdN&usp=drive_copy
+ - https://drive.google.com/open?id=1eu0P_PrTjgVhNVZK8xycYlDd5AnJ9WdN&usp=drive_copy
 
  ## inhaa-Be Audiomoth 1,2
- - location:
- - In a protected forest of Inhaa-Be
 
- - files
- - WAVE
- - each 5.5 MB
- - Sample rate (Hz) : 48000
- - Sleep duration (s) : 240
- - Recording duration (s) : 60
+ - location:
 
- - links
- - https://drive.google.com/open?id=1wUj3rxruAqInWgJzumFTYaetVbQ8YfW2&usp=drive_copy
- - https://drive.google.com/open?id=1raI3UcCUWKg49tIE6L9LdfRlE557mlNp&usp=drive_copy
+ - In a protected forest of Inhaa-Be
 
+ - files
 
+ - WAVE
+ - each 5.5 MB
+ - Sample rate (Hz) : 48000
+ - Sleep duration (s) : 240
+ - Recording duration (s) : 60
 
+ - links
+ - https://drive.google.com/open?id=1wUj3rxruAqInWgJzumFTYaetVbQ8YfW2&usp=drive_copy
+ - https://drive.google.com/open?id=1raI3UcCUWKg49tIE6L9LdfRlE557mlNp&usp=drive_copy
 
  ## inn 2, 3
 
  - location:
- - inn 2: At the Garden of the Inn (human settlement)
- - inn 3: At the Inn, close to the River (human settlement)
+
+ - inn 2: At the Garden of the Inn (human settlement)
+ - inn 3: At the Inn, close to the River (human settlement)
 
  - files
- - WAV
- - each 5 MB
- - Sample rate (Hz) : 48000
- - Sleep duration (s) : 5
- - Recording duration (s) : 55
 
- - links
+ - WAV
+ - each 5 MB
+ - Sample rate (Hz) : 48000
+ - Sleep duration (s) : 5
+ - Recording duration (s) : 55
 
- - inn2: https://drive.google.com/open?id=18fy059Ypaq7kYnjZm4TNLoqkfmOHhaIE&usp=drive_copy
- - inn3: https://drive.google.com/open?id=1wUj3rxruAqInWgJzumFTYaetVbQ8YfW2&usp=drive_copy
+ - links
 
+ - inn2: https://drive.google.com/open?id=18fy059Ypaq7kYnjZm4TNLoqkfmOHhaIE&usp=drive_copy
+ - inn3: https://drive.google.com/open?id=1wUj3rxruAqInWgJzumFTYaetVbQ8YfW2&usp=drive_copy
 
  ## parque das Tribos
+
  - location:
- - Close to a road, at the outskirts of Manaus (human settlement)
+ - Close to a road, at the outskirts of Manaus (human settlement)
 
  ## landing - (includig ultrasonic)
+
  - location:
- - In the secondary forest / recently deforested and close to a flooded area
+ - In the secondary forest / recently deforested and close to a flooded area
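
The recording settings listed in docs/data-sets.md (sample rate, recording duration, sleep duration, file size) can be cross-checked against each other with a little arithmetic. The sketch below is only an illustration, not part of the commit. It assumes 16-bit mono PCM WAV files, ignores the WAV header, and reads "MB" as MiB; none of this is stated in the document. The function names and the `SETTINGS` table are hypothetical helpers introduced here.

```python
def wav_payload_bytes(sample_rate_hz, duration_s, bytes_per_sample=2, channels=1):
    """Size of the raw audio payload; 16-bit mono is an assumption, header ignored."""
    return sample_rate_hz * duration_s * bytes_per_sample * channels


def nyquist_khz(sample_rate_hz):
    """Highest frequency representable at a given sample rate (half the rate), in kHz."""
    return sample_rate_hz / 2 / 1000


def duty_cycle(recording_s, sleep_s):
    """Fraction of wall-clock time the sensor is actually recording."""
    return recording_s / (recording_s + sleep_s)


# (sample rate in Hz, recording duration in s, sleep duration in s),
# copied from the per-dataset sections of docs/data-sets.md.
SETTINGS = {
    "primary 1,2": (384000, 60, 240),
    "inhaa-Be Audiomoth 1,2": (48000, 60, 240),
    "inn 2, 3": (48000, 55, 5),
}


def summarize(settings):
    """Return {name: (size in MiB, Nyquist in kHz, duty cycle)} for each dataset."""
    rows = {}
    for name, (rate, rec, sleep) in settings.items():
        size_mib = wav_payload_bytes(rate, rec) / 2**20
        rows[name] = (
            round(size_mib, 1),
            nyquist_khz(rate),
            round(duty_cycle(rec, sleep), 3),
        )
    return rows


if __name__ == "__main__":
    for name, (size_mib, nyq, duty) in summarize(SETTINGS).items():
        print(f"{name}: ~{size_mib} MiB per file, up to {nyq} kHz, recording {duty:.1%} of the time")
```

Under these assumptions the results line up with the document: about 43.9 MiB per file for primary 1,2 (listed as 44 MB), about 5.5 MiB for inhaa-Be (listed as 5.5 MB), and about 5.0 MiB for inn 2, 3 (listed as 5 MB). The 384000 Hz sample rate also gives a 192 kHz upper limit, matching the "maximum frequency of 192Khz" mentioned for the primary forest recordings.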