The database was recorded in 2009 at the UPC smart-room with 5 calibrated cameras and 6 T-shaped 4-microphone clusters. The database includes two kinds of datasets: 8 recording sessions (S01-S08) of isolated AEs, where 6 different participants performed 10 times each AE (about 15 min each session), and a spontaneously generated dataset which consists of 9 scenes (T01-T09) about 5 minutes long with 2 participants that interact with each other in a natural way: discuss certain subject, drink coffee, speak on the mobile phone, etc. Although the interactive scenes were recorded according to a previously elaborated scenario, we call this type of recordings “spontaneous”, since the AEs were produced in a realistic seminar style with possible overlap with speech. Data has been manually annotated.
The approximate source positions of the acoustic events (AE) are shown in the figure, along with the positions of the 6 T-shaped 4-microphone arrays on the walls of the UPC smart-room. All audio signals were recorded at 44,1 kHz sampling frequency.
Table2. Number of annotated acoustic events in each session
Event type |
S01 |
S02 |
S03 |
S04 |
S05 |
S06 |
S07 |
S08 |
T01 |
T02 |
T03 |
T04 |
T05 |
T06 |
T07 |
T08 |
T09 |
TOTAL |
Knock (door, table), <kn> |
9 |
8 |
10 |
10 |
10 |
8 |
11 |
13 |
2 |
3 |
2 |
3 |
3 |
4 |
2 |
5 |
3 |
106 |
Door slam, <ds> |
17 |
15 |
19 |
20 |
40 |
37 |
56 |
52 |
8 |
11 |
8 |
9 |
10 |
8 |
10 |
10 |
8 |
338 |
Steps, <st> |
10 |
10 |
8 |
23 |
43 |
34 |
28 |
50 |
15 |
17 |
12 |
18 |
20 |
21 |
16 |
17 |
17 |
359 |
Chair moving, <cm> |
19 |
37 |
32 |
22 |
23 |
38 |
34 |
40 |
17 |
21 |
15 |
20 |
22 |
24 |
15 |
23 |
26 |
428 |
Spoon (cup jingle), <cl> |
10 |
11 |
13 |
11 |
10 |
15 |
11 |
15 |
5 |
3 |
8 |
4 |
4 |
6 |
2 |
11 |
5 |
144 |
Paper work (listing, wrapping), <pw> |
9 |
11 |
10 |
8 |
17 |
12 |
12 |
12 |
7 |
6 |
9 |
18 |
10 |
18 |
17 |
25 |
36 |
237 |
Key jingle, <kj> |
11 |
11 |
11 |
8 |
0 |
13 |
10 |
18 |
1 |
6 |
1 |
4 |
2 |
9 |
4 |
7 |
7 |
123 |
Keyboard typing, <kt> |
10 |
10 |
13 |
12 |
10 |
13 |
10 |
11 |
8 |
9 |
6 |
9 |
8 |
12 |
10 |
11 |
8 |
170 |
Phone ringing/Music, <pr> |
11 |
18 |
11 |
14 |
8 |
11 |
13 |
15 |
4 |
4 |
4 |
4 |
4 |
0 |
3 |
4 |
2 |
130 |
Applause, <ap> |
9 |
5 |
9 |
11 |
12 |
9 |
14 |
14 |
1 |
0 |
1 |
1 |
1 |
1 |
2 |
1 |
1 |
92 |
Cough, <co> |
10 |
10 |
12 |
13 |
9 |
13 |
11 |
12 |
7 |
3 |
2 |
1 |
4 |
2 |
1 |
3 |
1 |
114 |
Speech, <sp> |
0 |
0 |
0 |
0 |
8 |
20 |
12 |
34 |
27 |
33 |
36 |
31 |
41 |
46 |
41 |
- |
- |
329 |
It is distributed through UPC, without cost. A license agreement must be signed by the receiving institution.
Copyright © 2017 - Designed by Madstudio