The database was recorded in 2009 at the UPC smart-room with 5 calibrated cameras and 6 T-shaped 4-microphone clusters. The database includes two kinds of datasets: 8 recording sessions (S01-S08) of isolated AEs, where 6 different participants performed 10 times each AE (about 15 min each session), and a spontaneously generated dataset which consists of 9 scenes (T01-T09) about 5 minutes long with 2 participants that interact with each other in a natural way: discuss certain subject, drink coffee, speak on the mobile phone, etc. Although the interactive scenes were recorded according to a previously elaborated scenario, we call this type of recordings “spontaneous”, since the AEs were produced in a realistic seminar style with possible overlap with speech. Data has been manually annotated.

The approximate source positions of the acoustic events (AE) are shown in the figure, along with the positions of the 6 T-shaped 4-microphone arrays on the walls of the UPC smart-room. All audio signals were recorded at 44,1 kHz sampling frequency.

 

 

Table2. Number of annotated acoustic events in each session



















Event type

S01

S02

S03

S04

S05

S06

S07

S08

T01

T02

T03

T04

T05

T06

T07

T08

T09

TOTAL

Knock (door, table), <kn>

9

8

10

10

10

8

11

13

2

3

2

3

3

4

2

5

3

106

Door slam, <ds>

17

15

19

20

40

37

56

52

8

11

8

9

10

8

10

10

8

338

Steps, <st>

10

10

8

23

43

34

28

50

15

17

12

18

20

21

16

17

17

359

Chair moving, <cm>

19

37

32

22

23

38

34

40

17

21

15

20

22

24

15

23

26

428

Spoon (cup jingle), <cl>

10

11

13

11

10

15

11

15

5

3

8

4

4

6

2

11

5

144

Paper work (listing, wrapping), <pw>

9

11

10

8

17

12

12

12

7

6

9

18

10

18

17

25

36

237

Key jingle, <kj>

11

11

11

8

0

13

10

18

1

6

1

4

2

9

4

7

7

123

Keyboard typing, <kt>

10

10

13

12

10

13

10

11

8

9

6

9

8

12

10

11

8

170

Phone ringing/Music, <pr>

11

18

11

14

8

11

13

15

4

4

4

4

4

0

3

4

2

130

Applause, <ap>

9

5

9

11

12

9

14

14

1

0

1

1

1

1

2

1

1

92

Cough, <co>

10

10

12

13

9

13

11

12

7

3

2

1

4

2

1

3

1

114

Speech, <sp>

0

0

0

0

8

20

12

34

27

33

36

31

41

46

41

-

-

329

 

It is distributed through UPC, without cost. A license agreement must be signed by the receiving institution.

 

Scroll to Top