Task description
The task is to design a system that, given a short audio recording, returns a binary decision for the presence or absence of bird sound (of any kind).
An important goal of this task is generalisation to new conditions. To explore this we provide 3 separate development datasets and 3 evaluation datasets, each recorded under differing conditions. The datasets have different balances of positive/negative cases, different bird species, different background sounds, and different recording equipment.
A more detailed task description can be found on the task description page.
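The rankings below report the area under the ROC curve (AUC) together with a 95% confidence interval. As a rough illustration of how such an interval can be estimated, here is a minimal bootstrap sketch; the challenge's exact procedure may differ, and `y_true`/`y_score` are hypothetical per-file labels and scores.

```python
# Minimal sketch: AUC with a bootstrap 95% confidence interval.
# The challenge's exact CI procedure may differ from this illustration.
import numpy as np
from sklearn.metrics import roc_auc_score

def auc_with_ci(y_true, y_score, n_boot=1000, seed=0):
    rng = np.random.default_rng(seed)
    auc = roc_auc_score(y_true, y_score)
    boot = []
    for _ in range(n_boot):
        idx = rng.integers(0, len(y_true), len(y_true))  # resample with replacement
        if np.unique(y_true[idx]).size < 2:              # AUC needs both classes present
            continue
        boot.append(roc_auc_score(y_true[idx], y_score[idx]))
    lo, hi = np.percentile(boot, [2.5, 97.5])
    return auc, lo, hi
```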
Teams ranking
Table including only the best performing system per submitting team.
| Rank | Submission name | Technical Report | AUC with 95% confidence interval (Evaluation dataset) |
|---|---|---|---|
| 1 | Lasseck_MfN_1 | Lasseck_MfN | 89.0 (87.7 - 89.9) |
| 2 | bulbul_DCASE_1 | bulbul_DCASE | 88.5 (86.9 - 89.1) |
| 3 | SpeechLab_UKY_3 | SpeechLab_UKY | 83.9 (81.7 - 84.7) |
| 4 | JiananSong_BUPT_1 | JiananSong_BUPT | 82.1 (80.3 - 83.0) |
| 5 | Himawan_QUT_1 | Himawan_QUT | 81.7 (80.3 - 82.8) |
| 6 | Bai_NPU_1 | Bai_NPU | 81.5 (80.1 - 82.8) |
| 7 | Baseline_Surrey_1 | Baseline_Surrey | 80.9 (79.1 - 82.4) |
| 8 | Berger_JKU_1 | Berger_JKU | 80.8 (79.2 - 82.5) |
| 9 | Mukherjee_IITKgp_2 | Mukherjee_IITKgp | 80.7 (79.5 - 82.3) |
| 10 | Yu_LR_2 | Yu_LR | 80.6 (78.5 - 81.4) |
| 11 | Thakur_IITMANDI_1 | Thakur_IITMANDI | 79.2 (76.7 - 79.5) |
| 12 | Vesperini_A3Lab_1 | Vesperini_A3Lab | 78.8 (77.4 - 80.2) |
| 13 | Tao_IITLAB_2 | Tao_IITLAB | 75.4 (73.2 - 77.1) |
| 14 | skfl_DCASE_1 | skfl_DCASE | 73.4 (72.0 - 75.3) |
| 15 | smacpy_DCASE_1 | smacpy_DCASE | 51.7 (50.5 - 52.5) |
| 16 | Jamali_HUT_1 | Jamali_HUT | 48.9 (46.4 - 49.6) |
Prize winners
The two prize winners receive £250 in recognition of their contribution.
1: Highest-scoring open-source/reproducible method award
- Winner: Liaqat et al (University of Kentucky, USA) - This student team re-implemented the "bulbul" system (last year's winner) and then evaluated various ideas for improving it. Although the individual modifications did not improve the score, an ensemble of the resulting systems led to an improved final score. The tech report discusses the techniques tried, including a domain adaptation method and signal enhancement.
2: Judges' award for the method considered by the judges to be the most interesting or innovative.
- Winner: Vesperini et al (Università Politecnica delle Marche, Italy) - The authors use "capsule networks", a new idea for routing between modules in neural networks. The paper gives a clear introduction to the concept, and it's encouraging that this rather new idea gets respectable performance on the challenge data (78.8%).
Special mention: Berger et al (Johannes Kepler University, Austria) - The authors use a bulbul-like model, and they describe an interesting domain-adaptation technique, which gives them approximately a 1% boost over their base model.
Systems ranking
Table including all systems officially submitted (up to 4 per team).
| Submission name | Technical Report | AUC with 95% confidence interval (Evaluation datasets) |
|---|---|---|
| Lasseck_MfN_1 | Lasseck_MfN | 89.0 (87.7 - 89.9) |
| bulbul_DCASE_1 | bulbul_DCASE | 88.5 (86.9 - 89.1) |
| SpeechLab_UKY_1 | SpeechLab_UKY | 82.5 (81.0 - 83.5) |
| JiananSong_BUPT_1 | JiananSong_BUPT | 82.1 (80.3 - 83.0) |
| Himawan_QUT_1 | Himawan_QUT | 81.7 (80.3 - 82.8) |
| Bai_NPU_1 | Bai_NPU | 81.5 (80.1 - 82.8) |
| Baseline_Surrey_1 | Baseline_Surrey | 80.9 (79.1 - 82.4) |
| Berger_JKU_1 | Berger_JKU | 80.8 (79.2 - 82.5) |
| Yu_LR_1 | Yu_LR | 80.5 (78.6 - 81.5) |
| Mukherjee_IITKgp_1 | Mukherjee_IITKgp | 80.4 (79.0 - 82.0) |
| Thakur_IITMANDI_1 | Thakur_IITMANDI | 79.2 (76.7 - 79.5) |
| Vesperini_A3Lab_1 | Vesperini_A3Lab | 78.8 (77.4 - 80.2) |
| Tao_IITLAB_1 | Tao_IITLAB | 74.9 (73.4 - 76.7) |
| skfl_DCASE_1 | skfl_DCASE | 73.4 (72.0 - 75.3) |
| smacpy_DCASE_1 | smacpy_DCASE | 51.7 (50.5 - 52.5) |
| Jamali_HUT_1 | Jamali_HUT | 48.9 (46.4 - 49.6) |
| SpeechLab_UKY_2 | SpeechLab_UKY | 82.7 (79.8 - 83.6) |
| Himawan_QUT_2 | Himawan_QUT | 81.3 (80.0 - 82.7) |
| Bai_NPU_2 | Bai_NPU | 80.9 (79.5 - 82.2) |
| Mukherjee_IITKgp_2 | Mukherjee_IITKgp | 80.7 (79.5 - 82.3) |
| Yu_LR_2 | Yu_LR | 80.6 (78.5 - 81.4) |
| Vesperini_A3Lab_2 | Vesperini_A3Lab | 75.9 (73.0 - 78.0) |
| Tao_IITLAB_2 | Tao_IITLAB | 75.4 (73.2 - 77.1) |
| Thakur_IITMANDI_2 | Thakur_IITMANDI | 75.4 (72.1 - 77.6) |
| Baseline_Surrey_2 | Baseline_Surrey | 74.8 (72.8 - 76.3) |
| Berger_JKU_2 | Berger_JKU | 70.8 (68.2 - 71.8) |
| JiananSong_BUPT_2 | JiananSong_BUPT | 51.5 (49.2 - 52.6) |
| SpeechLab_UKY_3 | SpeechLab_UKY | 83.9 (81.7 - 84.7) |
| Bai_NPU_3 | Bai_NPU | 81.5 (80.1 - 82.8) |
| Himawan_QUT_3 | Himawan_QUT | 80.6 (78.7 - 81.5) |
| Yu_LR_3 | Yu_LR | 80.0 (77.7 - 80.6) |
| Tao_IITLAB_3 | Tao_IITLAB | 74.1 (72.3 - 76.0) |
| Thakur_IITMANDI_3 | Thakur_IITMANDI | 72.9 (70.0 - 74.1) |
| SpeechLab_UKY_4 | SpeechLab_UKY | 83.6 (81.4 - 84.6) |
| Bai_NPU_4 | Bai_NPU | 81.4 (80.0 - 82.7) |
| Himawan_QUT_4 | Himawan_QUT | 78.4 (76.8 - 79.9) |
| Thakur_IITMANDI_4 | Thakur_IITMANDI | 77.7 (76.2 - 79.7) |
Technical reports
CIAIC-BAD SYSTEM FOR DCASE2018 CHALLENGE TASK 3
Bai, Jisheng and Wu, Ru and Wang, Mou and Li, Dexin and Li, Di and Han, Xueyu and Wang, Qian and Liu, Qing and Wang, Bolun and Fu, Zhonghua
Northwestern Polytechnical University, Xi'an, China
Abstract
In this technical report, we present our system for Task 3 of the Detection and Classification of Acoustic Scenes and Events 2018 (DCASE2018) challenge, i.e. bird audio detection (BAD). First, log mel-spectrograms and mel-frequency cepstral coefficients (MFCC) are extracted as features. In order to improve the quality of the original audio, some denoising methods are adopted, for example adaptive denoising in Adobe Audition. Then, a convolutional recurrent neural network (CRNN) with a customized activation function is used for detection. Finally, we use the aforementioned features as inputs to train our CRNN model and fuse three subsystems to further improve performance. We evaluate the proposed systems on the dataset with the area under the ROC curve (AUC) measure, and our best AUC score on the leaderboard dataset is 85.67.
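As a rough sketch of the front end described in this abstract, the two feature types can be extracted with librosa as below; the window, hop, and band settings are illustrative assumptions, not the authors' configuration.

```python
# Sketch: log mel-spectrogram and MFCC extraction with librosa.
# All parameter values here are illustrative, not the authors' settings.
import librosa

def extract_features(path, sr=44100, n_mels=80, n_mfcc=20):
    y, _ = librosa.load(path, sr=sr, mono=True)
    mel = librosa.feature.melspectrogram(y=y, sr=sr, n_fft=2048,
                                         hop_length=512, n_mels=n_mels)
    log_mel = librosa.power_to_db(mel)                     # log mel-spectrogram
    mfcc = librosa.feature.mfcc(S=log_mel, n_mfcc=n_mfcc)  # MFCCs from the same mel bands
    return log_mel, mfcc
```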
Bird Audio Detection - DCASE 2018
Franz Berger and William Freillinger and Paul Primus and Wolfgang Reisinger
Johannes Kepler University, Linz
Abstract
In this paper we explore three approaches to bird audio detection. We establish a simple baseline, experiment with handcrafted features, and finally move to Convolutional Neural Networks.
3D CONVOLUTIONAL RECURRENT NEURAL NETWORKS FOR BIRD SOUND DETECTION
Ivan Himawan and Michael Towsey and Paul Roe
Queensland University of Technology
Abstract
With the increasing use of high-quality acoustic devices to monitor wildlife populations, it has become imperative to develop techniques for analyzing animals' calls automatically. Bird sound detection is one example of a long-term monitoring project where data are collected over continuous periods, often covering multiple sites at the same time. Inspired by the success of deep learning approaches in various audio classification tasks, this paper first reviews previous work exploiting deep learning for bird audio detection, and then proposes a novel 3-dimensional (3D) convolutional recurrent neural network. We employ 3D convolutions for extracting spatial and temporal information simultaneously. In order to leverage the powerful and compact features of 3D convolution, we employ separate RNNs, acting on each filter of the last convolutional layer rather than stacking the feature maps as in typical combined CNN and RNN architectures.
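To make the idea of separate RNNs per convolutional filter concrete, here is a minimal PyTorch sketch under assumed input shapes; all sizes (filters, hidden units, 40 mel bands, collapsing the context axis by averaging) are illustrative simplifications, not the authors' architecture.

```python
# Sketch (PyTorch): a 3D convolution followed by one GRU per feature map.
# Shapes and sizes are illustrative, not the authors' exact design.
import torch
import torch.nn as nn

class Conv3dPerFilterRNN(nn.Module):
    def __init__(self, n_filters=8, rnn_hidden=32):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv3d(1, n_filters, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.MaxPool3d((1, 2, 2)),
        )
        # one independent GRU per convolutional filter
        self.rnns = nn.ModuleList(
            nn.GRU(input_size=20, hidden_size=rnn_hidden, batch_first=True)
            for _ in range(n_filters))
        self.out = nn.Linear(n_filters * rnn_hidden, 1)

    def forward(self, x):            # x: (batch, 1, context, time, freq=40)
        z = self.conv(x)             # (batch, n_filters, context, time, 20)
        z = z.mean(dim=2)            # collapse context axis (a simplification)
        feats = []
        for c, rnn in enumerate(self.rnns):
            _, h = rnn(z[:, c])      # run this filter's GRU over time
            feats.append(h[-1])      # final hidden state: (batch, rnn_hidden)
        return torch.sigmoid(self.out(torch.cat(feats, dim=1)))
```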
Bird Audio Detection using Supervised Weighted NMF
Soroush Jamali and Juan Ahmadpanah and Ghasem Alipoor
Hamedan University of Technology
Abstract
This paper reports on the results of our bird audio detection system, developed for Task 3 of the DCASE 2018 challenge, which is defined as a binary classification problem. Our proposed method is based on supervised non-negative matrix factorization (NMF) of the constant-Q transform (CQT) spectrogram. Two dictionaries are trained over the training data available for the bird and environment classes. Test samples are then linearly decomposed using a combined dictionary, generated by concatenating these two dictionaries. Classification is performed based on the energy of the activations relevant to each class. To further improve classification performance, we propose to weight each activation coefficient according to the contribution of its corresponding basis in constructing each class. A scheme is proposed to extract these contribution weights from the activation coefficients of the training data. The developed system, evaluated on the development dataset of the challenge, achieves up to 80% accuracy.
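A minimal sketch of the dictionary-based decision might look as follows in scikit-learn; it omits the authors' activation weighting, works on generic nonnegative spectrogram frames rather than CQT, and the dictionary size `K` is an illustrative assumption.

```python
# Sketch: supervised NMF classification in the spirit of Jamali et al.
# (without their activation weighting); sizes are illustrative.
import numpy as np
from sklearn.decomposition import NMF, non_negative_factorization

K = 40  # atoms per class (illustrative)

def train_dictionary(frames):            # frames: (n_frames, n_freq), nonnegative
    return NMF(n_components=K, max_iter=400).fit(frames).components_

def classify(frames, H_bird, H_env):
    H = np.vstack([H_bird, H_env])       # combined dictionary
    W0 = np.full((frames.shape[0], H.shape[0]), 1e-2)
    W, _, _ = non_negative_factorization(
        frames, W=W0, H=H, n_components=H.shape[0],
        init="custom", update_H=False)   # decompose with the dictionary held fixed
    e_bird = np.sum(W[:, :K] ** 2)       # activation energy per class
    e_env = np.sum(W[:, K:] ** 2)
    return e_bird > e_env                # True -> "bird present"
```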
Bird Audio Detection using Convolutional Neural Networks and Binary Neural Networks
Jianan Song and Shengchen Li
Beijing University of Posts and Telecommunications
Abstract
For the bird audio detection task of the IEEE AASP Challenge on Detection and Classification of Acoustic Scenes and Events (DCASE2018), we propose an audio classification method for bird species identification using Convolutional Neural Networks (CNNs) and Binarized Neural Networks (BNNs). Although deep learning networks are currently popular in bird audio detection [1], the complex network structure makes it difficult to design the hardware of the detection system. Therefore, after the design of the CNNs, the convolutional layers and the fully connected layers are binarized on the basis of the original network, and both network structures are tested. Finally, the Area Under ROC Curve (AUC) score is used as the evaluation index. The preview scores using CNNs and BNNs are 88.75% and 68.60%, respectively.
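The core mechanism of a binarized layer, constraining weights to +/-1 in the forward pass while letting gradients flow through a straight-through estimator, can be sketched in PyTorch as follows; this is a generic illustration of the technique, not the authors' implementation.

```python
# Sketch (PyTorch): weight binarization with a straight-through estimator,
# the basic building block of binarized neural networks (generic illustration).
import torch

class BinarizeSTE(torch.autograd.Function):
    @staticmethod
    def forward(ctx, w):
        ctx.save_for_backward(w)
        return torch.where(w >= 0, torch.ones_like(w), -torch.ones_like(w))

    @staticmethod
    def backward(ctx, grad_out):
        (w,) = ctx.saved_tensors
        return grad_out * (w.abs() <= 1).float()  # pass gradients only where |w| <= 1

binarize = BinarizeSTE.apply  # apply to layer weights before a conv/linear op
```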
DCASE 2018 Challenge Surrey Cross-Task convolutional neural network baseline
Qiuqiang Kong, Turab Iqbal, Yong Xu, Wenwu Wang and Mark D. Plumbley
Centre for Vision, Speech and Signal Processing (CVSSP), University of Surrey, Guildford, UK
Abstract
The Detection and Classification of Acoustic Scenes and Events (DCASE) 2018 challenge is a well-known IEEE AASP challenge consisting of several audio classification and sound event detection tasks. The DCASE 2018 challenge includes five tasks: 1) Acoustic scene classification, 2) Audio tagging of Freesound, 3) Bird audio detection, 4) Weakly-labeled semi-supervised sound event detection and 5) Multi-channel audio tagging. In this paper we open-source the Python code for all of Tasks 1 - 5 of the DCASE 2018 challenge. The baseline source code contains implementations of convolutional neural networks (CNNs), including the AlexNetish and the VGGish networks from the image processing area. We investigated how performance varies from task to task when the configuration of the neural networks is kept the same. The experiments show that the deeper VGGish network performs better than AlexNetish on Tasks 2 - 5, except on Task 1 where the VGGish and AlexNetish networks perform similarly. With the VGGish network, we achieve an accuracy of 0.680 on Task 1, a mean average precision (mAP) of 0.928 on Task 2, an area under the curve (AUC) of 0.854 on Task 3, a sound event detection F1 score of 20.8% on Task 4 and an F1 score of 87.75% on Task 5.
System characteristics
| Input | mono |
|---|---|
| Sampling rate | 44.1 kHz |
| Features | log-mel energies |
| Classifier | VGGish: 8-layer CNN with global max pooling; AlexNetish: 4-layer CNN with global max pooling |
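The authors' released baseline code is the authoritative reference; as a rough PyTorch sketch of the table's VGGish variant (8 convolutional layers followed by global max pooling), with illustrative layer widths:

```python
# Sketch (PyTorch): a VGGish-style CNN with global max pooling, loosely
# matching the baseline's description; layer widths are illustrative guesses.
import torch
import torch.nn as nn

def conv_block(c_in, c_out):
    return nn.Sequential(
        nn.Conv2d(c_in, c_out, 3, padding=1), nn.BatchNorm2d(c_out), nn.ReLU(),
        nn.Conv2d(c_out, c_out, 3, padding=1), nn.BatchNorm2d(c_out), nn.ReLU(),
        nn.MaxPool2d(2))

class VGGishBaseline(nn.Module):
    def __init__(self):
        super().__init__()
        self.features = nn.Sequential(          # 8 conv layers in 4 blocks
            conv_block(1, 64), conv_block(64, 128),
            conv_block(128, 256), conv_block(256, 512))
        self.fc = nn.Linear(512, 1)

    def forward(self, x):                       # x: (batch, 1, time, mel)
        z = self.features(x)
        z = torch.amax(z, dim=(2, 3))           # global max pooling
        return torch.sigmoid(self.fc(z))
```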
ACOUSTIC BIRD DETECTION WITH DEEP CONVOLUTIONAL NEURAL NETWORKS
Mario Lasseck
Museum fuer Naturkunde, Berlin
Abstract
This paper presents deep learning techniques for acoustic bird detection. Deep Convolutional Neural Networks (DCNNs), originally designed for image classification, are adapted and fine-tuned to detect the presence of birds in audio recordings. Various data augmentation techniques are applied to increase model performance and improve generalization to unknown recording conditions and new habitats. The proposed approach is evaluated in the Bird Audio Detection task which is part of the IEEE AASP Challenge on Detection and Classification of Acoustic Scenes and Events (DCASE) 2018. It provides the best system for the task and surpasses previous state-of-the-art, achieving an area under the curve (AUC) above 95% on the public challenge leaderboard [1].
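The report details the augmentation pipeline; as a flavour of spectrogram-level augmentation of this general kind, here are two simple NumPy examples (illustrative operations, not Lasseck's exact ones):

```python
# Sketch: two spectrogram-level augmentations of the general kind described
# in the report; parameters are illustrative.
import numpy as np

def random_time_shift(spec, rng):
    """Cyclically shift the spectrogram along the time axis."""
    return np.roll(spec, rng.integers(spec.shape[1]), axis=1)

def mix_background(spec, noise_spec, rng, max_gain=0.5):
    """Blend in a noise spectrogram taken from another recording."""
    g = rng.uniform(0.0, max_gain)
    t = min(spec.shape[1], noise_spec.shape[1])
    out = spec.copy()
    out[:, :t] = (1 - g) * spec[:, :t] + g * noise_spec[:, :t]
    return out

rng = np.random.default_rng(0)
```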
CONVOLUTIONAL RECURRENT NEURAL NETWORK BASED BIRD AUDIO DETECTION
Rajdeep Mukherjee and Dipyaman Banerjee and Kuntal Dey and Niloy Ganguly
Indian Institute of Technology, Kharagpur and IBM Research, New Delhi
Abstract
We propose a Convolutional Recurrent Neural Network (CRNN) based approach, implemented as a Convolutional Neural Network (CNN) followed by a Recurrent Neural Network (RNN), for the task of detecting the presence of birds in audio recordings. As part of the IEEE DCASE 2018 Challenge, we were provided with three separate development datasets containing recordings from three very different bird sound monitoring projects. We performed a stratified 3-way cross-validation for training our model, considering two datasets for training and the remaining one for validation in each fold, in order to generalize our model well when exposed to data from unseen conditions. We obtained an Area Under Curve (AUC) measure of 88.7% on the leaderboard test set. We compare our results with the CNN version of our model, which achieves an AUC measure of 87.74% on the same test set.
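The leave-one-dataset-out scheme described above can be summarized in a few lines; the dataset names below follow the challenge's development sets, and `train_model`/`evaluate` are placeholders standing in for the authors' CRNN pipeline.

```python
# Sketch: leave-one-dataset-out cross-validation across the three
# development sets; train_model/evaluate are placeholder stubs.
def train_model(train_data):   # placeholder for the CRNN training pipeline
    return None

def evaluate(model, data):     # placeholder for AUC evaluation
    return 0.0

datasets = {"freefield1010": [], "warblrb10k": [], "BirdVox-DCASE-20k": []}

for held_out in datasets:
    train_data = [v for k, v in datasets.items() if k != held_out]
    model = train_model(train_data)             # fit on two datasets
    auc = evaluate(model, datasets[held_out])   # validate on the unseen third
    print(f"held out {held_out}: AUC={auc:.3f}")
```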
DOMAIN TUNING METHODS FOR BIRD AUDIO DETECTION
Sidrah Liaqat and Narjes Bozorg and Neenu Jose and Patrick Conrey and Antony Tamasi and Michael T. Johnson
University of Kentucky Speech and Signal Processing Lab
Abstract
This paper presents several feature extraction and normalization methods implemented for the DCASE 2018 Bird Audio Detection challenge, a binary audio classification task to identify whether a ten-second audio segment from a specified dataset contains one or more bird vocalizations. Our baseline system is adapted from the Convolutional Neural Network system of last year's challenge winner, bulbul [1]. We introduce one feature modification, an increase in the temporal resolution of the Mel-spectrogram feature matrix, tailored to the fast-changing temporal structure of many song-bird vocalizations. Additionally, we introduce two feature normalization approaches: a front-end signal enhancement method to reduce differences in dataset noise characteristics, and an explicit domain adaptation method based on covariance normalization. Overall results show that none of these approaches gave significant improvements for either a within-dataset training/testing paradigm or a cross-dataset training/testing paradigm.
Awards: Highest-scoring open source / reproducible method
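Covariance-based domain adaptation of the general kind mentioned in the abstract can be sketched generically in the CORAL style; this illustrates the technique, not necessarily the authors' exact variant.

```python
# Sketch: CORAL-style covariance normalization for domain adaptation
# (generic illustration, not necessarily the authors' exact method).
import numpy as np
from scipy.linalg import sqrtm

def coral(source, target, eps=1e-5):
    """Align source feature covariance to the target domain.
    source, target: (n_examples, n_features) arrays."""
    cs = np.cov(source, rowvar=False) + eps * np.eye(source.shape[1])
    ct = np.cov(target, rowvar=False) + eps * np.eye(target.shape[1])
    whiten = np.linalg.inv(sqrtm(cs).real)   # decorrelate source features
    recolor = sqrtm(ct).real                 # impose the target covariance
    return (source - source.mean(0)) @ whiten @ recolor + target.mean(0)
```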
BIRD AUDIO DETECTION FOR DCASE 2018 CHALLENGE TECHNICAL REPORT
Lianjie Tao and Xinxing Chen
Chongqing University
Abstract
The 2018 Bird Audio Detection (BAD) challenge [1] requires determining whether bird sound is present in 10-second audio clips. The organizers provided three development datasets for training our neural network, and three evaluation datasets for evaluating it. The goal of the challenge is to maximize the accuracy of bird audio detection.
LEARNED AGGREGATION IN CNN: ALL-CONV NET FOR BIRD ACTIVITY DETECTION
Anshul Thakur and Arjun Pankajakshan and Padmanabhan Rajan
Indian Institute of Technology Mandi
Abstract
Task 3 of DCASE 2018, i.e. bird activity detection (BAD), deals with identifying the presence or absence of bird vocalizations in a given audio recording. In this submission, we utilize an all-convolutional neural network (all-conv net) for BAD. The network is characterized by the use of convolutional operations to implement aggregation/pooling and dense layers. The aggregation operation implemented by convolution helps in capturing the inter-feature-map correlations which are ignored in traditional max/average pooling. This helps in learning a function which aggregates the complementary information in various feature maps, leading to better bird activity detection. Building on the all-conv net, we utilize four different derivative systems which provide good validation and preview scores.
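The two substitutions that make a network "all-convolutional" (strided convolutions in place of pooling, and a 1x1 convolution in place of a dense layer) can be sketched in PyTorch as follows; layer sizes are illustrative and this is not the authors' exact architecture.

```python
# Sketch (PyTorch): learned aggregation via strided convolutions and a
# 1x1 convolution replacing the dense layer; sizes are illustrative.
import torch
import torch.nn as nn

class AllConvBAD(nn.Module):
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(),
            nn.Conv2d(16, 16, 3, stride=2, padding=1), nn.ReLU(),  # learned pooling
            nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(),
            nn.Conv2d(32, 32, 3, stride=2, padding=1), nn.ReLU(),  # learned pooling
            nn.Conv2d(32, 1, 1))                                   # 1x1 "dense" layer

    def forward(self, x):                          # x: (batch, 1, time, mel)
        z = self.net(x)                            # (batch, 1, t', m')
        return torch.sigmoid(z.mean(dim=(2, 3)))   # clip-level probability
```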
A CAPSULE NEURAL NETWORKS BASED APPROACH FOR BIRD AUDIO DETECTION
Fabio Vesperini and Leonardo Gabrielli and Emanuele Principi and Stefano Squartini
Università Politecnica delle Marche, Ancona
Abstract
We propose a system for bird audio detection based on the innovative CapsNet architecture. It is our contribution to the third task of the DCASE2018 Challenge. The task consists of binary detection of the presence/absence of bird sounds in audio files belonging to different datasets. Spectral acoustic features are extracted from the acoustic signals; subsequently, a deep neural network comprising capsule units is trained by means of supervised learning, using binary annotations of bird song activity as the target vector, in combination with the dynamic routing mechanism. This procedure aims to encourage the network to learn global coherence implicitly and to identify part-whole relationships between capsules, thereby improving generalization performance in detecting the presence of bird songs under various environmental conditions. We achieve a harmonic mean of the Area Under ROC Curve (AUC) scores equal to 85.08 in the cross-validation performed on the development dataset, while we obtain an AUC equal to 84.43 as the preview score on a subset of the unseen evaluation data.
Awards: Judges' award
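For readers new to capsules, the squashing nonlinearity and the routing-by-agreement update at the heart of the dynamic routing mechanism can be sketched as follows; this is a bare illustration of the mechanism, not the authors' model.

```python
# Sketch (PyTorch): the squash nonlinearity and routing-by-agreement update
# at the core of capsule networks (bare illustration).
import torch

def squash(s, dim=-1, eps=1e-8):
    """Scale vector length into [0, 1) while preserving its orientation."""
    n2 = (s ** 2).sum(dim=dim, keepdim=True)
    return (n2 / (1.0 + n2)) * s / torch.sqrt(n2 + eps)

def dynamic_routing(u_hat, n_iter=3):
    """u_hat: (batch, n_in, n_out, dim) prediction vectors from lower capsules."""
    b = torch.zeros(u_hat.shape[:3], device=u_hat.device)  # routing logits
    v = None
    for _ in range(n_iter):
        c = torch.softmax(b, dim=2)                        # coupling coefficients
        v = squash((c.unsqueeze(-1) * u_hat).sum(dim=1))   # output capsules: (batch, n_out, dim)
        b = b + (u_hat * v.unsqueeze(1)).sum(dim=-1)       # agreement update
    return v
```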
DCASE 2018 CHALLENGE TECHNICAL REPORT
Chenchen Yu and Yu Hao and Wenbo Yang and Bo Fu
AI Lab, Lenovo Research
Abstract
For the task of Bird Audio Detection in the DCASE Challenge 2018 [1], we present three approaches that all use convolutional neural networks on Mel-spectrograms. We obtained Area Under Curve (AUC) measures of 0.8610, 0.8548, and 0.8464 on the preview score, which is calculated using approximately 1000 files randomly selected from the Chernobyl and warblrb10k data.