Task description
The goal of the acoustic scene classification task was to classify test recordings into one of 15 predefined classes characterizing the environment in which they were recorded, for example "park", "home", or "office".
A more detailed description can be found on the task description page.
Challenge results
Here you can find complete information on the submissions for Task 1: results on the evaluation and development sets (when reported by the authors), class-wise results, technical reports, and BibTeX citations.
Systems ranking
Rank | Submission code | Submission name | Technical Report | Accuracy (Evaluation dataset) | Accuracy (Development dataset)
---|---|---|---|---|---
Aggarwal_task1_1 | Vij2016 | 74.4 | 74.1 | ||
Bae_task1_1 | CLC | Bae2016 | 84.1 | 79.2 | |
Bao_task1_1 | Bao2016 | 83.1 | |||
Battaglino_task1_1 | Battaglino2016 | 80.0 | |||
Bisot_task1_1 | Bisot2016 | 87.7 | 86.2 | ||
DCASE2016 baseline | DCASE2016_baseline | Heittola2016 | 77.2 | 72.5 | |
Duong_task1_1 | Tec_SVM_A | Sena_Mafra2016 | 76.4 | 80.0 | |
Duong_task1_2 | Tec_SVM_V | Sena_Mafra2016 | 80.5 | 78.0 | |
Duong_task1_3 | Tec_MLP | Sena_Mafra2016 | 73.1 | 75.0 | |
Duong_task1_4 | Tec_CNN | Sena_Mafra2016 | 62.8 | 59.0 | |
Eghbal-Zadeh_task1_1 | CPJKU16_BMBI | Eghbal-Zadeh2016 | 86.4 | 80.8 | |
Eghbal-Zadeh_task1_2 | CPJKU16_CBMBI | Eghbal-Zadeh2016 | 88.7 | 83.9 | |
Eghbal-Zadeh_task1_3 | CPJKU16_DCNN | Eghbal-Zadeh2016 | 83.3 | 79.5 | |
Eghbal-Zadeh_task1_4 | CPJKU16_LFCBI | Eghbal-Zadeh2016 | 89.7 | 89.9 | |
Foleiss_task1_1 | JFTT | Foleiss2016 | 76.2 | 71.8 | |
Hertel_task1_1 | All-ConvNet | Hertel2016 | 79.5 | 84.5 | |
Kim_task1_1 | QRK | Yun2016 | 82.1 | 84.0 | |
Ko_task1_1 | KU_ISPL1_2016 | Park2016 | 87.2 | 76.3 | |
Ko_task1_2 | KU_ISPL2_2016 | Mun2016 | 82.3 | 72.7 | |
Kong_task1_1 | QK | Kong2016 | 81.0 | 76.4 | |
Kumar_task1_1 | Gauss | Elizalde2016 | 85.9 | 78.9 | |
Lee_task1_1 | MARGNet_MWFD | Han2016 | 84.6 | 83.1 | |
Lee_task1_2 | MARGNet_ZENS | Kim2016 | 85.4 | 81.6 | |
Liu_task1_1 | liu-re | Liu2016 | 83.8 | ||
Liu_task1_2 | liu-pre | Liu2016 | 83.6 | ||
Lostanlen_task1_1 | LostanlenAnden_2016 | Lostanlen2016 | 80.8 | 79.4 | |
Marchi_task1_1 | Marchi_2016 | Marchi2016 | 86.4 | 81.4 | |
Marques_task1_1 | DRKNN_2016 | Marques2016 | 83.1 | 78.2 | |
Moritz_task1_1 | Moritz2016 | 79.0 | 76.5 | ||
Mulimani_task1_1 | Mulimani2016 | 65.6 | 66.8 | ||
Nogueira_task1_1 | Nogueira2016 | 81.0 | |||
Patiyal_task1_1 | IITMandi_2016 | Patiyal2016 | 78.5 | 97.6 | |
Phan_task1_1 | CNN-LTE | Phan2016 | 83.3 | 81.2 | |
Pugachev_task1_1 | Pugachev2016 | 73.1 | 82.9 | ||
Qu_task1_1 | Dai2016 | 80.5 | |||
Qu_task1_2 | Dai2016 | 84.1 | |||
Qu_task1_3 | Dai2016 | 82.3 | |||
Qu_task1_4 | Dai2016 | 80.5 | |||
Rakotomamonjy_task1_1 | RAK_2016_1 | Rakotomamonjy2016 | 82.1 | 81.2 | |
Rakotomamonjy_task1_2 | RAK_2016_2 | Rakotomamonjy2016 | 79.2 | ||
Santoso_task1_1 | SWW | Santoso2016 | 80.8 | 78.8 | |
Schindler_task1_1 | CQTCNN_1 | Lidy2016 | 81.8 | 80.8 | |
Schindler_task1_2 | CQTCNN_2 | Lidy2016 | 83.3 | ||
Takahashi_task1_1 | UTNII_2016 | Takahashi2016 | 85.6 | 77.5 | |
Valenti_task1_1 | Valenti2016 | 86.2 | 79.0 | ||
Vikaskumar_task1_1 | ABSP_IITKGP_2016 | Vikaskumar2016 | 81.3 | 80.4 | |
Vu_task1_1 | Vu2016 | 80.0 | 82.1 | ||
Xu_task1_1 | HL-DNN-ASC_2016 | Xu2016 | 73.3 | 81.4 | |
Zoehrer_task1_1 | Zoehrer2016 | 73.1 |
Teams ranking
This table includes only the best-performing system from each submitting team.
Rank | Submission code | Submission name | Technical Report | Accuracy (Evaluation dataset) | Accuracy (Development dataset)
---|---|---|---|---|---
Aggarwal_task1_1 | Vij2016 | 74.4 | 74.1 | ||
Bae_task1_1 | CLC | Bae2016 | 84.1 | 79.2 | |
Bao_task1_1 | Bao2016 | 83.1 | |||
Battaglino_task1_1 | Battaglino2016 | 80.0 | |||
Bisot_task1_1 | Bisot2016 | 87.7 | 86.2 | ||
DCASE2016 baseline | DCASE2016_baseline | Heittola2016 | 77.2 | 72.5 | |
Duong_task1_2 | Tec_SVM_V | Sena_Mafra2016 | 80.5 | 78.0 | |
Eghbal-Zadeh_task1_4 | CPJKU16_LFCBI | Eghbal-Zadeh2016 | 89.7 | 89.9 | |
Foleiss_task1_1 | JFTT | Foleiss2016 | 76.2 | 71.8 | |
Hertel_task1_1 | All-ConvNet | Hertel2016 | 79.5 | 84.5 | |
Kim_task1_1 | QRK | Yun2016 | 82.1 | 84.0 | |
Ko_task1_1 | KU_ISPL1_2016 | Park2016 | 87.2 | 76.3 | |
Ko_task1_2 | KU_ISPL2_2016 | Mun2016 | 82.3 | 72.7 | |
Kong_task1_1 | QK | Kong2016 | 81.0 | 76.4 | |
Kumar_task1_1 | Gauss | Elizalde2016 | 85.9 | 78.9 | |
Lee_task1_1 | MARGNet_MWFD | Han2016 | 84.6 | 83.1 | |
Lee_task1_2 | MARGNet_ZENS | Kim2016 | 85.4 | 81.6 | |
Liu_task1_1 | liu-re | Liu2016 | 83.8 | ||
Lostanlen_task1_1 | LostanlenAnden_2016 | Lostanlen2016 | 80.8 | 79.4 | |
Marchi_task1_1 | Marchi_2016 | Marchi2016 | 86.4 | 81.4 | |
Marques_task1_1 | DRKNN_2016 | Marques2016 | 83.1 | 78.2 | |
Moritz_task1_1 | Moritz2016 | 79.0 | 76.5 | ||
Mulimani_task1_1 | Mulimani2016 | 65.6 | 66.8 | ||
Nogueira_task1_1 | Nogueira2016 | 81.0 | |||
Patiyal_task1_1 | IITMandi_2016 | Patiyal2016 | 78.5 | 97.6 | |
Phan_task1_1 | CNN-LTE | Phan2016 | 83.3 | 81.2 | |
Pugachev_task1_1 | Pugachev2016 | 73.1 | 82.9 | ||
Qu_task1_2 | Dai2016 | 84.1 | |||
Rakotomamonjy_task1_1 | RAK_2016_1 | Rakotomamonjy2016 | 82.1 | 81.2 | |
Santoso_task1_1 | SWW | Santoso2016 | 80.8 | 78.8 | |
Schindler_task1_2 | CQTCNN_2 | Lidy2016 | 83.3 | ||
Takahashi_task1_1 | UTNII_2016 | Takahashi2016 | 85.6 | 77.5 | |
Valenti_task1_1 | Valenti2016 | 86.2 | 79.0 | ||
Vikaskumar_task1_1 | ABSP_IITKGP_2016 | Vikaskumar2016 | 81.3 | 80.4 | |
Vu_task1_1 | Vu2016 | 80.0 | 82.1 | ||
Xu_task1_1 | HL-DNN-ASC_2016 | Xu2016 | 73.3 | 81.4 | |
Zoehrer_task1_1 | Zoehrer2016 | 73.1 |
Class-wise performance
Rank | Submission code | Submission name | Technical Report | Accuracy (Evaluation dataset) | Beach | Bus | Cafe / Restaurant | Car | City center | Forest path | Grocery store | Home | Library | Metro station | Office | Park | Residential area | Train | Tram
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---
Aggarwal_task1_1 | Vij2016 | 74.4 | 80.8 | 84.6 | 69.2 | 88.5 | 80.8 | 84.6 | 84.6 | 92.3 | 38.5 | 96.2 | 92.3 | 65.4 | 42.3 | 34.6 | 80.8 | ||
Bae_task1_1 | CLC | Bae2016 | 84.1 | 84.6 | 100.0 | 61.5 | 88.5 | 92.3 | 100.0 | 96.2 | 88.5 | 46.2 | 88.5 | 100.0 | 96.2 | 65.4 | 53.8 | 100.0 | |
Bao_task1_1 | Bao2016 | 83.1 | 84.6 | 96.2 | 57.7 | 100.0 | 76.9 | 92.3 | 84.6 | 88.5 | 46.2 | 96.2 | 100.0 | 96.2 | 76.9 | 50.0 | 100.0 | ||
Battaglino_task1_1 | Battaglino2016 | 80.0 | 84.6 | 73.1 | 76.9 | 84.6 | 96.2 | 100.0 | 96.2 | 84.6 | 34.6 | 80.8 | 84.6 | 96.2 | 65.4 | 53.8 | 88.5 | ||
Bisot_task1_1 | Bisot2016 | 87.7 | 88.5 | 100.0 | 76.9 | 100.0 | 100.0 | 88.5 | 88.5 | 96.2 | 50.0 | 100.0 | 96.2 | 80.8 | 76.9 | 73.1 | 100.0 | ||
DCASE2016 baseline | DCASE2016_baseline | Heittola2016 | 77.2 | 84.6 | 88.5 | 69.2 | 96.2 | 80.8 | 65.4 | 88.5 | 92.3 | 26.9 | 100.0 | 96.2 | 53.8 | 88.5 | 30.8 | 96.2 | |
Duong_task1_1 | Tec_SVM_A | Sena_Mafra2016 | 76.4 | 88.5 | 100.0 | 69.2 | 88.5 | 84.6 | 100.0 | 96.2 | 38.5 | 46.2 | 80.8 | 100.0 | 61.5 | 34.6 | 57.7 | 100.0 | |
Duong_task1_2 | Tec_SVM_V | Sena_Mafra2016 | 80.5 | 80.8 | 100.0 | 84.6 | 92.3 | 92.3 | 100.0 | 96.2 | 57.7 | 46.2 | 96.2 | 100.0 | 50.0 | 53.8 | 57.7 | 100.0 | |
Duong_task1_3 | Tec_MLP | Sena_Mafra2016 | 73.1 | 73.1 | 92.3 | 50.0 | 84.6 | 88.5 | 100.0 | 80.8 | 34.6 | 26.9 | 92.3 | 100.0 | 84.6 | 46.2 | 50.0 | 92.3 | |
Duong_task1_4 | Tec_CNN | Sena_Mafra2016 | 62.8 | 80.8 | 88.5 | 53.8 | 80.8 | 69.2 | 96.2 | 76.9 | 50.0 | 15.4 | 46.2 | 92.3 | 42.3 | 34.6 | 19.2 | 96.2 | |
Eghbal-Zadeh_task1_1 | CPJKU16_BMBI | Eghbal-Zadeh2016 | 86.4 | 92.3 | 92.3 | 76.9 | 96.2 | 92.3 | 96.2 | 100.0 | 88.5 | 69.2 | 73.1 | 100.0 | 96.2 | 76.9 | 46.2 | 100.0 | |
Eghbal-Zadeh_task1_2 | CPJKU16_CBMBI | Eghbal-Zadeh2016 | 88.7 | 96.2 | 100.0 | 84.6 | 100.0 | 92.3 | 96.2 | 100.0 | 92.3 | 69.2 | 69.2 | 100.0 | 96.2 | 84.6 | 50.0 | 100.0 | |
Eghbal-Zadeh_task1_3 | CPJKU16_DCNN | Eghbal-Zadeh2016 | 83.3 | 92.3 | 96.2 | 42.3 | 88.5 | 84.6 | 100.0 | 100.0 | 100.0 | 53.8 | 100.0 | 96.2 | 46.2 | 80.8 | 69.2 | 100.0 | |
Eghbal-Zadeh_task1_4 | CPJKU16_LFCBI | Eghbal-Zadeh2016 | 89.7 | 96.2 | 100.0 | 61.5 | 96.2 | 96.2 | 96.2 | 100.0 | 96.2 | 69.2 | 100.0 | 96.2 | 88.5 | 88.5 | 61.5 | 100.0 | |
Foleiss_task1_1 | JFTT | Foleiss2016 | 76.2 | 84.6 | 84.6 | 61.5 | 80.8 | 96.2 | 84.6 | 96.2 | 88.5 | 46.2 | 57.7 | 84.6 | 65.4 | 42.3 | 80.8 | 88.5 | |
Hertel_task1_1 | All-ConvNet | Hertel2016 | 79.5 | 84.6 | 92.3 | 53.8 | 100.0 | 80.8 | 80.8 | 76.9 | 76.9 | 69.2 | 100.0 | 100.0 | 84.6 | 46.2 | 53.8 | 92.3 | |
Kim_task1_1 | QRK | Yun2016 | 82.1 | 76.9 | 100.0 | 76.9 | 100.0 | 84.6 | 100.0 | 88.5 | 100.0 | 0.0 | 92.3 | 96.2 | 76.9 | 69.2 | 69.2 | 100.0 | |
Ko_task1_1 | KU_ISPL1_2016 | Park2016 | 87.2 | 88.5 | 96.2 | 84.6 | 96.2 | 100.0 | 96.2 | 96.2 | 88.5 | 53.8 | 80.8 | 100.0 | 57.7 | 80.8 | 88.5 | 100.0 | |
Ko_task1_2 | KU_ISPL2_2016 | Mun2016 | 82.3 | 92.3 | 84.6 | 65.4 | 92.3 | 100.0 | 84.6 | 96.2 | 92.3 | 53.8 | 65.4 | 84.6 | 92.3 | 84.6 | 53.8 | 92.3 | |
Kong_task1_1 | QK | Kong2016 | 81.0 | 84.6 | 100.0 | 57.7 | 92.3 | 88.5 | 96.2 | 92.3 | 76.9 | 34.6 | 80.8 | 100.0 | 96.2 | 69.2 | 46.2 | 100.0 | |
Kumar_task1_1 | Gauss | Elizalde2016 | 85.9 | 84.6 | 92.3 | 73.1 | 88.5 | 92.3 | 96.2 | 96.2 | 92.3 | 50.0 | 96.2 | 96.2 | 80.8 | 88.5 | 73.1 | 88.5 | |
Lee_task1_1 | MARGNet_MWFD | Han2016 | 84.6 | 84.6 | 96.2 | 61.5 | 100.0 | 88.5 | 96.2 | 92.3 | 96.2 | 42.3 | 84.6 | 96.2 | 84.6 | 76.9 | 69.2 | 100.0 | |
Lee_task1_2 | MARGNet_ZENS | Kim2016 | 85.4 | 84.6 | 92.3 | 61.5 | 100.0 | 96.2 | 100.0 | 96.2 | 96.2 | 46.2 | 84.6 | 100.0 | 92.3 | 69.2 | 61.5 | 100.0 | |
Liu_task1_1 | liu-re | Liu2016 | 83.8 | 84.6 | 96.2 | 69.2 | 84.6 | 92.3 | 96.2 | 88.5 | 92.3 | 46.2 | 92.3 | 96.2 | 88.5 | 76.9 | 53.8 | 100.0 | |
Liu_task1_2 | liu-pre | Liu2016 | 83.6 | 88.5 | 92.3 | 69.2 | 84.6 | 96.2 | 92.3 | 92.3 | 88.5 | 46.2 | 88.5 | 96.2 | 92.3 | 76.9 | 50.0 | 100.0 | |
Lostanlen_task1_1 | LostanlenAnden_2016 | Lostanlen2016 | 80.8 | 80.8 | 92.3 | 50.0 | 96.2 | 84.6 | 96.2 | 84.6 | 80.8 | 65.4 | 96.2 | 100.0 | 65.4 | 69.2 | 53.8 | 96.2 | |
Marchi_task1_1 | Marchi_2016 | Marchi2016 | 86.4 | 88.5 | 92.3 | 80.8 | 100.0 | 96.2 | 100.0 | 100.0 | 76.9 | 50.0 | 96.2 | 100.0 | 92.3 | 84.6 | 42.3 | 96.2 | |
Marques_task1_1 | DRKNN_2016 | Marques2016 | 83.1 | 88.5 | 96.2 | 65.4 | 84.6 | 84.6 | 96.2 | 80.8 | 84.6 | 69.2 | 84.6 | 92.3 | 96.2 | 65.4 | 57.7 | 100.0 | |
Moritz_task1_1 | Moritz2016 | 79.0 | 88.5 | 100.0 | 19.2 | 100.0 | 92.3 | 100.0 | 88.5 | 92.3 | 38.5 | 80.8 | 100.0 | 61.5 | 76.9 | 46.2 | 100.0 | ||
Mulimani_task1_1 | Mulimani2016 | 65.6 | 73.1 | 96.2 | 69.2 | 100.0 | 73.1 | 50.0 | 65.4 | 76.9 | 7.7 | 76.9 | 96.2 | 96.2 | 23.1 | 15.4 | 65.4 | ||
Nogueira_task1_1 | Nogueira2016 | 81.0 | 88.5 | 88.5 | 65.4 | 92.3 | 73.1 | 96.2 | 84.6 | 92.3 | 38.5 | 96.2 | 100.0 | 73.1 | 80.8 | 53.8 | 92.3 | ||
Patiyal_task1_1 | IITMandi_2016 | Patiyal2016 | 78.5 | 84.6 | 96.2 | 61.5 | 92.3 | 92.3 | 92.3 | 80.8 | 92.3 | 34.6 | 96.2 | 96.2 | 92.3 | 69.2 | 11.5 | 84.6 | |
Phan_task1_1 | CNN-LTE | Phan2016 | 83.3 | 84.6 | 96.2 | 53.8 | 100.0 | 100.0 | 96.2 | 84.6 | 88.5 | 46.2 | 84.6 | 100.0 | 88.5 | 84.6 | 46.2 | 96.2 | |
Pugachev_task1_1 | Pugachev2016 | 73.1 | 84.6 | 69.2 | 61.5 | 92.3 | 80.8 | 96.2 | 92.3 | 80.8 | 26.9 | 96.2 | 88.5 | 57.7 | 42.3 | 34.6 | 92.3 | ||
Qu_task1_1 | Dai2016 | 80.5 | 84.6 | 100.0 | 73.1 | 88.5 | 96.2 | 84.6 | 100.0 | 88.5 | 23.1 | 76.9 | 96.2 | 73.1 | 76.9 | 46.2 | 100.0 | ||
Qu_task1_2 | Dai2016 | 84.1 | 88.5 | 100.0 | 80.8 | 92.3 | 96.2 | 84.6 | 100.0 | 88.5 | 42.3 | 76.9 | 96.2 | 76.9 | 80.8 | 57.7 | 100.0 | ||
Qu_task1_3 | Dai2016 | 82.3 | 88.5 | 100.0 | 76.9 | 92.3 | 96.2 | 84.6 | 92.3 | 88.5 | 30.8 | 88.5 | 96.2 | 76.9 | 76.9 | 46.2 | 100.0 | ||
Qu_task1_4 | Dai2016 | 80.5 | 80.8 | 100.0 | 84.6 | 88.5 | 92.3 | 84.6 | 92.3 | 92.3 | 42.3 | 76.9 | 96.2 | 76.9 | 76.9 | 23.1 | 100.0 | ||
Rakotomamonjy_task1_1 | RAK_2016_1 | Rakotomamonjy2016 | 82.1 | 80.8 | 96.2 | 46.2 | 92.3 | 84.6 | 100.0 | 96.2 | 88.5 | 42.3 | 80.8 | 96.2 | 88.5 | 73.1 | 65.4 | 100.0 | |
Rakotomamonjy_task1_2 | RAK_2016_2 | Rakotomamonjy2016 | 79.2 | 92.3 | 92.3 | 69.2 | 84.6 | 80.8 | 96.2 | 84.6 | 88.5 | 38.5 | 96.2 | 100.0 | 73.1 | 57.7 | 34.6 | 100.0 | |
Santoso_task1_1 | SWW | Santoso2016 | 80.8 | 84.6 | 84.6 | 61.5 | 96.2 | 84.6 | 100.0 | 80.8 | 100.0 | 42.3 | 92.3 | 100.0 | 80.8 | 65.4 | 42.3 | 96.2 | |
Schindler_task1_1 | CQTCNN_1 | Lidy2016 | 81.8 | 88.5 | 100.0 | 34.6 | 92.3 | 96.2 | 100.0 | 92.3 | 88.5 | 46.2 | 96.2 | 100.0 | 65.4 | 73.1 | 53.8 | 100.0 | |
Schindler_task1_2 | CQTCNN_2 | Lidy2016 | 83.3 | 88.5 | 100.0 | 34.6 | 92.3 | 96.2 | 100.0 | 92.3 | 92.3 | 46.2 | 96.2 | 100.0 | 65.4 | 76.9 | 69.2 | 100.0 | |
Takahashi_task1_1 | UTNII_2016 | Takahashi2016 | 85.6 | 92.3 | 100.0 | 61.5 | 100.0 | 88.5 | 88.5 | 96.2 | 84.6 | 57.7 | 80.8 | 100.0 | 92.3 | 80.8 | 61.5 | 100.0 | |
Valenti_task1_1 | Valenti2016 | 86.2 | 84.6 | 100.0 | 76.9 | 100.0 | 96.2 | 100.0 | 92.3 | 92.3 | 42.3 | 96.2 | 96.2 | 76.9 | 76.9 | 65.4 | 96.2 | ||
Vikaskumar_task1_1 | ABSP_IITKGP_2016 | Vikaskumar2016 | 81.3 | 84.6 | 92.3 | 61.5 | 100.0 | 84.6 | 84.6 | 80.8 | 88.5 | 65.4 | 92.3 | 69.2 | 80.8 | 73.1 | 73.1 | 88.5 | |
Vu_task1_1 | Vu2016 | 80.0 | 88.5 | 76.9 | 61.5 | 100.0 | 92.3 | 100.0 | 80.8 | 73.1 | 46.2 | 92.3 | 100.0 | 92.3 | 50.0 | 46.2 | 100.0 | ||
Xu_task1_1 | HL-DNN-ASC_2016 | Xu2016 | 73.3 | 84.6 | 96.2 | 23.1 | 96.2 | 84.6 | 100.0 | 84.6 | 69.2 | 23.1 | 57.7 | 100.0 | 73.1 | 69.2 | 38.5 | 100.0 | |
Zoehrer_task1_1 | Zoehrer2016 | 73.1 | 80.8 | 92.3 | 38.5 | 92.3 | 65.4 | 96.2 | 84.6 | 65.4 | 23.1 | 84.6 | 100.0 | 61.5 | 69.2 | 42.3 | 100.0 |
System characteristics
Rank | Code | Name | Technical Report | Accuracy (Eval) | Input | Features | Classifier
---|---|---|---|---|---|---|---
Aggarwal_task1_1 | Vij2016 | 74.4 | binaural | various | SVM | ||
Bae_task1_1 | CLC | Bae2016 | 84.1 | monophonic | spectrogram | CNN-RNN | |
Bao_task1_1 | Bao2016 | 83.1 | monophonic | MFCC+mel energy | fusion | ||
Battaglino_task1_1 | Battaglino2016 | 80.0 | binaural | mel energy | CNN | ||
Bisot_task1_1 | Bisot2016 | 87.7 | monophonic | spectrogram | NMF | ||
DCASE2016 baseline | DCASE2016_baseline | Heittola2016 | 77.2 | monophonic | MFCC | GMM | |
Duong_task1_1 | Tec_SVM_A | Sena_Mafra2016 | 76.4 | monophonic | mel energy | SVM | |
Duong_task1_2 | Tec_SVM_V | Sena_Mafra2016 | 80.5 | monophonic | mel energy | SVM | |
Duong_task1_3 | Tec_MLP | Sena_Mafra2016 | 73.1 | monophonic | mel energy | DNN | |
Duong_task1_4 | Tec_CNN | Sena_Mafra2016 | 62.8 | monophonic | mel energy | DNN | |
Eghbal-Zadeh_task1_1 | CPJKU16_BMBI | Eghbal-Zadeh2016 | 86.4 | binaural | MFCC | I-vector | |
Eghbal-Zadeh_task1_2 | CPJKU16_CBMBI | Eghbal-Zadeh2016 | 88.7 | binaural | MFCC | I-vector | |
Eghbal-Zadeh_task1_3 | CPJKU16_DCNN | Eghbal-Zadeh2016 | 83.3 | monophonic | spectrogram | CNN | |
Eghbal-Zadeh_task1_4 | CPJKU16_LFCBI | Eghbal-Zadeh2016 | 89.7 | mono+binaural | MFCC+spectrograms | fusion | |
Foleiss_task1_1 | JFTT | Foleiss2016 | 76.2 | monophonic | various | SVM | |
Hertel_task1_1 | All-ConvNet | Hertel2016 | 79.5 | left | spectrogram | CNN | |
Kim_task1_1 | QRK | Yun2016 | 82.1 | mono | MFCC | GMM | |
Ko_task1_1 | KU_ISPL1_2016 | Park2016 | 87.2 | mono | various | fusion | |
Ko_task1_2 | KU_ISPL2_2016 | Mun2016 | 82.3 | left+right+mono | various | DNN | |
Kong_task1_1 | QK | Kong2016 | 81.0 | mono | mel energy | DNN | |
Kumar_task1_1 | Gauss | Elizalde2016 | 85.9 | mono | MFCC distribution | SVM | |
Lee_task1_1 | MARGNet_MWFD | Han2016 | 84.6 | mono | mel energy | CNN | |
Lee_task1_2 | MARGNet_ZENS | Kim2016 | 85.4 | mono | unsupervised | CNN ensemble | |
Liu_task1_1 | liu-re | Liu2016 | 83.8 | mono | MFCC+mel energy | fusion | |
Liu_task1_2 | liu-pre | Liu2016 | 83.6 | mono | MFCC+mel energy | fusion | |
Lostanlen_task1_1 | LostanlenAnden_2016 | Lostanlen2016 | 80.8 | mixed | gammatone scattering | SVM | |
Marchi_task1_1 | Marchi_2016 | Marchi2016 | 86.4 | mono | various | fusion | |
Marques_task1_1 | DRKNN_2016 | Marques2016 | 83.1 | mono | MFCC | kNN | |
Moritz_task1_1 | Moritz2016 | 79.0 | left+right+mono | amplitude modulation filter bank | TDNN | ||
Mulimani_task1_1 | Mulimani2016 | 65.6 | mono | MFCC+matching pursuit | GMM | ||
Nogueira_task1_1 | Nogueira2016 | 81.0 | binaural | various | SVM | ||
Patiyal_task1_1 | IITMandi_2016 | Patiyal2016 | 78.5 | mono | MFCC | DNN | |
Phan_task1_1 | CNN-LTE | Phan2016 | 83.3 | mono | label tree embedding | CNN | |
Pugachev_task1_1 | Pugachev2016 | 73.1 | mono | MFCC | DNN | ||
Qu_task1_1 | Dai2016 | 80.5 | mono | various | ensemble | ||
Qu_task1_2 | Dai2016 | 84.1 | mono | various | ensemble | ||
Qu_task1_3 | Dai2016 | 82.3 | mono | various | ensemble | ||
Qu_task1_4 | Dai2016 | 80.5 | mono | various | ensemble | ||
Rakotomamonjy_task1_1 | RAK_2016_1 | Rakotomamonjy2016 | 82.1 | mono | various | SVM | |
Rakotomamonjy_task1_2 | RAK_2016_2 | Rakotomamonjy2016 | 79.2 | mono | various | SVM | |
Santoso_task1_1 | SWW | Santoso2016 | 80.8 | mono | MFCC | CNN | |
Schindler_task1_1 | CQTCNN_1 | Lidy2016 | 81.8 | mono | CQT | CNN | |
Schindler_task1_2 | CQTCNN_2 | Lidy2016 | 83.3 | mono | CQT | CNN | |
Takahashi_task1_1 | UTNII_2016 | Takahashi2016 | 85.6 | mono | MFCC | DNN-GMM | |
Valenti_task1_1 | Valenti2016 | 86.2 | mono | mel energy | CNN | ||
Vikaskumar_task1_1 | ABSP_IITKGP_2016 | Vikaskumar2016 | 81.3 | mono | MFCC | SVM | |
Vu_task1_1 | Vu2016 | 80.0 | mono | MFCC | RNN | ||
Xu_task1_1 | HL-DNN-ASC_2016 | Xu2016 | 73.3 | mono | mel energy | DNN | |
Zoehrer_task1_1 | Zoehrer2016 | 73.1 | mono | spectrogram | GRNN |
Technical reports
Acoustic Scene Classification Using Parallel Combination of LSTM and CNN
Soo Hyun Bae, Inkyu Choi and Nam Soo Kim
Department of Electrical and Computer Engineering, Seoul National University, Seoul, South Korea
Bae_task1_1
Acoustic Scene Classification Using Parallel Combination of LSTM and CNN
Abstract
Deep neural networks (DNNs) have recently achieved great success in various learning tasks, and have also been used for the classification of environmental sounds. While DNNs show their potential in this classification task, they cannot fully utilize temporal information. In this paper, we propose a neural network architecture designed to exploit sequential information. The proposed structure is composed of two separate lower networks and one upper network, which we refer to as the LSTM layers, the CNN layers, and the connected layers, respectively. The LSTM layers extract sequential information from consecutive audio features. The CNN layers learn the spectro-temporal locality from spectrogram images. Finally, the connected layers summarize the outputs of the two networks, taking advantage of the complementary features of the LSTM and CNN by combining them. To compare the proposed method with other neural networks, we conducted a number of experiments on the TUT Acoustic Scenes 2016 dataset, which consists of recordings from various acoustic scenes. Using the proposed combination structure, we achieved higher performance compared to conventional DNN, CNN, and LSTM architectures.
System characteristics
Input | monophonic |
Sampling rate | 44.1kHz |
Features | spectrogram |
Classifier | CNN-RNN |
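As an illustration of the parallel combination described in the abstract, the following is a minimal Keras sketch, not the authors' implementation: a CNN branch over the spectrogram image and an LSTM branch over the same data viewed as a frame sequence, merged by fully connected layers. Input size, layer widths, and excerpt length are assumptions.

```python
from tensorflow.keras import layers, models

# Assumed dimensions: 128 frequency bins, 43 frames per excerpt, 15 scene classes.
n_freq, n_frames, n_classes = 128, 43, 15

spec_in = layers.Input(shape=(n_freq, n_frames, 1), name="spectrogram")

# CNN branch: learns spectro-temporal locality from the spectrogram image.
x = layers.Conv2D(32, (3, 3), activation="relu", padding="same")(spec_in)
x = layers.MaxPooling2D((2, 2))(x)
x = layers.Conv2D(64, (3, 3), activation="relu", padding="same")(x)
x = layers.GlobalAveragePooling2D()(x)

# LSTM branch: the same input viewed as a sequence of frame-wise feature vectors.
seq = layers.Permute((2, 1, 3))(spec_in)          # (frames, freq, 1)
seq = layers.Reshape((n_frames, n_freq))(seq)
y = layers.LSTM(64)(seq)

# "Connected" layers combine the two complementary representations.
z = layers.Concatenate()([x, y])
z = layers.Dense(128, activation="relu")(z)
out = layers.Dense(n_classes, activation="softmax")(z)

model = models.Model(spec_in, out)
model.compile(optimizer="adam", loss="categorical_crossentropy", metrics=["accuracy"])
```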
Technical Report of USTC System for Acoustic Scene Classification
Xiao Bao, Tian Gao and Jun Du
National Engineering Laboratory for Speech and Language Information Processing, University of Science and Technology of China, Hefei, Anhui, China
Bao_task1_1
Technical Report of USTC System for Acoustic Scene Classification
Abstract
This technical report describes our submission for the acoustic scene classification task of DCASE 2016. We first explore the use of Gaussian mixture models (GMM) and ergodic hidden Markov models (HMM). Next, we combine neural-network-based discriminative models (DNN, CNN) with the generative models to build hybrid systems, including DNN-GMM, CNN-GMM, DNN-HMM and CNN-HMM. Finally, a system combination method is used to obtain the best overall performance from the multiple systems.
System characteristics
Input | monophonic |
Sampling rate | 44.1kHz |
Features | MFCC+mel energy |
Classifier | fusion |
Acoustic Scene Classification Using Convolutional Neural Networks
Daniele Battaglino1,2, Ludovick Lepauloux1 and Nicholas Evans2
1NXP Software, France, 2EURECOM, France
Battaglino_task1_1
Acoustic Scene Classification Using Convolutional Neural Networks
Abstract
Acoustic scene classification (ASC) aims to distinguish between different acoustic environments and is a technology which can be used by smart devices for contextualization and personalization. Standard algorithms exploit hand-crafted features which are unlikely to offer the best potential for reliable classification. This paper reports the first application of convolutional neural networks (CNNs) to ASC, an approach which learns discriminant features automatically from spectral representations of raw acoustic data. A principal influence on performance comes from the specific convolutional filters which can be adjusted to capture different spectrotemporal, recurrent acoustic structure. The proposed CNN approach is shown to outperform a Gaussian mixture model baseline for the DCASE 2016 database even though training data is sparse.
System characteristics
Input | binaural |
Sampling rate | 44.1kHz |
Features | mel energy |
Classifier | CNN |
Supervised Nonnegative Matrix Factorization for Acoustic Scene Classification
Victor Bisot, Romain Serizel, Slim Essid and Gaël Richard
Telecom ParisTech, Paris, France
Bisot_task1_1
Supervised Nonnegative Matrix Factorization for Acoustic Scene Classification
Abstract
This report describes our contribution to the 2016 IEEE AASP DCASE challenge for the acoustic scene classification task. We propose a feature learning approach following the idea of decomposing time-frequency representations with nonnegative matrix factorization. We aim to learn a common dictionary representing the data and use projections onto this dictionary as features for classification. Our system is based on a novel supervised extension of nonnegative matrix factorization. In the approach we propose, the dictionary and the classifier are optimized jointly in order to find a suitable representation that minimizes the classification cost. The proposed method significantly outperforms the baseline and provides improved results compared to unsupervised nonnegative matrix factorization.
System characteristics
Input | monophonic |
Sampling rate | 44.1kHz |
Features | spectrogram |
Classifier | NMF |
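The supervised NMF in this submission optimizes the dictionary jointly with the classifier; as a simpler stand-in, the sketch below uses ordinary (unsupervised) NMF from scikit-learn to learn a dictionary and then trains a separate classifier on the projections. Data shapes and component counts are placeholders, not the submitted configuration.

```python
import numpy as np
from sklearn.decomposition import NMF
from sklearn.linear_model import LogisticRegression

# Placeholder non-negative time-frequency representations (one row per recording,
# e.g. a flattened or time-averaged spectrogram) and scene labels.
rng = np.random.default_rng(0)
X_train = rng.random((200, 512))
y_train = rng.integers(0, 15, size=200)
X_test = rng.random((50, 512))

# Learn a common dictionary and use the activations (projections) as features.
nmf = NMF(n_components=32, init="nndsvda", max_iter=400, random_state=0)
H_train = nmf.fit_transform(X_train)
H_test = nmf.transform(X_test)

clf = LogisticRegression(max_iter=1000).fit(H_train, y_train)
scene_predictions = clf.predict(H_test)
```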
Acoustic Scene Recognition with Deep Neural Networks (DCASE Challenge 2016)
Wei Dai1, Juncheng Li2, Phuong Pham3, Samarjit Das2 and Shuhui Qu4
1Carnegie Mellon University, Pittsburgh, USA, 2Robert Bosch Research and Technology Center, USA, 3University of Pittsburgh, Pittsburgh, USA, 4Stanford University, Stanford, USA
Qu_task1_1 Qu_task1_2 Qu_task1_3 Qu_task1_4
Acoustic Scene Recognition with Deep Neural Networks (DCASE Challenge 2016)
System characteristics
Input | mono |
Sampling rate | 44.1kHz |
Features | various |
Classifier | ensemble |
CP-JKU Submissions for DCASE-2016: a Hybrid Approach Using Binaural I-Vectors and Deep Convolutional Neural Networks
Hamid Eghbal-Zadeh, Bernhard Lehner, Matthias Dorfer and Gerhard Widmer
Department of Computational Perception, Johannes Kepler University of Linz, Linz, Austria
Eghbal-Zadeh_task1_1 Eghbal-Zadeh_task1_2 Eghbal-Zadeh_task1_3 Eghbal-Zadeh_task1_4
CP-JKU Submissions for DCASE-2016: a Hybrid Approach Using Binaural I-Vectors and Deep Convolutional Neural Networks
Abstract
This report describes the CP-JKU team's 4 submissions for Task 1 (audio scene classification) of the DCASE-2016 challenge. We propose 4 different approaches for Audio Scene Classification (ASC). First, we propose a novel i-vector extraction scheme for ASC using both left and right audio channels. Second, we propose a Deep Convolutional Neural Network (DCNN) architecture trained on spectrograms of audio excerpts in end-to-end fashion. Third, we use a calibration transformation to improve the performance of our binaural i-vector system. Finally, we propose a late fusion of our binaural i-vector system and the DCNN. We report the performance of our proposed methods on the provided cross-validation setup of the DCASE-2016 challenge. Using the late-fusion approach, we improve the performance of the baseline by 17% in accuracy.
System characteristics
Input | binaural; monophonic; mono+binaural |
Sampling rate | 44.1kHz |
Features | MFCC; spectrogram; MFCC+spectrograms |
Classifier | I-vector; CNN; fusion |
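The highest-scoring CP-JKU entry is a late fusion of the calibrated binaural i-vector system and the DCNN. A minimal sketch of such a late fusion is shown below, with hypothetical per-system class posteriors and a fixed fusion weight standing in for the calibration and fusion actually used.

```python
import numpy as np

# Hypothetical class-probability matrices from the two subsystems for the same
# test recordings (rows = recordings, columns = 15 scene classes).
rng = np.random.default_rng(0)
p_ivector = rng.dirichlet(np.ones(15), size=390)   # binaural i-vector backend
p_cnn = rng.dirichlet(np.ones(15), size=390)       # spectrogram DCNN

# Late fusion: weighted average of the posteriors, then argmax per recording.
w = 0.5                                            # fusion weight (an assumption)
p_fused = w * p_ivector + (1.0 - w) * p_cnn
predicted_class = p_fused.argmax(axis=1)
```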
Experiments on The DCASE Challenge 2016: Acoustic Scene Classification and Sound Event Detection in Real Life Recording
Benjamin Elizalde1, Anurag Kumar1, Ankit Shah2, Rohan Badlani3, Emmanuel Vincent4, Bhiksha Raj1 and Ian Lane1
1Carnegie Mellon University, Pittsburgh, USA, 2NIT Surathkal, India, 3BITS, Pilani, India, 4Inria, Villers-les-Nancy, France
Kumar_task1_1
Experiments on The DCASE Challenge 2016: Acoustic Scene Classification and Sound Event Detection in Real Life Recording
Abstract
In this paper we present our work on Task 1, Acoustic Scene Classification, and Task 3, Sound Event Detection in Real Life Recordings. Our experiments cover low-level and high-level features, classifier optimization and other heuristics specific to each task. Our performance on both tasks improved over the DCASE baselines: for Task 1 we achieved an overall accuracy of 78.9% compared to the baseline of 72.6%, and for Task 3 we achieved a segment-based error rate of 0.48 compared to the baseline of 0.91.
System characteristics
Input | mono |
Sampling rate | 44.1kHz |
Features | MFCC distribution |
Classifier | SVM |
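The "MFCC distribution" feature can be read as summarizing each recording's frame-level MFCCs by a single Gaussian. The sketch below is a guess at that idea rather than the submitted code: it concatenates the mean vector and the upper triangle of the covariance matrix into one descriptor per recording and feeds it to an SVM.

```python
import numpy as np
from sklearn.svm import SVC

def gaussian_descriptor(mfcc):
    """Describe a recording's MFCC distribution by a single Gaussian:
    per-coefficient means plus the upper triangle of the covariance matrix."""
    mu = mfcc.mean(axis=0)
    cov = np.cov(mfcc, rowvar=False)
    iu = np.triu_indices(cov.shape[0])
    return np.concatenate([mu, cov[iu]])

# Placeholder data: 60 recordings of ~1300 frames x 20 MFCCs, with random labels.
rng = np.random.default_rng(0)
recordings = [rng.normal(size=(1300, 20)) for _ in range(60)]
labels = rng.integers(0, 15, size=60)

X = np.stack([gaussian_descriptor(m) for m in recordings])
clf = SVC(kernel="rbf", C=1.0).fit(X, labels)
```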
Mel-Band Features for DCASE 2016 Acoustic Scene Classification Task
Juliano Henrique Foleiss1 and Tiago Fernandes Tavares2
1Universidade Tecnologica Federal do Parana, Campo Mourao, Brazil, 2Universidade Estadual de Campinas, Campinas, Brazil
Foleiss_task1_1
Mel-Band Features for DCASE 2016 Acoustic Scene Classification Task
Abstract
In this work we propose to separately calculate spectral low-level features in each frequency band, as is commonly done in the problem of beat tracking and tempo estimation [1]. We base this approach on the same auditory models that inspired the use of Mel-Frequency Cepstral Coefficients (MFCCs) [2] or energy through a filter bank [3] for audio genre classification. They rely on a model of the cochlea in which similar regions of the inner ear are stimulated by similar frequencies and are processed independently. Both the MFCC and the filter-bank energy approaches generate only an energy spectrum. In our approach, we expand this idea to incorporate other perceptually inspired features.
System characteristics
Input | monophonic |
Sampling rate | 44.1kHz |
Features | various |
Classifier | SVM |
Convolutional Neural Network with Multiple-Width Frequency-Delta Data Augmentation for Acoustic Scene Classification
Yoonchang Han and Kyogu Lee
Music and Audio Research Group, Seoul National University, Seoul, South Korea
Lee_task1_1
Convolutional Neural Network with Multiple-Width Frequency-Delta Data Augmentation for Acoustic Scene Classification
Abstract
In this paper, we apply a convolutional neural network to the acoustic scene classification task of DCASE 2016. We propose multi-width frequency-delta data augmentation, which uses the static mel-spectrogram as well as frequency-delta features as individual examples with the same labels for the network input; the experimental results show that this method significantly improves performance compared to using the static mel-spectrogram input only. In addition, we propose folded mean aggregation, which multiplies the output probabilities of the static and delta augmentation data from the same window prior to audio clip-wise aggregation, and we found that this method further reduces the error rate. The system exhibited a classification accuracy of 0.831 when classifying 15 acoustic scenes.
System characteristics
Input | mono |
Sampling rate | 44.1kHz |
Features | mel energy |
Classifier | CNN |
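A minimal sketch of the multi-width frequency-delta augmentation idea: besides the static log-mel spectrogram, delta features are computed along the frequency axis with several widths, and each resulting array is used as an additional training example with the same label. The widths and shapes below are assumptions, not the submitted configuration.

```python
import numpy as np
import librosa

def multi_width_freq_delta(log_mel, widths=(3, 11, 19)):
    """Return the static log-mel spectrogram plus frequency-delta versions
    computed with several (odd) widths, stacked as separate examples."""
    examples = [log_mel]
    for w in widths:
        # axis=0: deltas across mel bands (frequency), not across time
        examples.append(librosa.feature.delta(log_mel, width=w, axis=0))
    return np.stack(examples)

log_mel = np.random.default_rng(0).normal(size=(128, 430))  # placeholder (bands x frames)
augmented = multi_width_freq_delta(log_mel)                  # shape (4, 128, 430)
```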
DCASE2016 Baseline System
System characteristics
Input | monophonic |
Sampling rate | 44.1kHz |
Features | MFCC |
Classifier | GMM |
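The baseline models frame-level MFCCs with one GMM per scene class and picks the scene whose model gives the highest total log-likelihood. The following is a minimal sketch of that pipeline using librosa and scikit-learn; file handling, feature settings and the number of Gaussian components are assumptions rather than the official baseline code.

```python
import numpy as np
import librosa
from sklearn.mixture import GaussianMixture

def mfcc_frames(path, sr=44100, n_mfcc=20):
    """Frame-level MFCCs (frames x coefficients) for one recording."""
    y, sr = librosa.load(path, sr=sr, mono=True)
    return librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc).T

def train_scene_gmms(train_files, n_components=16):
    """train_files: hypothetical dict mapping scene label -> list of wav paths."""
    gmms = {}
    for scene, paths in train_files.items():
        feats = np.vstack([mfcc_frames(p) for p in paths])
        gmms[scene] = GaussianMixture(n_components=n_components,
                                      covariance_type="diag").fit(feats)
    return gmms

def classify(path, gmms):
    feats = mfcc_frames(path)
    # Sum of frame log-likelihoods under each scene model; pick the best scene.
    scores = {scene: g.score_samples(feats).sum() for scene, g in gmms.items()}
    return max(scores, key=scores.get)
```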
Classifying Variable-Length Audio Files with All-Convolutional Networks and Masked Global Pooling
Lars Hertel1, Huy Phan1,2 and Alfred Mertins1
1Institute for Signal Processing, University of Luebeck, Luebeck, Germany, 2Graduate School for Computing in Medicine and Life Sciences, University of Luebeck, Luebeck, Germany
Hertel_task1_1
Classifying Variable-Length Audio Files with All-Convolutional Networks and Masked Global Pooling
Abstract
We trained a deep all-convolutional neural network with masked global pooling to perform single-label classification for acoustic scene classification and multi-label classification for domestic audio tagging in the DCASE-2016 contest. Our network achieved an average accuracy of 84.5% on the four-fold cross-validation for acoustic scene recognition, compared to the provided baseline of 72.5%, and an average equal error rate of 0.17 for domestic audio tagging, compared to the baseline of 0.21. The network therefore improves the baselines by a relative amount of 17% and 19%, respectively. The network consists only of convolutional layers to extract features from the short-time Fourier transform and one global pooling layer to combine those features. In particular, it contains neither fully-connected layers (apart from the fully-connected output layer) nor dropout layers.
System characteristics
Input | left |
Sampling rate | 44.1kHz |
Features | spectrogram |
Classifier | CNN |
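A minimal Keras sketch in the spirit of the abstract: only convolutional layers followed by one global pooling layer and a softmax output, with the time axis left unspecified so that variable-length inputs are accepted. The masking used in the submission is omitted, and all layer sizes are assumptions.

```python
from tensorflow.keras import layers, models

n_freq, n_classes = 257, 15   # assumed STFT bins and number of scenes

model = models.Sequential([
    layers.Input(shape=(None, n_freq, 1)),                 # variable-length time axis
    layers.Conv2D(32, (5, 5), strides=2, activation="relu", padding="same"),
    layers.Conv2D(64, (5, 5), strides=2, activation="relu", padding="same"),
    layers.Conv2D(128, (3, 3), strides=2, activation="relu", padding="same"),
    layers.Conv2D(n_classes, (1, 1)),                       # one feature map per class
    layers.GlobalAveragePooling2D(),                        # global pooling (unmasked here)
    layers.Softmax(),
])
model.compile(optimizer="adam", loss="categorical_crossentropy")
```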
Empirical Study on Ensemble Method of Deep Neural Networks for Acoustic Scene Classification
Jaehun Kim and Kyogu Lee
Music and Audio Research Group, Seoul National University, Seoul, South Korea
Lee_task1_2
Empirical Study on Ensemble Method of Deep Neural Networks for Acoustic Scene Classification
Abstract
Deep neural networks have shown superior classification and regression performance in a wide range of applications. In particular, ensembles of deep machines have been reported to effectively decrease test errors in many studies. In this work, we extend the scale of deep machines to include hundreds of networks and apply this to acoustic scene classification. In doing so, several recent learning techniques are employed to accelerate the training process, and a novel stochastic feature diversification method is proposed to allow different contributions from each constituent network. Experimental results with the DCASE2016 dataset indicate that an ensemble of deep machines leads to better performance on acoustic scene classification.
System characteristics
Input | mono |
Sampling rate | 44.1kHz |
Features | unsupervised |
Classifier | CNN ensemble |
Deep Neural Network Baseline for DCASE Challenge 2016
Abstract
The DCASE Challenge 2016 contains tasks for Acoustic Scene Classification (ASC), Acoustic Event Detection (AED), and audio tagging. Since 2006, Deep Neural Networks (DNNs) have been widely applied to computer vision, speech recognition and natural language processing tasks. In this paper, we provide DNN baselines for the DCASE Challenge 2016. For feature extraction, 40 Mel filter bank features are used. Two kinds of Mel banks, a same-area bank and a same-height bank, are discussed. Experimental results show that the same-height bank is better than the same-area bank. DNNs with the same structure are applied to all four tasks in the DCASE Challenge 2016. In Task 1 we obtained an accuracy of 76.4% using Mel + DNN against 72.5% using Mel Frequency Cepstral Coefficient (MFCC) + Gaussian Mixture Model (GMM). In Task 2 we obtained an F value of 17.4% using Mel + DNN against 41.6% using Constant Q Transform (CQT) + Nonnegative Matrix Factorization (NMF). In Task 3 we obtained an F value of 38.1% using Mel + DNN against 26.6% using MFCC + GMM. In Task 4 we obtained an Equal Error Rate (EER) of 20.9% using Mel + DNN against 21.0% using MFCC + GMM. The DNN therefore improves on the baseline in Task 1 and Task 3, is similar to the baseline in Task 4, and is worse than the baseline in Task 2. This indicates that DNNs can be successful in many of these tasks, but may not always work.
System characteristics
Input | mono |
Sampling rate | 44.1kHz |
Features | mel energy |
Classifier | DNN |
CQT-Based Convolutional Neural Networks for Audio Scene Classification and Domestic Audio Tagging
Thomas Lidy1 and Alexander Schindler2
1Institute of Software Technology, Vienna University of Technology, Vienna, Austria, 2Digital Safety and Security, Austrian Institute of Technology, Vienna, Austria
Schindler_task1_1 Schindler_task1_2
CQT-Based Convolutional Neural Networks for Audio Scene Classification and Domestic Audio Tagging
Abstract
For the DCASE 2016 audio benchmarking contest, we submitted a parallel Convolutional Neural Network architecture for the tasks of 1) classifying acoustic scenes and urban soundscapes (task 1) and 2) domestic audio tagging (task 4). A popular choice of input to a Convolutional Neural Network in audio classification problems is Mel-transformed spectrograms. We, however, found that a Constant-Q-transformed input improves results. Furthermore, we evaluated critical parameters such as the number of necessary bands and filter sizes in a Convolutional Neural Network. Finally, we propose a parallel (graph-based) neural network architecture which captures relevant audio characteristics both in time and in frequency, and submitted it to DCASE 2016 tasks 1 and 4, with some slight alterations described in this paper. Our approach shows a 10.7% relative improvement over the baseline system of the acoustic scene classification task on the development set of task 1 [1].
System characteristics
Input | mono |
Sampling rate | 44.1kHz |
Features | CQT |
Classifier | CNN |
Acoustic Scene Classification by Feed Forward Neural Network with Class Dependent Attention Mechanism
Jiaming Liu1, Hihui Wang1, Mingyu You1, Ruiwei Zhao1 and Guozheng Li2
1Department of Control Science and Engineering, Tongji University, Shanghai, China, 2Data Center of Traditional Chinese Medicine, China Academy of Chinese Medical Science, Beijing, China
Liu_task1_1 Liu_task1_2
Acoustic Scene Classification by Feed Forward Neural Network with Class Dependent Attention Mechanism
Abstract
For the acoustic scene classification task, we propose a novel attention mechanism embedded in feed-forward networks. On top of a shared input layer, 15 separate attention modules are computed, one per class, which output 15 class-dependent feature vectors. The feature vectors are then mapped to class labels by 15 subnetworks, and a softmax layer is employed on the very top of the network. In our experiments, the default features, MFCCs and mel filterbanks with delta and acceleration coefficients, are used to represent each segment. We split each 30 s audio recording into 1 s segments, predict a label for each segment, and output the most frequent label for the 30 s recording. The best single neural network achieves 77.4% cross-validation accuracy without further feature engineering or any data augmentation. We train 5 models with MFCC features and 5 models with mel filterbank features, then form a majority-vote ensemble, obtaining a 78.6% final cross-validation result. For submission, the 10 models are retrained on the full dataset, and the final submission is a majority-vote ensemble of the 10 models' outputs.
System characteristics
Input | mono |
Sampling rate | 44.1kHz |
Features | MFCC+mel energy |
Classifier | fusion |
Binaural Scene Classification with Wavelet Scattering
Vincent Lostanlen1 and Joakim Andén2
1Departement d’Informatique, Ecole normale superieure, Paris, France, 2Program in Applied and Computational Mathematics, Princeton University, Princeton, NJ, USA
Lostanlen_task1_1
Binaural Scene Classification with Wavelet Scattering
Abstract
This technical report describes our contribution to the scene classification task of the 2016 edition of the IEEE AASP Challenge for Detection and Classification of Acoustic Scenes and Events (DCASE). Our computational pipeline consists of a gammatone scattering transform, logarithmically compressed and coupled with a per-frame linear support vector machine. At test time, frame-level labels are aggregated over the whole recording by majority vote. During the training phase, we propose a novel data augmentation technique, where left and right channels are mixed at different proportions to introduce invariance to sound direction in the training data.
System characteristics
Input | mixed |
Sampling rate | 44.1kHz |
Features | gammatone scattering |
Classifier | SVM |
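The data augmentation described in the abstract mixes the left and right channels at different proportions to make the classifier invariant to sound direction. A small sketch of that step, with placeholder audio and an assumed number of augmented copies:

```python
import numpy as np

def mix_channels(stereo, rng, n_augmented=4):
    """Create mono mixtures alpha*left + (1-alpha)*right at random proportions,
    so the training data varies in apparent sound direction.
    `stereo` is an (n_samples, 2) array."""
    left, right = stereo[:, 0], stereo[:, 1]
    mixes = []
    for _ in range(n_augmented):
        alpha = rng.uniform(0.0, 1.0)
        mixes.append(alpha * left + (1.0 - alpha) * right)
    return np.stack(mixes)

rng = np.random.default_rng(0)
stereo = rng.normal(size=(44100 * 30, 2))   # placeholder 30 s stereo recording
augmented = mix_channels(stereo, rng)        # 4 mono signals sharing the same label
```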
The UP System for the 2016 DCASE Challenge Using Deep Recurrent Neural Network and Multiscale Kernel Subspace Learning
Erik Marchi1,2, Dario Tonelli3, Xinzhou Xu1, Fabien Ringeval1,2, Jun Deng1, Stefano Squartini3 and Björn Schuller1,2,4
1Chair of Complex and Intelligent Systems, University of Passau, Passau, Germany, 2audEERING GmbH, Gilching, Germany, 3A3LAB, Department of Information Engineering, Universita Politecnica delle Marche, Italy, 4Department of Computing, Imperial College London, London, United Kingdom
Marchi_task1_1
The UP System for the 2016 DCASE Challenge Using Deep Recurrent Neural Network and Multiscale Kernel Subspace Learning
Abstract
We propose a system for acoustic scene classification using pairwise decomposition with deep neural networks and dimensionality reduction by multiscale kernel subspace learning. It is our contribution to the Acoustic Scene Classification task of the IEEE AASP Challenge on Detection and Classification of Acoustic Scenes and Events (DCASE2016). The system classifies 15 different acoustic scenes. First, auditory spectral features are extracted and fed into 15 binary deep multilayer perceptron (MLP) neural networks. The MLPs are trained with the one-against-all paradigm to perform a pairwise decomposition. In a second stage, a large number of spectral, cepstral, energy and voicing-related audio features are extracted. Multiscale Gaussian kernels are then used to construct an optimal linear combination of Gram matrices for multiple kernel subspace learning. The reduced feature set is fed into a nearest-neighbour classifier. Predictions from the two systems are then combined by a threshold-based decision function. On the official development set of the challenge, an accuracy of 81.5% is achieved. In this technical report, we provide a description of the actual system submitted to the challenge.
System characteristics
Input | mono |
Sampling rate | 44.1kHz |
Features | various |
Classifier | fusion |
TUT Acoustic Scene Classification Submission
Gonçalo Marques1 and Thibault Langlois2
1Electronic Telecom. and Comp. Dept., Instituto Superior de Engenharia de Lisboa, Lisboa, Portugal, 2Informatics Dept., Faculdade de Ciências da Universidade de Lisboa, Lisboa, Portugal
Marques_task1_1
TUT Acoustic Scene Classification Submission
Abstract
This technical report presents the details of our submission to the D-CASE classification challenge, Task 1: Acoustic Scene Classification. The method consists of a feature extraction phase followed by two dimensionality reduction steps (PCA and LDA), with classification performed using the k-nearest-neighbours algorithm.
System characteristics
Input | mono |
Sampling rate | 44.1kHz |
Features | MFCC |
Classifier | kNN |
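The classification back-end described in the abstract (two dimensionality reduction steps followed by k-nearest neighbours) maps directly onto a scikit-learn pipeline. A minimal sketch with placeholder features and assumed component counts:

```python
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.decomposition import PCA
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.neighbors import KNeighborsClassifier

# Placeholder recording-level feature vectors and scene labels (15 classes).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 400))
y = np.tile(np.arange(15), 14)[:200]

clf = make_pipeline(
    PCA(n_components=60),                           # first reduction step
    LinearDiscriminantAnalysis(n_components=14),    # second step (<= n_classes - 1)
    KNeighborsClassifier(n_neighbors=5),            # k-nearest-neighbours classifier
)
clf.fit(X, y)
```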
Acoustic Scene Classification Using Time-Delay Neural Networks and Amplitude Modulation Filter Bank Features
Niko Moritz1, Jens Schröder1, Stefan Goetze1, Jörn Anemüller2 and Birger Kollmeier2
1Project Group for Hearing, Speech, and Audio Processing, Fraunhofer IDMT, Oldenburg, Germany, 2Medizinische Physik & Hearing4all, University of Oldenburg, Oldenburg, Germany
Moritz_task1_1
Acoustic Scene Classification Using Time-Delay Neural Networks and Amplitude Modulation Filter Bank Features
Abstract
This paper presents a system for acoustic scene classification (SC) that is applied to data of the SC task of the DCASE'16 challenge (Task 1). The proposed method is based on extracting acoustic features that employ a relatively long temporal context, i.e., amplitude modulation filter bank (AMFB) features, prior to detection of acoustic scenes using a neural network (NN) based classification approach. Recurrent neural networks (RNNs) are well suited to model long-term acoustic dependencies that are known to encode important information for SC tasks. However, RNNs require a relatively large amount of training data in comparison to feed-forward deep neural networks (DNNs). Hence, the time-delay neural network (TDNN) approach is used in the present work, which enables analysis of long contextual information similar to RNNs but with training effort comparable to conventional DNNs. The proposed SC system attains a recognition accuracy of 76.5%, which is 4.0% higher than that of the DCASE'16 baseline system.
System characteristics
Input | left+right+mono |
Sampling rate | 16kHz |
Features | amplitude modulation filter bank |
Classifier | TDNN |
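A time-delay neural network analyses a long temporal context with feed-forward training cost; one common way to express it is a stack of 1-D convolutions over time with growing (dilated) context, as in the hedged Keras sketch below. The feature dimensionality, layer sizes, and context widths are assumptions, not the submitted configuration.

```python
from tensorflow.keras import layers, models

n_features, n_classes = 60, 15   # assumed AMFB feature dimension and scene count

# TDNN-style stack: each 1-D convolution splices a wider temporal context
# (here via dilation), followed by pooling over the whole sequence.
model = models.Sequential([
    layers.Input(shape=(None, n_features)),
    layers.Conv1D(256, kernel_size=5, dilation_rate=1, activation="relu", padding="same"),
    layers.Conv1D(256, kernel_size=3, dilation_rate=2, activation="relu", padding="same"),
    layers.Conv1D(256, kernel_size=3, dilation_rate=4, activation="relu", padding="same"),
    layers.GlobalAveragePooling1D(),
    layers.Dense(n_classes, activation="softmax"),
])
model.compile(optimizer="adam", loss="categorical_crossentropy")
```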
Acoustic Scene Classification Using MFCC and MP Features
Manjunath Mulimani and Shashidhar G. Koolagudi
Dept. of Computer Science & Engineering, National Institute of Technology, Karnataka, India
Mulimani_task1_1
Acoustic Scene Classification Using MFCC and MP Features
Abstract
This paper describes our experiments for the acoustic scene classification task, part of the Detection and Classification of Acoustic Scenes and Events 2016 (DCASE-2016) IEEE Audio and Acoustic Signal Processing (AASP) challenge. Identifying features from the given audio clips for appropriate acoustic scene classification is a challenging task because of their heterogeneous nature. To identify such features, we implemented several methods using the Matching Pursuit (MP) algorithm to extract time-frequency (TF) based features. The MP algorithm iteratively selects atoms among a set of parameterized waveforms in the dictionary that best correlate with the original signal structure. From the selected atoms, the mean and standard deviation of the amplitude and frequency parameters of the first few (n) atoms are calculated separately, resulting in four MP feature sets. The combination of twenty MFCCs along with the four MP features enhanced the recognition accuracy of acoustic scenes using a GMM classifier.
System characteristics
Input | mono |
Sampling rate | 44.1kHz |
Features | MFCC+matching pursuit |
Classifier | GMM |
Deep Neural Network Bottleneck Feature for Acoustic Scene Classification
Seongkyu Mun1, Sangwook Park2, Younglo Lee2 and Hanseok Ko1,2
1Department of Visual Information Processing, Korea University, Seoul, South Korea, 2School of Electrical Engineering, Korea University, Seoul, South Korea
Ko_task1_2
Deep Neural Network Bottleneck Feature for Acoustic Scene Classification
Abstract
Bottleneck features have been shown to be effective in improving the accuracy of speaker recognition, language identification and automatic speech recognition. However, few works have focused on bottleneck features for acoustic scene classification. This report proposes a novel acoustic scene feature extraction using bottleneck features derived from a Deep Neural Network (DNN). On the official development set with our settings, a feature set that includes bottleneck features and Perceptual Linear Prediction (PLP) features shows the best accuracy.
System characteristics
Input | left+right+mono |
Sampling rate | 16kHz |
Features | various |
Classifier | DNN |
Sound Scene Identification Based on Monaural and Binaural Features
Waldo Nogueira1,2
1Medical University Hannover, Hannover, Germany, 2Cluster of Excellence Hearing4all, Hannover, Germany
Nogueira_task1_1
Sound Scene Identification Based on Monaural and Binaural Features
Abstract
This submission to the acoustic scene classification sub-task of the IEEE DCASE 2016 Challenge is based on a feature extraction module that concatenates monaural and binaural features. Monaural features are based on Mel-frequency cepstra summarized using recurrence quantification analysis. Binaural features are based on the extraction of inter-aural differences (level and time) and the coherence between the two channels of the stereo recordings. These features are used in conjunction with a support vector machine for the classification of the acoustic sound scenes. In this short paper the impact of the different features is analyzed.
System characteristics
Input | binaural |
Sampling rate | 44.1kHz |
Features | various |
Classifier | SVM |
Score Fusion of Classification Systems for Acoustic Scene Classification
Sangwook Park1, Seongkyu Mun2, Younglo Lee1 and Hanseok Ko1,2
1School of Electrical Engineering, Korea University, Seoul, South Korea, 2Department of Visual Information Processing, Korea University, Seoul, South Korea
Ko_task1_1
Score Fusion of Classification Systems for Acoustic Scene Classification
Abstract
This technical report describes our study on acoustic scene classification, one of the tasks of the IEEE AASP Challenge: Detection and Classification of Acoustic Scenes and Events. We investigated several methods in three respects: feature extraction, generative/discriminative machine learning, and score fusion for the final decision. To find an appropriate frame-based feature, a new feature was devised after investigating several alternatives. Models based on both generative and discriminative learning were then applied to classify the features, yielding several systems each composed of a feature and a classifier. The final result was determined by fusing the individual results. Experiment results are summarized in Section 3, and concluding remarks are presented in Section 4.
System characteristics
Input | mono |
Sampling rate | 44.1kHz |
Features | various |
Classifier | fusion |
Acoustic Scene Classification Using Deep Learning
Rohit Patiyal and Padmanabhan Rajan
School of Computing and Electrical Engineering, Indian Institute of Technology Mandi, Himachal Pradesh, India
Patiyal_task1_1
Acoustic Scene Classification Using Deep Learning
Abstract
Acoustic Scene Classification (ASC) is the task of classifying audio samples on the basis of their soundscapes. It is one of the tasks taken up by the Detection and Classification of Acoustic Scenes and Events 2016 (DCASE-2016) challenge, for which a labeled dataset of audio samples from various scenes is provided and solutions are invited. In this paper, the use of Deep Neural Networks (DNNs) is proposed for the ASC task. Different methods for extracting features with different classification algorithms are explored. It is observed that the DNN works significantly better than other methods trained on the same set of features, and performs on par with the state-of-the-art techniques presented in DCASE-2013. It is concluded that the use of MFCC features with a DNN works best, giving a 97.6% cross-validation score on the 2016 development dataset for a particular set of DNN parameters. Training the DNN also does not take longer run times compared to the other methods.
System characteristics
Input | mono |
Sampling rate | 44.1kHz |
Features | MFCC |
Classifier | DNN |
CNN-LTE: a Class of 1-X Pooling Convolutional Neural Networks on Label Tree Embeddings for Audio Scene Recognition
Huy Phan1,2, Lars Hertel1, Marco Maass1, Philipp Koch1 and Alfred Mertins1
1Institute for Signal Processing, University of Luebeck, Luebeck, Germany, 2Graduate School for Computing in Medicine and Life Sciences, University of Luebeck, Luebeck, Germany
Phan_task1_1
CNN-LTE: a Class of 1-X Pooling Convolutional Neural Networks on Label Tree Embeddings for Audio Scene Recognition
Abstract
We describe in this report our audio scene recognition system submitted to the DCASE 2016 challenge [1]. Firstly, given the label set of the scenes, a label tree is automatically constructed. This category taxonomy is then used in the feature extraction step in which an audio scene instance is represented by a label tree embedding image. Different convolutional neural networks, which are tailored for the task at hand, are finally learned on top of the image features for scene recognition. Our system reaches an overall recognition accuracy of 81.2% and outperforms the DCASE 2016 baseline with an absolute improvement of 8.7% on the development data.
System characteristics
Input | mono |
Sampling rate | 44.1kHz |
Features | label tree embedding |
Classifier | CNN |
Deep Neural Network for Acoustic Scene Detection
Alexei Pugachev and Dmitrii Ubskii
Chair of Speech Information Systems, ITMO University, St. Petersburg, Russia
Pugachev_task1_1
Deep Neural Network for Acoustic Scene Detection
Abstract
The DCASE 2016 challenge comprised the task of acoustic scene classification. The goal of this task was to classify test recordings into one of the predefined classes that characterize the environment in which they were recorded.
System characteristics
Input | mono |
Sampling rate | 44.1kHz |
Features | MFCC |
Classifier | DNN |
Enriched Supervised Feature Learning for Acoustic Scene Classification
Alain Rakotomamonjy
Music and Audio Research Group, Normandie Université, Saint Etienne du Rouvray, France
Rakotomamonjy_task1_1 Rakotomamonjy_task1_2
Enriched Supervised Feature Learning for Acoustic Scene Classification
Abstract
This paper presents the methodology we followed for our submission to the DCASE 2016 competition on acoustic scene classification (Task 1). The approach is based on a supervised feature learning technique built upon matrix factorization of time-frequency representations of an audio scene. As an original contribution, we introduce a non-negative supervised matrix factorization that helps in learning discriminative codes. Our experiments have shown that these supervised features perform slightly better than convolutional neural networks for this challenge. In addition, when they are coupled with hand-crafted features such as histograms of gradients, their performance is further boosted.
System characteristics
Input | mono |
Sampling rate | 44.1kHz |
Features | various |
Classifier | SVM |
Acoustic Scene Classification Using Network-In-Network Based Convolutional Neural Network
Andri Santoso, Chien-Yao Wang and Jia-Ching Wang
National Central University, Taiwan
Santoso_task1_1
Acoustic Scene Classification Using Network-In-Network Based Convolutional Neural Network
Abstract
In this paper, we present our entry to the challenge on Detection and Classification of Acoustic Scenes and Events (DCASE). The submission is for the task of automatic audio scene classification. Our approach is based on a deep learning method adopted from the computer vision research field: a convolutional neural network is used to solve the problem of audio-based scene classification, and specifically the network-in-network architecture is utilized to build the classifier. For the feature extraction part, Mel-frequency cepstral coefficients (MFCC) are used as the input vector for the classifier. Differing from the original network-in-network architecture, in this work we perform 1-D convolution operations instead of 2-D convolutions. The classifier is trained using every frame from the MFCC feature set, and the frame-level results are then thresholded and voted on to choose the final scene label of the audio data. The proposed work shows better performance than the provided baseline system of the DCASE challenge.
System characteristics
Input | mono |
Sampling rate | 44.1kHz |
Features | MFCC |
Classifier | CNN |
Acoustic Scene Classification: an Evaluation of an Extremely Compact Feature Representation
Gustavo Sena Mafra1, Quang-Khanh-Ngoc Duong2, Alexey Ozerov2 and Patrick Perez2
1Universidade Federal de Santa Catarina, Santa Catarina, Brazil, 2Technicolor, France
Duong_task1_1 Duong_task1_2 Duong_task1_3 Duong_task1_4
Acoustic Scene Classification: an Evaluation of an Extremely Compact Feature Representation
Abstract
This paper investigates several approaches to address the acoustic scene classification (ASC) task. We start from low-level feature representations of segmented audio frames and investigate different time granularities for feature aggregation. We study the use of a support vector machine (SVM), as a well-known classifier, together with two popular neural network (NN) architectures, namely the multilayer perceptron (MLP) and the convolutional neural network (CNN), for higher-level feature learning and classification. We evaluate the performance of these approaches on benchmark datasets provided by the 2013 and 2016 Detection and Classification of Acoustic Scenes and Events (DCASE) challenges. We observe that a simple approach exploiting the averaged Mel-log-spectrogram, as an extremely compact feature, together with an SVM, can obtain even better results than NN-based approaches and comparable performance to the best systems in the DCASE 2013 challenge.
System characteristics
Input | monophonic |
Sampling rate | 44.1kHz |
Features | mel energy |
Classifier | SVM; DNN |
Acoustic Scene Classification Using Deep Neural Network and Frame-Concatenated Acoustic Feature
Gen Takahashi1, Takeshi Yamada1, Shoji Makino1 and Nobutaka Ono2
1University of Tsukuba, Tsukuba, Japan, 2National Institute of Informatics / SOKENDAI, Japan
Takahashi_task1_1
Acoustic Scene Classification Using Deep Neural Network and Frame-Concatenated Acoustic Feature
Abstract
This paper describes our contribution to the task of acoustic scene classification in the DCASE2016 (Detection and Classification of Acoustic Scenes and Events 2016) Challenge set by IEEE AASP. In this work, we applied the DNN-GMM (Deep Neural Network-Gaussian Mixture Model) to acoustic scene classification. We introduced high-dimensional features that are concatenated with acoustic features in temporally adjacent frames. As a result, it was confirmed that the classification accuracy of the DNN-GMM was improved by 5.0% in comparison with that of the GMM, which was used as the baseline classifier.
System characteristics
Input | mono |
Sampling rate | 44.1kHz |
Features | MFCC |
Classifier | DNN-GMM |
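The "frame-concatenated acoustic feature" mentioned in the abstract can be sketched as stacking each frame with its temporally adjacent frames into one high-dimensional input vector for the DNN. A small illustration with an assumed context width:

```python
import numpy as np

def stack_context(frames, context=5):
    """Concatenate each frame with `context` neighbours on both sides
    (edge-padded), yielding high-dimensional frame vectors.
    `frames` has shape (n_frames, n_features)."""
    padded = np.pad(frames, ((context, context), (0, 0)), mode="edge")
    window = 2 * context + 1
    return np.stack([padded[i:i + window].ravel() for i in range(len(frames))])

mfcc = np.random.default_rng(0).normal(size=(1300, 20))  # placeholder MFCC frames
X = stack_context(mfcc)                                   # shape (1300, 220)
```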
DCASE 2016 Acoustic Scene Classification Using Convolutional Neural Networks
Michele Valenti1, Aleksandr Diment2, Giambattista Parascandolo2, Stefano Squartini1 and Tuomas Virtanen2
1Department of Information Engineering, Università Politecnica delle Marche, Ancona, Italy, 2Department of Signal Processing, Tampere University of Technology, Tampere, Finland
Valenti_task1_1
DCASE 2016 Acoustic Scene Classification Using Convolutional Neural Networks
Abstract
This workshop paper presents our contribution to the task of acoustic scene classification proposed for the Detection and Classification of Acoustic Scenes and Events (D-CASE) 2016 challenge. We propose the use of a convolutional neural network trained to classify short sequences of audio, represented by their log-mel spectrogram. In addition we use a training method that can be applied when the validation performance of the system saturates as training proceeds. The performance is evaluated on the public acoustic scene classification development dataset provided for the D-CASE challenge. The best accuracy score obtained by our configuration on a four-fold cross-validation setup is 79.0%, which constitutes an 8.8% relative improvement with respect to the baseline system, based on a Gaussian mixture model classifier.
System characteristics
Input | mono |
Sampling rate | 44.1kHz |
Features | mel energy |
Classifier | CNN |
Acoustic Scene Classification Based on Spectral Analysis and Feature-Level Channel Combination
Dinesh Vij1, Naveen Aggarwal1, Bhaskaran Raman2, K.K. Ramakrishnan3 and Divya Bansal4
1UIET, Panjab University, Chandigarh, India, 2IIT, Bombay, India, 3University of California, USA, 4PEC University of Technology, Chandigarh, India
Aggarwal_task1_1
Acoustic Scene Classification Based on Spectral Analysis and Feature-Level Channel Combination
Abstract
This paper is a submission to the sub-task Acoustic Scene Classification of the IEEE Audio and Acoustic Signal Processing challenge: Detection and Classification of Acoustic Scenes and Events 2016. The aim of the sub-task is to correctly detect 15 different acoustic scenes, which consist of indoor, outdoor, and vehicle categories. This work is based on spectral analysis, feature-level channel combination, and support vector machine classifier. In this short paper, the impact of different parameters while extracting features is analyzed. The accuracy gain obtained by feature-level channel combination is then reported.
System characteristics
Input | binaural |
Sampling rate | 44.1kHz |
Features | various |
Classifier | SVM |
Acoustic Scene Classification Using Block Based MFCC Features
Ghodasara Vikaskumar, Shefali Waldekar, Dipjyoti Paul and Goutam Saha
Electronics & Electrical Communication Engineering Department, Indian Institute of Technology Kharagpur, Kharagpur, India
Vikaskumar_task1_1
Acoustic Scene Classification Using Block Based MFCC Features
Abstract
Acoustic Scene Classification (ASC) is receiving widespread attention due to its wide variety of applications in smart wearable devices, surveillance, life-log diarization, etc. This work describes our contribution to the acoustic scene classification task of the DCASE2016 Challenge for Detection and Classification of Acoustic Scenes and Events. In this work, we apply block-based MFCC along with a few traditional short-term audio features, using mean and standard deviation as statistics and a Support Vector Machine (SVM) as the classifier for ASC. It is observed that the block-based MFCC feature performs better than classical MFCC. For evaluation purposes, we used three different datasets.
System characteristics
Input | mono |
Sampling rate | 44.1kHz |
Features | MFCC |
Classifier | SVM |
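One plausible reading of "block-based MFCC with mean and standard deviation as statistics" is sketched below: the frame-level MFCC matrix is split into consecutive time blocks, each block is summarized by per-coefficient mean and standard deviation, and the concatenated statistics feed an SVM. The number of blocks and all data are placeholders, not the submitted configuration.

```python
import numpy as np
from sklearn.svm import SVC

def block_mfcc_stats(mfcc, n_blocks=10):
    """Split frame-level MFCCs into consecutive time blocks and describe each
    block by its per-coefficient mean and standard deviation."""
    blocks = np.array_split(mfcc, n_blocks, axis=0)
    stats = [np.concatenate([b.mean(axis=0), b.std(axis=0)]) for b in blocks]
    return np.concatenate(stats)

rng = np.random.default_rng(0)
recordings = [rng.normal(size=(1300, 20)) for _ in range(60)]   # placeholder MFCCs
labels = rng.integers(0, 15, size=60)

X = np.stack([block_mfcc_stats(m) for m in recordings])
clf = SVC(kernel="rbf").fit(X, labels)
```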
Acoustic Scene and Event Recognition Using Recurrent Neural Networks
Toan H. Vu and Jia-Ching Wang
Department of Computer Science and Information Engineering, National Central University, Taoyuan, Taiwan
Vu_task1_1
Acoustic Scene and Event Recognition Using Recurrent Neural Networks
Abstract
The DCASE2016 challenge is designed particularly for research in environmental sound analysis. It consists of four tasks spanning various problems such as acoustic scene classification and sound event detection. This paper reports our results on all the tasks using Recurrent Neural Networks (RNNs). Experiments show that our models achieved superior performance compared with the baselines.
System characteristics
Input | mono |
Sampling rate | 44.1kHz |
Features | MFCC |
Classifier | RNN |
Hierarchical Learning for DNN-Based Acoustic Scene Classification
Yong Xu, Qiang Huang, Wenwu Wang and Mark D. Plumbley
Centre for Vision, Speech and Signal Processing, University of Surrey, Surrey, United Kingdom
Xu_task1_1
Hierarchical Learning for DNN-Based Acoustic Scene Classification
Abstract
In this paper, we present a deep neural network (DNN)-based acoustic scene classification framework. Two hierarchical learning methods are proposed to improve the DNN baseline performance by incorporating the hierarchical taxonomy information of environmental sounds. First, the parameters of the DNN are initialized by the proposed hierarchical pre-training. A multi-level objective function is then adopted to add more constraints to the cross-entropy based loss function. A series of experiments were conducted on Task 1 of the Detection and Classification of Acoustic Scenes and Events (DCASE) 2016 challenge. The final DNN-based system achieved a 22.9% relative improvement in average scene classification error compared with the Gaussian Mixture Model (GMM)-based benchmark system across the four standard folds.
System characteristics
Input | mono |
Sampling rate | 44.1kHz |
Features | mel energy |
Classifier | DNN |
Discriminative Training of GMM Parameters for Audio Scene Classification
Sungrack Yun, Sungwoong Kim, Sunkuk Moon, Juncheol Cho and Taesu Kim
Qualcomm Research, Seoul, South Korea
Kim_task1_1
Discriminative Training of GMM Parameters for Audio Scene Classification
Abstract
This report describes our algorithms for audio scene classification and audio tagging and the results on the DCASE 2016 challenge data. We propose a discriminative training algorithm to improve the baseline GMM performance. The algorithm updates the baseline GMM parameters by maximizing the margin between classes to improve discriminative performance. For Task 1, we use a hierarchical classifier to maximize discriminative performance and achieve 84% accuracy on the given cross-validation data. For Task 4, we apply a binary classifier for each label and achieve 16.71% EER on the given cross-validation data.
System characteristics
Input | mono |
Sampling rate | 44.1kHz |
Features | MFCC |
Classifier | GMM |
Gated Recurrent Networks Applied To Acoustic Scene Classification and Acoustic Event Detection
Matthias Zöhrer and Franz Pernkopf
Signal Processing and Speech Communication Laboratory, Graz University of Technology, Graz, Austria
Zoehrer_task1_1
Gated Recurrent Networks Applied To Acoustic Scene Classification and Acoustic Event Detection
Abstract
We present two resource-efficient frameworks for acoustic scene classification and acoustic event detection. In particular, we combine gated recurrent neural networks (GRNNs) and linear discriminant analysis (LDA) for efficiently classifying environmental sound scenes of the IEEE Detection and Classification of Acoustic Scenes and Events challenge (DCASE2016). Our system reaches an overall accuracy of 79.1% on DCASE 2016 task 1 development data, resulting in a relative improvement of 8.34% compared to the baseline GMM system. By applying GRNNs to the DCASE2016 real event detection data using an MSE objective, we obtain a segment-based error rate (ER) of 0.73, which is a relative improvement of 19.8% compared to the baseline GMM system. We further investigate semi-supervised learning applied to acoustic scene analysis. In particular, we evaluate the effects of a hybrid, i.e. generative-discriminative, objective function.
System characteristics
Input | mono |
Sampling rate | 44.1kHz |
Features | spectrogram |
Classifier | GRNN |