Proceedings - DCASE

The proceedings of the DCASE2020 Workshop have been published as an electronic publication:

Nobutaka Ono, Noboru Harada, Yohei Kawaguchi, Annamaria Mesaros, Keisuke Imoto, Yuma Koizumi, and Tatsuya Komatsu (eds.), Proceedings of the 5th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2020), Nov. 2020.

ISBN (Electronic): 978-4-600-00566-5
DOI: https://doi.org/10.5281/zenodo.4061782

Link PDF

Total cites: 1610 (updated 26.11.2024)

Microphone Array Optimization for Autonomous-Vehicle Audio Localization Based on the Radon Transform

Ohad Barak¹, Nizar Sallem¹, and Marc Fischer¹

¹Siemens Digital Industries Software Corporation

3 cites

PDF Video

Abstract

Beamforming is a standard method of determining the Direction-of-Arrival (DoA) of wave energy to an array of receivers. In the case of acoustic waves in an air medium, the array would comprise microphones. The angular resolution of an array depends on the frequency of the data, the number of microphones, the size of the array relative to the wavelengths in the medium, and the geometry of the array, i.e., the positions of the microphones in relation to each other. The task of finding the right balance between the aforementioned parameters is microphone-array optimization. This task is rendered even more complicated in the particular context of sound classification and localization for self driving cars as a result of the design limitations imposed by the automotive industry. We present a microphone array optimization method suitable for designing arrays to be placed on vehicles, which applies beamforming using the Radon transform. We show how our method produces an array geometry with reasonable angular resolution for audio frequencies that are in the range of interest for a road scenario.

Microphone Array Optimization for Autonomous-Vehicle Audio Localization Based on the Radon Transform

Abstract

Multi-Task Regularization Based on Infrequent Classes for Audio Captioning

Abstract

Event-Independent Network for Polyphonic Sound Event Localization and Detection

Abstract

SONYC-UST-V2: An Urban Sound Tagging Dataset with Spatiotemporal Context

Abstract

Audio Captioning Based on Transformer and Pre-Trained CNN

Abstract

Domain-Adversarial Training and Trainable Parallel Front-End for the DCASE 2020 Task 4 Sound Event Detection Challenge

Abstract

Task-Aware Separation for the DCASE 2020 Task 4 Sound Event Detection and Separation Challenge

Abstract

A Multi-Resolution Approach to Sound Event Detection in DCASE 2020 Task4

Abstract

Forward-Backward Convolutional Recurrent Neural Networks and Tag-Conditioned Convolutional Neural Networks for Weakly Labeled Semi-Supervised Sound Event Detection

Abstract

Self-Supervised Classification for Detecting Anomalous Sounds

Abstract

Group Masked Autoencoder Based Density Estimator for Audio Anomaly Detection

Abstract

Acoustic Scene Classification in DCASE 2020 Challenge: Generalization Across Devices and Low Complexity Solutions

Abstract

Guided Multi-Branch Learning Systems for Sound Event Detection with Sound Separation

Abstract

Detection of Anomalous Sounds for Machine Condition Monitoring using Classification Confidence

Abstract

ID-Conditioned Auto-Encoder for Unsupervised Anomaly Detection

Abstract

Audio Tag Representation Guided Dual Attention Network for Acoustic Scene Classification

Abstract

Description and Discussion on DCASE2020 Challenge Task2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring

Abstract

Low-Complexity Models for Acoustic Scene Classification Based on Receptive Field Regularization and Frequency Damping

Abstract

Model Selection for Deep Audio Source Separation via Clustering Analysis

Abstract

A Speaker Recognition Approach to Anomaly Detection

Abstract

Conformer-Based Sound Event Detection with Semi-Supervised Learning and Data Augmentation

Abstract

Embedded Acoustic Scene Classification for Low Power Microcontroller Devices

Abstract

Temporal Sub-Sampling of Audio Feature Sequences for Automated Audio Captioning

Abstract

On the Effectiveness of Spatial and Multi-Channel Features for Multi-Channel Polyphonic Sound Event Detection

Abstract

Ensemble of Sequence Matching Networks for Dynamic Sound Event Localization, Detection, and Tracking

Abstract

RWCP-SSD-Onomatopoeia: Onomatopoeic Word Dataset for Environmental Sound Synthesis

Abstract

Ensemble of Pruned Low-Complexity Models for Acoustic Scene Classification

Abstract

Lightweight Convolutional Neural Networks on Binaural Waveforms for Low Complexity Acoustic Scene Classification

Abstract

DCASE 2020 Task2: Anomalous Sound Detection using Relevant Spectral Feature and Focusing Techniques in the Unsupervised Learning Scenario

Abstract

Anomalous Sound Detection using Unsupervised and Semi-Supervised Autoencoders and Gammatone Audio Representation

Abstract

Listen Carefully and Tell: An Audio Captioning System Based on Residual Learning and Gammatone Audio Representation

Abstract

Papafil: A Low Complexity Sound Event Localization and Detection Method with Parametric Particle Filtering and Gradient Boosting

Abstract

On Multitask Loss Function for Audio Event Detection and Localization

Abstract

A Dataset of Reverberant Spatial Sound Scenes with Moving Sources for Sound Event Localization and Detection

Abstract

Anomalous Sound Detection as a Simple Binary Classification Problem with Careful Selection of Proxy Outlier Examples

Abstract

Deep Autoencoding GMM-Based Unsupervised Anomaly Detection in Acoustic Signals and its Hyper-Parameter Optimization

Abstract

Sound Event Localization and Detection Based on CRNN using Rectangular Filters and Channel Rotation Data Augmentation

Abstract

Open-Window: A Sound Event Dataset for Window State Detection and Recognition

Abstract

Effects of Word-Frequency Based Pre- and Post- Processings for Audio Captioning

Abstract

Evaluation Metric of Sound Event Detection Considering Severe Misdetection by Scenes

Abstract