Proceedings

Workshop on Detection and Classification of Acoustic Scenes and Events
20-22 September 2023, Tampere, Finland

Magdalena Fuentes, Toni Heittola, Keisuke Imoto, Annamaria Mesaros, Archontis Politis, Romain Serizel, Tuomas Virtanen (eds.), Proceedings of the 8th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2023), September 2023.

ISBN (Electronic): 978-952-03-3171-9


Abstract

The availability of audio-visual datasets and the increase in computational resources have made possible the use of deep learning techniques that exploit the relationship between audio and video. In this paper, we present an approach that makes use of pretrained models for object detection to label audio clips based on objects that are expected to make sound. The study consists of performing object detection for four target classes belonging to the vehicle category and training sound classifiers in a supervised way using the resulting labels. We conclude that object detection is a useful alternative for labeling audio-visual material for audio classification, with substantial improvements across different datasets. Results show that even for data provided with reference audio labels, labeling through video object detection can identify additional, non-annotated acoustic events, thus improving the quality of the labels in existing datasets. This promotes the exploitation of video content not only as an alternative, but also as a complement to the available label information.
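As a rough illustration of this kind of labeling pipeline, the sketch below uses an off-the-shelf COCO-pretrained detector from torchvision and assigns a clip-level label whenever a vehicle class is detected in any sampled video frame. The class mapping, score threshold, any-frame aggregation rule, and frame filenames are illustrative assumptions, not the paper's exact configuration.

```python
import torch
import torchvision
from torchvision.transforms.functional import to_tensor
from PIL import Image

# COCO category ids of four vehicle classes (illustrative choice of target classes).
VEHICLES = {3: "car", 4: "motorcycle", 6: "bus", 8: "truck"}

detector = torchvision.models.detection.fasterrcnn_resnet50_fpn(weights="DEFAULT").eval()

def clip_labels_from_frames(frame_paths, score_threshold=0.7):
    """Aggregate per-frame detections into clip-level audio labels:
    a class is assigned to the clip if it is detected in any sampled frame."""
    labels = set()
    with torch.no_grad():
        for path in frame_paths:
            image = to_tensor(Image.open(path).convert("RGB"))
            detections = detector([image])[0]
            for label, score in zip(detections["labels"], detections["scores"]):
                if score >= score_threshold and int(label) in VEHICLES:
                    labels.add(VEHICLES[int(label)])
    return sorted(labels)

# Hypothetical usage on frames extracted from one video clip:
# clip_labels_from_frames(["frame_000.jpg", "frame_010.jpg"])  -> e.g. ["bus", "car"]
```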

PDF
Abstract

Few-shot learning is a machine learning approach in which a pre-trained model is re-trained for new categories with just a few examples. This strategy is very convenient for problems with a dynamic number of categories, as typically happens with acoustic data. The purpose of this paper is to explore the possibility of skipping this pre-training process and using as training data only the first five shots of an audio file together with the silence between them. For the experimental evaluation, data belonging to the Validation set of DCASE 2023 Challenge Task 5 is used, purposely neglecting the Training set. This challenge consists of detecting animal species using only five positive examples. In this exploratory work, three learning methods have been compared: a ResNet architecture with a prototypical loss, a ProtoNet and an XGBoost classifier. In all cases, spectrograms with different transformations are used as inputs. The obtained results are evaluated per audio file, enabling specific conclusions to be drawn about different animal species. While the detection of some species presents encouraging results using only these first five shots as training data, all the tested algorithms are unable to successfully learn how to properly detect the blackbird sounds of the validation dataset.
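For reference, the nearest-prototype classification used by ProtoNet-style methods can be sketched as follows; the embeddings here are random placeholders standing in for spectrogram features of the five shots and the silence between them.

```python
import numpy as np

def prototype_classify(support_emb, support_labels, query_emb):
    """Nearest-prototype classification: each class prototype is the mean of its
    support embeddings; queries are assigned to the closest prototype (Euclidean)."""
    classes = np.unique(support_labels)
    prototypes = np.stack([support_emb[support_labels == c].mean(axis=0) for c in classes])
    dists = np.linalg.norm(query_emb[:, None, :] - prototypes[None, :, :], axis=-1)
    return classes[dists.argmin(axis=1)]

# Toy usage: 5 positive shots and 5 negative (silence) embeddings, 32-d placeholder features.
rng = np.random.default_rng(0)
support = rng.normal(size=(10, 32))
labels = np.array([1] * 5 + [0] * 5)
queries = rng.normal(size=(4, 32))
print(prototype_classify(support, labels, queries))
```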

PDF
Abstract

The Conformer architecture has achieved state-of-the-art results in several tasks, including automatic speech recognition and automatic speaker verification. However, its utilization in sound event detection, and in particular in the DCASE Challenge Task 4, has been limited despite a Conformer-based system winning the 2020 edition. Although the Conformer architecture may not excel in accurately localizing sound events, it shows promising potential in minimizing confusion between different classes. Therefore, in this paper we propose a Conformer optimization to enhance the second Polyphonic Sound Detection Score (PSDS) scenario defined for the DCASE 2023 Task 4A. With the aim of maximizing its classification properties, we have employed recently proposed methods such as Frequency Dynamic Convolutions in addition to our multi-resolution approach, which allows us to analyse its behaviour over different time-frequency resolution points. Furthermore, our Conformer systems are compared with multi-resolution models based on Convolutional Recurrent Neural Networks (CRNNs) to evaluate the respective benefits of each architecture in relation to the two proposed PSDS scenarios and the different time-frequency resolution points defined. These systems were submitted as our participation in the DCASE 2023 Task 4A, in which our Conformer system obtained a PSDS2 value of 0.728, achieving one of the highest scores for this scenario among systems trained without external resources.

PDF
Abstract

The addition of Foley sound effects during post-production is a common technique used to enhance the perceived acoustic properties of multimedia content. Traditionally, Foley sound has been produced by human Foley artists, which involves manual recording and mixing of sound. However, recent advances in sound synthesis and generative models have generated interest in machine-assisted or automatic Foley synthesis techniques. To promote further research in this area, we have organized a challenge in DCASE 2023: Task 7 - Foley Sound Synthesis. Our challenge aims to provide a standardized evaluation framework that is both rigorous and efficient, allowing for the evaluation of different Foley synthesis systems. We received 17 submissions, and performed both objective and subjective evaluation to rank them according to three criteria: audio quality, fit-to-category, and diversity. Through this challenge, we hope to encourage active participation from the research community and advance the state-of-the-art in automatic Foley synthesis. In this paper, we provide a detailed overview of the Foley sound synthesis challenge, including task definition, dataset, baseline, evaluation scheme and criteria, challenge result, and discussion.

PDF
Abstract

Automated acoustic understanding, e.g., sound event detection and acoustic scene recognition, is an important research direction enabling numerous modern technologies. Although there is a wealth of corpora, most, if not all, include acoustic samples of scenes/events in isolation, without considering their inter-connectivity with nearby locations in a neighborhood. Within a connected neighborhood, the temporal continuity and regional limitation (sound-location dependency) at distinct locations create non-iid acoustic samples at each site across the spatial-temporal dimensions. To the best of our knowledge, none of the previous data sources takes on this particular angle. In this work, we present a novel dataset, the Spatio-temporally Linked Neighborhood Urban Sound (STeLiN-US) database. The dataset is semi-synthesized, that is, each sample is generated by leveraging diverse sets of real urban sounds together with crawled information on real-world user behaviors over time. This method helps create a realistic large-scale dataset, which we further evaluate through a perceptual listening test. This neighborhood-based data generation opens up novel opportunities to advance user-centered applications with automated acoustic understanding. For example, to develop real-world technology that models a user's speech data over a day, one can imagine utilizing this dataset, as the user's speech samples would be modulated by diverse surrounding acoustic sources linked across sites and, temporally, by natural behavior dynamics at each location over time.

PDF
Abstract

The creation of sound effects, such as foley sounds, for radio or film has traditionally relied on the expertise of skilled professionals. However, synthesizing these sounds automatically without expert intervention presents a significant challenge, which is further compounded when the available data is limited. This often leads to a lack of diversity in the generated data. In this paper, we propose effective GAN frameworks, O2C-GAN and OC-SupConGAN, for foley sound synthesis in this situation. The proposed frameworks use a new learning method, oneself-conditioned contrastive learning (OCC learning), to solve problems encountered with small datasets. OCC learning is a method that aims to expand the diversity of data while preserving the inherent attributes of each class within the data. Experiments show that the proposed framework outperforms baseline schemes, ranking 2nd in DCASE2023-T7 Track B with an FAD score of 5.023 on the evaluation set.

PDF
Abstract

We present the task description of the Detection and Classification of Acoustic Scenes and Events (DCASE) 2023 Challenge Task 2: “First-shot unsupervised anomalous sound detection (ASD) for machine condition monitoring”. The main goal is to enable rapid deployment of ASD systems for new kinds of machines without the need for hyperparameter tuning. In past ASD tasks, developed methods tuned hyperparameters for each machine type, as the development and evaluation datasets had the same machine types. However, collecting normal and anomalous data as the development dataset can be infeasible in practice. In 2023 Task 2, we focus on solving the first-shot problem, which is the challenge of training a model on a completely novel machine type. Specifically, (i) each machine type has only one section (a subset of a machine type) and (ii) the machine types in the development and evaluation datasets are completely different. Analysis of 86 submissions from 23 teams revealed that the keys to outperforming the baselines were: 1) sampling techniques for dealing with class imbalances across different domains and attributes, 2) generation of synthetic samples for robust detection, and 3) use of multiple large pre-trained models to extract meaningful embeddings for the anomaly detector.
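In the spirit of the simple-autoencoder baselines used in this task series, unsupervised anomaly scoring can be sketched as the reconstruction error of an autoencoder trained on normal frames only; the layer sizes and the 640-dimensional stacked-frame input below are illustrative assumptions, not the official baseline configuration.

```python
import torch
import torch.nn as nn

class FrameAutoencoder(nn.Module):
    """Autoencoder over stacked log-mel frames; trained on normal data only,
    the reconstruction error is then used as the anomaly score."""
    def __init__(self, n_input=640, hidden=128, bottleneck=8):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(n_input, hidden), nn.ReLU(),
                                     nn.Linear(hidden, bottleneck), nn.ReLU())
        self.decoder = nn.Sequential(nn.Linear(bottleneck, hidden), nn.ReLU(),
                                     nn.Linear(hidden, n_input))

    def forward(self, x):
        return self.decoder(self.encoder(x))

def anomaly_score(model, frames):
    """Mean squared reconstruction error over a clip's frames; higher = more anomalous."""
    with torch.no_grad():
        return ((model(frames) - frames) ** 2).mean().item()

model = FrameAutoencoder()
print(anomaly_score(model, torch.randn(32, 640)))   # 32 stacked-frame vectors from one clip
```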

PDF
Abstract

Due to the high variation in the application requirements of sound event detection (SED) systems, it is not sufficient to evaluate systems only in a single operating mode. Therefore, the community recently adopted the polyphonic sound detection score (PSDS) as an evaluation metric, which is the normalized area under the PSD receiver operating characteristic (PSD-ROC). It summarizes the system performance over a range of operating modes resulting from varying the decision threshold that is used to translate the system output scores into a binary detection output. Hence, it provides a more complete picture of the overall system behavior and is less biased by specific threshold tuning. However, besides the decision threshold there is also the post-processing that can be changed to enter another operating mode. In this paper we propose the post-processing independent PSDS (piPSDS) as a generalization of the PSDS. Here, the post-processing independent PSD-ROC includes operating points from varying post-processings with varying decision thresholds. Thus, it summarizes even more operating modes of an SED system and allows for system comparison without the need to implement a post-processing and without a bias due to different post-processings. While piPSDS can in principle combine different types of post-processing, here, as a first step, we present median filter independent PSDS (miPSDS) results for this year's DCASE Challenge Task 4a systems. Source code is publicly available in our sed_scores_eval package (https://github.com/fgnt/sed_scores_eval).
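The idea of pooling operating points over both thresholds and post-processings can be illustrated with a much-simplified, frame-level sketch: sweep decision thresholds and median-filter lengths, take the upper envelope of the resulting (FPR, TPR) points, and integrate. The real PSDS and piPSDS operate on event-level intersection criteria with additional penalties and e-FPR normalization, so this is only a conceptual approximation.

```python
import numpy as np
from scipy.ndimage import median_filter

def operating_point(scores, reference, threshold, filter_length):
    """Frame-level TPR/FPR for one class after thresholding and median filtering."""
    decisions = median_filter((scores >= threshold).astype(float), size=filter_length) > 0.5
    tp = np.sum(decisions & (reference == 1))
    fp = np.sum(decisions & (reference == 0))
    tpr = tp / max(np.sum(reference == 1), 1)
    fpr = fp / max(np.sum(reference == 0), 1)
    return fpr, tpr

def post_processing_independent_auc(scores, reference, thresholds, filter_lengths):
    """Area under the upper envelope of operating points pooled over all
    (threshold, median-filter length) combinations."""
    points = sorted(operating_point(scores, reference, t, m)
                    for t in thresholds for m in filter_lengths)
    env_fpr, env_tpr, best = [0.0], [0.0], 0.0
    for fpr, tpr in points + [(1.0, points[-1][1])]:
        best = max(best, tpr)
        env_fpr.append(fpr)
        env_tpr.append(best)
    return np.trapz(env_tpr, env_fpr)

scores = np.random.rand(10000)               # toy frame-level scores for one class
reference = np.random.rand(10000) > 0.8      # toy binary frame-level references
print(post_processing_independent_auc(scores, reference,
                                       thresholds=np.linspace(0.05, 0.95, 19),
                                       filter_lengths=[1, 5, 11, 21]))
```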

PDF
Abstract

This paper introduces the newly recorded ToyADMOS dataset for the DCASE 2023 Challenge Task 2, First-shot anomalous sound detection for machine condition monitoring (DCASE2023T2). New machine types, such as ToyDrone, ToyNscale, Vacuum, and ToyTank, were recorded as part of the Additional training and Evaluation datasets. This paper also shows benchmark results of the First-shot baseline implementation (with simple autoencoder and selective Mahalanobis modes) on the DCASE2023T2 Evaluation dataset and the previous DCASE Challenge Task 2 datasets from 2020, 2021, and 2022, compared with the baselines of those years.

PDF
Abstract

Classification systems are normally trained by minimizing the cross-entropy between system outputs and reference labels, which makes the Kullback-Leibler divergence a natural choice for measuring how closely the system can follow the data. Precision and recall provide another perspective for measuring the performance of a classification system. Non-binary references can arise from various sources, and it is often beneficial to use the soft labels for training instead of the binarized data. However, the existing definitions for precision and recall require binary reference labels, and binarizing the data can cause erroneous interpretations. We present a novel method to calculate precision, recall and F-score without quantizing the data. The proposed metrics extend the well-established metrics, as the definitions coincide when used with binary labels. To understand the behavior of the metrics, we show simple example cases and an evaluation of different sound event detection models trained on real data with soft labels.
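One natural generalization that coincides with the binary definitions is to count per-item true positives as the element-wise minimum of prediction and reference; the exact formulation in the paper may differ, so treat the sketch below as illustrative only.

```python
import numpy as np

def soft_precision_recall_f1(pred, ref, eps=1e-12):
    """A min-based generalization of precision/recall/F-score to soft labels.
    With binary inputs it reduces to the usual definitions."""
    tp = np.minimum(pred, ref).sum()
    precision = tp / (pred.sum() + eps)
    recall = tp / (ref.sum() + eps)
    f1 = 2 * precision * recall / (precision + recall + eps)
    return precision, recall, f1

# Binary inputs: coincides with the standard counts (TP=2, FP=1, FN=1).
print(soft_precision_recall_f1(np.array([1, 0, 1, 1]), np.array([1, 1, 0, 1])))
# Soft references: metrics computed without binarisation.
print(soft_precision_recall_f1(np.array([0.9, 0.2, 0.7]), np.array([1.0, 0.4, 0.5])))
```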

PDF
Abstract

For learning-based sound event localization and detection (SELD) methods, different acoustic environments in the training and test sets may result in large performance differences in the validation and evaluation stages. Different environments, such as different room sizes, different reverberation times, and different background noise, may cause a learning-based system to fail. On the other hand, acquiring annotated spatial sound event samples, which include onset and offset time stamps, sound event class types, and the direction-of-arrival (DOA) of sound sources, is very expensive. In addition, deploying a SELD system in a new environment often poses challenges due to time-consuming training and fine-tuning processes. To address these issues, we propose Meta-SELD, which applies meta-learning methods to achieve fast adaptation to new environments. More specifically, based on Model-Agnostic Meta-Learning (MAML), the proposed Meta-SELD aims at finding good meta-initialized parameters that adapt to new environments with only a small number of samples and parameter-updating iterations. We can then adapt the meta-trained SELD model to unseen environments quickly. Our experiments compare fine-tuning methods from pre-trained SELD models with our Meta-SELD on the Sony-TAU Realistic Spatial Soundscapes 2023 (STARSS23) dataset. The evaluation results demonstrate the effectiveness of Meta-SELD in adapting to new environments.
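The MAML-style inner/outer loop at the core of such meta-training can be sketched as follows, here with a toy regressor and randomly generated support/query sets standing in for SELD features and targets from different acoustic environments; sizes and learning rates are placeholders.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Toy regressor standing in for a SELD model; 64-d features and 13-d targets are placeholders.
model = nn.Sequential(nn.Linear(64, 32), nn.ReLU(), nn.Linear(32, 13))
meta_optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
inner_lr = 0.01

def functional_forward(x, params):
    # Same computation as `model`, but with explicitly supplied parameters.
    x = F.relu(F.linear(x, params[0], params[1]))
    return F.linear(x, params[2], params[3])

for meta_step in range(5):
    meta_optimizer.zero_grad()
    meta_loss = 0.0
    for _ in range(4):                                   # 4 simulated "environments" per meta-batch
        x_support, y_support = torch.randn(8, 64), torch.randn(8, 13)
        x_query, y_query = torch.randn(8, 64), torch.randn(8, 13)
        params = list(model.parameters())
        # Inner adaptation step on the support set (graph kept for the outer update).
        inner_loss = F.mse_loss(functional_forward(x_support, params), y_support)
        grads = torch.autograd.grad(inner_loss, params, create_graph=True)
        adapted = [p - inner_lr * g for p, g in zip(params, grads)]
        # Outer loss evaluated on the query set with the adapted parameters.
        meta_loss = meta_loss + F.mse_loss(functional_forward(x_query, adapted), y_query)
    meta_loss.backward()                                 # backpropagate through the inner step
    meta_optimizer.step()
```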

PDF
Abstract

As deeper and more complex models are developed for the task of sound event localization and detection (SELD), the demand for annotated spatial audio data continues to increase. Annotating field recordings with 360° video takes many hours from trained annotators, while recording events within motion-tracked laboratories is bounded by cost and expertise. Because of this, localization models rely on a relatively limited amount of spatial audio data in the form of spatial room impulse response (SRIR) datasets, which limits the progress of increasingly deep neural network based approaches. In this work, we demonstrate that simulated geometrical acoustics can provide an appealing solution to this problem. We use simulated geometrical acoustics to generate a novel SRIR dataset that can train a SELD model to provide similar performance to that of a real SRIR dataset. Furthermore, we demonstrate using simulated data to augment existing datasets, improving on benchmarks set by state-of-the-art SELD models. We explore the potential and limitations of geometric acoustic simulation for localization and event detection. We also propose further studies to verify the limitations of this method, as well as further methods to generate synthetic data for SELD tasks without the need to record more data.

PDF
Abstract

Nowadays, computerised Sound Event Classification (SEC) aids in several applications, e.g. monitoring domestic events in smart homes. SEC model development typically requires data collected from a diverse set of remote locations. However, this data could disclose sensitive information about uttered speech that might have been present during the acquisition. This makes automated data acquisition at remote locations difficult in practice. In this work, three data preprocessing techniques are investigated that obstruct the recognition of semantics in speech, but retain the information required for annotating sound events and developing SEC models. At the remote location, the data are first preprocessed before being transferred to a central place. At the central location, speech should no longer be interpretable, while it should still be possible to annotate the data with relevant sound event labels. For this purpose, starting from a log-mel representation of the sound signals, three speech obfuscation techniques are assessed: 1) calculating a moving average of the log-mel spectra, 2) sampling a few of the most energetic log-mel spectra and 3) shredding the log-mel spectra. Both intelligibility and SEC experiments were carried out.

PDF
Abstract

This paper introduces FALL-E, a Foley synthesis system, and its training/inference strategies. The FALL-E model employs a cascaded approach comprising low-resolution spectrogram generation, spectrogram super-resolution, and a vocoder. We trained every sound-related model from scratch using our extensive datasets, and utilized a pre-trained language model. We conditioned the model with dataset-specific texts, enabling it to learn sound quality and recording environment based on text input. Moreover, we leveraged external language models to improve the text descriptions of our datasets and performed prompt engineering for quality, coherence, and diversity. FALL-E was evaluated by an objective measure as well as listening tests in the DCASE 2023 Challenge Task 7. The submission achieved second place on average, with the best score for diversity, second place for audio quality, and third place for class fitness.

PDF
Abstract

This paper proposes a convolutional recurrent neural network (CRNN)-based sound event detection (SED) model. The proposed model utilizes frequency dynamic convolution (FDY) with large kernel attention (LKA) for the convolution operations within the CRNN. This is designed to effectively capture time-frequency patterns and long-term dependencies for non-stationary audio events. In addition, we concatenate a pre-trained bidirectional encoder representation from audio transformers (BEATs) embedding with the output of FDY–LKA, providing the FDY-based feature maps with semantic information. Given the limited labeled data of the DCASE Challenge dataset, we first employ mean-teacher-based semi-supervised learning. Then, we propose label-filtering-based self-learning, which selects audio event data whose pseudo labels predicted by the mean-teacher model are strongly correlated with the given weak labels. This strategy is applied to the weakly labeled and unlabeled data, and then extended to AudioSet. We evaluate the performance of the proposed SED model on DCASE 2023 Challenge Task 4A, measuring the F1-score and the polyphonic sound detection scores PSDS1 and PSDS2. The results indicate that the proposed CRNN-based model with FDY–LKA improves the F1-score, PSDS1, and PSDS2 in comparison to the baseline for DCASE 2023 Challenge Task 4A. When we apply the BEATs embedding via average pooling to both the baseline and the proposed model, the proposed model significantly outperforms the baseline, by 6.2% in F1-score, 0.055 in PSDS1, and 0.021 in PSDS2. Consequently, our model ranked first in the DCASE 2023 Challenge Task 4A evaluation for the single-model track, and second for the ensemble-model track.

PDF
Abstract

Automated audio captioning is the task of generating descriptions for audio clips. Model development typically consists of a pre-training and a fine-tuning process, with additional use of reinforcement learning techniques. While reinforcement learning enhances the evaluation metrics for captions, it has the drawback of potentially lowering the quality of the captions, such as incomplete sentence structures or repetitive words. In this study, we propose an ensemble selection technique that combines models before and after reinforcement learning to improve the evaluation metrics while maintaining caption quality. Furthermore, we apply several data augmentation techniques to complement the characteristics of WavCaps, which predominantly consists of single events, and enhance the model's generalization abilities. In particular, the proposed approaches reach impressive scores on both the existing SPIDEr metric and the new fluency metric SPIDEr-FL, 0.344 and 0.315, respectively. This resulted in a 2nd place ranking in DCASE 2023 Task 6a, while the baseline system achieved a SPIDEr of 0.271 and a SPIDEr-FL of 0.264.

PDF
Abstract

In recent years, datasets of paired audio and captions have enabled remarkable success in automatically generating descriptions for audio clips, namely Automated Audio Captioning (AAC). However, it is labor-intensive and time-consuming to collect a sufficient number of paired audio clips and captions. Motivated by recent advances in Contrastive Language-Audio Pretraining (CLAP), we propose a weakly-supervised approach to train an AAC model assuming only text data and a pre-trained CLAP model, alleviating the need for paired target data. Our approach leverages the similarity between audio and text embeddings in CLAP. During training, we learn to reconstruct the text from the CLAP text embedding, and during inference, we decode using the audio embeddings. To mitigate the modality gap between the audio and text embeddings, we employ strategies to bridge the gap during the training and inference stages. We evaluate our proposed method on the Clotho and AudioCaps datasets, demonstrating its ability to achieve up to ~83% of the performance attained by fully supervised approaches trained on paired target data.

PDF
Abstract

Automated Audio Captioning (AAC) aims to develop systems capable of describing an audio recording using a textual sentence. In contrast, Audio-Text Retrieval (ATR) systems seek to find the best matching audio recording(s) for a given textual query (Text-to-Audio) or vice versa (Audio-to-Text). These tasks require different types of systems: AAC employs a sequence-to-sequence model, while ATR utilizes a ranking model that compares audio and text representations within a shared projection subspace. However, this work investigates the relationship between AAC and ATR by exploring the ATR capabilities of an unmodified AAC system, without fine-tuning for the new task. Our AAC system consists of an audio encoder (ConvNeXt-Tiny) trained on AudioSet for audio tagging, and a transformer decoder responsible for generating sentences. For AAC, it achieves a high SPIDEr-FL score of 0.298 on Clotho and 0.472 on AudioCaps on average. For ATR, we propose using the standard Cross-Entropy loss values obtained for any audio/caption pair. Experimental results on the Clotho and AudioCaps datasets demonstrate decent recall values using this simple approach. For instance, we obtained a Text-to-Audio R@1 value of 0.382 for AudioCaps, which is above the current state-of-the-art method without external data. Interestingly, we observe that normalizing the loss values was necessary for Audio-to-Text retrieval.

PDF
Abstract

Sound event detection involves the identification and temporal localization of sound events within audio recordings. Bioacoustic sound event detection specifically targets animal vocalizations, which necessitate substantial time and resources for manual annotation of temporal boundaries. This paper aims to address the challenges associated with bioacoustic sound event detection by proposing a novel prototypical learning framework. Our approach fuses contrastive learning and prototypical learning to make the utmost use of the limited amount of data. Further, our framework leverages a fine-tuning strategy with a novel loss function to achieve robustness. Experimental results on a benchmark dataset demonstrate the effectiveness of our proposed method in accurately detecting and localizing bioacoustic sound events, improving the F1 score from 29.59% to 83.08%.

PDF
Abstract

Respiratory sounds are significant indicators of respiratory health and conditions. Classifying the respiratory sounds of patients can assist doctors in the diagnosis of lung diseases. For this purpose, many deep learning-based automatic analysis methods have been developed. However, the task remains challenging due to the limited medical sound datasets. In this study, we apply a pre-trained Vision Transformer (ViT) based model from the Masked Modeling Duo (M2D) framework to this task. While the M2D ViT pre-trained model provides effective features, we believe that combining features from different layers can improve performance on this task. We propose a multi-layer feature fusion method using learnable layer-wise weights and validate its effectiveness in experiments and an analysis of pre-trained model layers. Our approach achieves the best ICBHI score of 60.68, 2.39 higher than the previous state-of-the-art method.

PDF
Abstract

The prediction of a sound event detection (SED) system may be represented on a timeline by intervals whose bounds correspond to onset and offset respectively. In this context, SED evaluation requires finding all non-empty intersections between predicted and reference intervals. Denoting by M and N the number of predicted events and reference events, the time complexity of exhaustive search is O(MN). This is particularly inefficient when the acoustic scene of interest contains many events (typically above 10^3) or when the detection threshold is low. Our article presents an algorithm for pairwise intersection of intervals by performing binary search within sorted onset and offset times. Computational benchmarks on the BirdVox-full-night dataset confirm that our algorithm is significantly faster than exhaustive search. Moreover, we explain how to use this list of intersecting prediction-reference pairs for the purpose of SED evaluation: the Hopcroft-Karp algorithm guarantees an optimal bipartite matching in time O((M+N)^{3/2}) in the best case (all events are pairwise disjoint) and O((M+N)^{5/2}) in the worst case (all events overlap with each other). The solution found by Hopcroft-Karp unambiguously defines a number of true positives, false positives, and false negatives; and ultimately, information retrieval metrics such as precision, recall, and F-score.
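A simplified sketch of the two ingredients is given below: binary search over sorted reference onsets and offsets to collect intersecting prediction-reference pairs, followed by Hopcroft-Karp matching (here via networkx) to derive counts and an F-score. The bookkeeping differs from the article's exact algorithm; this only illustrates the idea.

```python
from bisect import bisect_left, bisect_right
import networkx as nx
from networkx.algorithms.bipartite import hopcroft_karp_matching

def intersecting_pairs(pred, ref):
    """Indices (i, j) of predicted/reference intervals with non-empty intersection,
    found via binary search in the sorted reference onsets and offsets."""
    onsets = sorted((s, j) for j, (s, e) in enumerate(ref))
    offsets = sorted((e, j) for j, (s, e) in enumerate(ref))
    onset_values = [s for s, _ in onsets]
    offset_values = [e for e, _ in offsets]
    pairs = []
    for i, (a, b) in enumerate(pred):
        started_before_b = {j for _, j in onsets[:bisect_left(onset_values, b)]}
        ended_before_a = {j for _, j in offsets[:bisect_right(offset_values, a)]}
        pairs.extend((i, j) for j in started_before_b - ended_before_a)
    return pairs

def event_f_score(pred, ref):
    """One-to-one matching of intersecting pairs via Hopcroft-Karp, then precision/recall/F."""
    graph = nx.Graph()
    graph.add_nodes_from(("p", i) for i in range(len(pred)))
    graph.add_nodes_from(("r", j) for j in range(len(ref)))
    graph.add_edges_from((("p", i), ("r", j)) for i, j in intersecting_pairs(pred, ref))
    matching = hopcroft_karp_matching(graph, top_nodes={("p", i) for i in range(len(pred))})
    tp = len(matching) // 2                 # matching dict contains both directions
    precision = tp / max(len(pred), 1)
    recall = tp / max(len(ref), 1)
    return 2 * precision * recall / max(precision + recall, 1e-12)

print(event_f_score(pred=[(0.0, 1.0), (2.0, 3.0)], ref=[(0.5, 1.5), (4.0, 5.0)]))  # 0.5
```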

PDF
Abstract

While the saying goes that two heads are better than one, having multiple opinions brings the problem of finding common ground. For data, multiple annotator opinions are usually aggregated into a single set of labels, regarded as the ground truth. With this ground truth, classification models can be trained in a supervised way to learn the annotated data categories. Finding a suitable aggregation for multiple annotator opinions is a topic of research in many domains. In this work we investigate the use of raw data obtained from multiple annotators with various levels of reliability to train a model for audio classification. The model sees all the individual annotator opinions and learns the categories without the need to aggregate the information. The results show that, using a fully-connected layer that models individual annotators, it is possible to leverage the data distribution and learn to classify sounds without label aggregation.
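A minimal sketch of such a model is a shared encoder with one linear head per annotator, trained directly on each annotator's labels and queried through an aggregated head at inference time; the layer sizes and the aggregation head below are illustrative assumptions rather than the paper's exact architecture.

```python
import torch
import torch.nn as nn

class MultiAnnotatorClassifier(nn.Module):
    """Shared encoder with one linear head per annotator; the shared head is used
    for aggregated predictions at inference time (all sizes are placeholders)."""
    def __init__(self, n_features=64, n_classes=10, n_annotators=5, hidden=128):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(n_features, hidden), nn.ReLU())
        self.annotator_heads = nn.ModuleList(
            [nn.Linear(hidden, n_classes) for _ in range(n_annotators)])
        self.shared_head = nn.Linear(hidden, n_classes)

    def forward(self, x, annotator=None):
        h = self.encoder(x)
        if annotator is None:                      # inference: aggregated prediction
            return self.shared_head(h)
        return self.annotator_heads[annotator](h)  # training: annotator-specific prediction

model = MultiAnnotatorClassifier()
x = torch.randn(4, 64)
loss = nn.functional.cross_entropy(model(x, annotator=2),      # labels given by annotator 2
                                   torch.randint(0, 10, (4,)))
```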

PDF
Abstract

An important aspect of machine monitoring, specifically of bearings in rotating machinery, is being able to provide an accurate Sound Event Classification (SEC) model for fault detection. Various approaches have shown good results. However, many of these approaches are based on deep learning (DL), which requires large amounts of data. In real-life cases acquiring vibration data from bearings can be difficult, since making direct contact with the bearing is not trivial. When using a microphone, which does not need direct contact, data capture becomes easier. However, supervising the learning process requires data annotation, which is a time- and cost-intensive process. Active learning (AL) methods have been developed to reduce this time by annotating only the examples that would be of most use during the learning process. In this work a novel dataset, containing acoustic data from accelerated lifetime tests of bearings, is used to investigate the performance of two AL methods in terms of classification accuracy and the number of additionally selected and annotated examples.

PDF
Abstract

Sound event detection (SED) is a task that automates a function of the human auditory system, which listens to and understands auditory scenes. We were therefore inspired to make SED recognize sound events the way the human auditory system does. The spectro-temporal receptive field (STRF), an approach to describing the relationship between the sound perceived at the ear and the transformed neural response in the auditory cortex, is closely related to the recognition of sound. In this work, we utilized the STRF as the kernel of the first convolutional layer in an SED model to extract neural responses from the input sound, making the SED model more similar to the human auditory system. In addition, we constructed a two-branch SED model named Two-Branch STRFNet (TB-STRFNet), composed of an STRF branch and a baseline branch. While the STRF branch extracts sound event information from the auditory neural response, the baseline branch extracts sound event information directly from the mel spectrogram, just as conventional SED models do. TB-STRFNet outperformed the DCASE baseline by 4.3% in terms of threshold-independent macro F1 score, achieving 4th rank in DCASE Challenge 2023 Task 4b. We further improved TB-STRFNet by applying frequency dynamic convolution (FDY-Conv), which also leverages domain knowledge on acoustics. As a result, the two-branch model with FDY-Conv applied to both branches outperformed the DCASE baseline by 6.2% in terms of the same metric.

PDF
Abstract

Knowledge Distillation (KD) is a widespread technique for compressing the knowledge of large models into more compact and efficient models. KD has proved to be highly effective in building well-performing low-complexity Acoustic Scene Classification (ASC) systems and was used in all the top-ranked submissions to this task of the annual DCASE challenge in the past three years. There is extensive research available on establishing the KD process, designing efficient student models, and forming well-performing teacher ensembles. However, less research has been conducted on investigating which teacher model attributes are beneficial for low-complexity students. In this work, we try to close this gap by studying the effects on the student's performance when using different teacher network architectures, varying the teacher model size, training them with different data augmentation methods, and applying different ensembling strategies. The results show that teacher model sizes, data augmentation methods, the ensembling strategy and the ensemble size are key factors for a well-performing student network.
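For context, the standard logit-based KD objective used in such student-teacher setups can be sketched as a temperature-softened KL term plus the usual hard-label cross-entropy; the temperature and weighting below are placeholder values, not the paper's settings.

```python
import torch
import torch.nn.functional as F

def kd_loss(student_logits, teacher_logits, targets, temperature=2.0, alpha=0.5):
    """Temperature-softened KL divergence towards the teacher plus hard-label cross-entropy."""
    soft = F.kl_div(F.log_softmax(student_logits / temperature, dim=-1),
                    F.softmax(teacher_logits / temperature, dim=-1),
                    reduction="batchmean") * temperature ** 2
    hard = F.cross_entropy(student_logits, targets)
    return alpha * soft + (1.0 - alpha) * hard

# A teacher ensemble is typically distilled by averaging the members' logits first.
student, teacher = torch.randn(8, 10), torch.randn(8, 10)
print(kd_loss(student, teacher, torch.randint(0, 10, (8,))).item())
```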

PDF
Abstract

Deep learning has been widely used recently for sound event detection and classification. Its success is linked to the availability of sufficiently large datasets, possibly with corresponding annotations when supervised learning is considered. In bioacoustic applications, most tasks come with few labelled training data, because annotating long recordings is time consuming and costly. Therefore supervised learning is not the best suited approach to solve bioacoustic tasks. The bioacoustic community recast the problem of sound event detection within the framework of few-shot learning, i.e. training a system with only a few labeled examples. The few-shot bioacoustic sound event detection task in the DCASE challenge focuses on detecting events in long audio recordings given only five annotated examples for each class of interest. In this paper, we show that learning a rich feature extractor from scratch can be achieved by leveraging data augmentation using a supervised contrastive learning framework. We highlight the ability of this framework to transfer well for five-shot event detection on classes unseen in the training data. We obtain an F-score of 63.46% on the validation set and 42.7% on the test set, ranking second in the DCASE challenge. We provide an ablation study for the critical choices of data augmentation techniques as well as for the learning strategy applied on the training set.
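The supervised contrastive objective referred to above is, in its standard form (Khosla et al., 2020), the loss sketched below; the temperature and the handling of anchors without positives are implementation choices here, not necessarily those of the paper.

```python
import torch
import torch.nn.functional as F

def supervised_contrastive_loss(embeddings, labels, temperature=0.1):
    """Supervised contrastive loss over a batch of L2-normalised embeddings
    (anchors without any positive in the batch are simply skipped)."""
    z = F.normalize(embeddings, dim=1)
    sim = z @ z.T / temperature                                   # (B, B) similarity logits
    batch = z.size(0)
    self_mask = torch.eye(batch, dtype=torch.bool, device=z.device)
    pos_mask = (labels.unsqueeze(0) == labels.unsqueeze(1)) & ~self_mask
    sim = sim.masked_fill(self_mask, float("-inf"))               # exclude self-pairs
    log_prob = sim - torch.logsumexp(sim, dim=1, keepdim=True)
    pos_log_prob = torch.where(pos_mask, log_prob, torch.zeros_like(log_prob)).sum(1)
    pos_counts = pos_mask.sum(1)
    valid = pos_counts > 0
    return -(pos_log_prob[valid] / pos_counts[valid]).mean()

# Toy usage: two augmented "views" per example share the same class label.
features = torch.randn(16, 128)
labels = torch.arange(8).repeat(2)
print(supervised_contrastive_loss(features, labels).item())
```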

PDF
Abstract

In this paper, we propose a method for incremental learning of two distinct tasks over time: acoustic scene classification (ASC) and audio tagging (AT). We use a simple convolutional neural network (CNN) model as an incremental learner to solve the tasks. Generally, incremental learning methods catastrophically forget the previous task when sequentially trained on a new task. To alleviate this problem, we propose independent learning and knowledge distillation (KD) between the timesteps in learning. Experiments are performed on TUT 2016/2017 dataset, containing 4 acoustic scene classes and 25 sound event classes. The proposed incremental learner first solves the ASC task with an accuracy of 94.0%. Next, it learns to solve the AT task with an F1 score of 54.4%. At the same time, its performance on the previous ASC task decreases only by 5.1 percentage points due to the additional learning of the AT task.

PDF
Abstract

We explore various attention methods on the frequency and channel dimensions for sound event detection (SED), in order to enhance performance with a minimal increase in computational cost while leveraging domain knowledge to address the frequency dimension of audio data. In a previous work, we introduced frequency dynamic convolution (FDY conv) to address the translational equivariance issue associated with 2D convolution on the frequency dimension of 2D audio data. Although this approach demonstrated state-of-the-art SED performance, it resulted in a model with 150% more trainable parameters. To achieve comparable SED performance with computationally efficient methods for practicality, we explore lighter alternative attention methods, focusing on attention applied to the frequency and channel dimensions. Jointly applying a squeeze-and-excitation (SE) module and time-frame frequency-wise SE (tfwSE) to apply attention on both the frequency and channel dimensions shows performance comparable to the SED model with FDY conv, with only 2.7% more trainable parameters compared to the baseline model. In addition, we performed a class-wise comparison of the various attention methods to further discuss their characteristics.
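As a reference point, a plain channel squeeze-and-excitation block for (batch, channel, frequency, time) feature maps looks like the sketch below; a tfwSE-style variant would instead pool over channels and time to produce per-frequency weights. Shapes and the reduction ratio are illustrative.

```python
import torch
import torch.nn as nn

class SqueezeExcite(nn.Module):
    """Channel squeeze-and-excitation for (batch, channels, freq, time) feature maps."""
    def __init__(self, channels, reduction=16):
        super().__init__()
        self.fc = nn.Sequential(
            nn.Linear(channels, channels // reduction), nn.ReLU(),
            nn.Linear(channels // reduction, channels), nn.Sigmoid(),
        )

    def forward(self, x):
        w = self.fc(x.mean(dim=(2, 3)))            # squeeze: global average pool
        return x * w[:, :, None, None]             # excite: channel-wise reweighting

x = torch.randn(4, 64, 128, 156)                   # toy CNN feature map
print(SqueezeExcite(64)(x).shape)
```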

PDF
Abstract

Machine learning methods, and deep networks in particular, often underperform on data which lies outside the training distribution. Changes to the data distribution (known as domain shift) are particularly prevalent in bioacoustics, where many external factors can vary between datasets, although the effects of this are often not properly considered. This paper presents a benchmark for out-of-distribution (OOD) performance based on the detection of humpback whales in underwater acoustic data. Several humpback whale detectors from the literature are implemented as baselines, along with our own detector based on a convolutional neural network (CNN). Then, a set of unsupervised domain adaptation (UDA) algorithms are compared. Results show that UDA can significantly improve OOD performance when few distinct sources of training data are available. However, this is not a substitute for better data, as negative transfer (where the adapted models actually perform worse) is commonly observed. On the other hand, we find that training on a variety of distinct sources of data (at least 6) is sufficient to allow models to generalise OOD, without the need for advanced UDA algorithms. This allows our model to outperform all the baseline detectors we test, despite having 10,000 times fewer parameters and 100,000 times less training data than the next-best model.

PDF
Abstract

Few-shot bioacoustic event detection consists of detecting sound events of specified types, in varying soundscapes, while having access to only a few examples of the class of interest. This task ran as part of the DCASE challenge for the third time this year, with an evaluation set expanded to include new animal species, and a new rule: ensemble models were no longer allowed. The 2023 few-shot task received submissions from 6 different teams, with F-scores reaching as high as 63% on the evaluation set. Here we describe the task, focusing on the elements that differed from previous years. We also look back at past editions to describe how the task has evolved. Not only have the F-score results steadily improved (40% to 60% to 63%), but the types of systems proposed have also become more complex. Sound event detection systems are no longer simple variations of the provided baselines: multiple few-shot learning methodologies are still strong contenders for the task.

PDF
Abstract

This work presents a text-to-audio-retrieval system based on pre-trained text and spectrogram transformers. Our method projects recordings and textual descriptions into a shared audio-caption space in which related examples from different modalities are close. Through a systematic analysis, we examine how each component of the system influences retrieval performance. As a result, we identify two key components that play a crucial role in driving performance: the self-attention-based audio encoder for audio embedding and the utilization of additional human-generated and synthetic data sets during pre-training. We further experimented with augmenting ClothoV2 captions with available keywords to increase their variety; however, this only led to marginal improvements. Our system ranked first in the 2023's DCASE Challenge, and it outperforms the current state of the art on the ClothoV2 benchmark by 5.6 pp. mAP@10.
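The shared audio-caption space in such retrieval systems is typically trained with a symmetric (CLIP/CLAP-style) contrastive loss over matched audio-text pairs, sketched below; the temperature and embedding dimensionality are arbitrary placeholders.

```python
import torch
import torch.nn.functional as F

def audio_text_contrastive_loss(audio_emb, text_emb, temperature=0.05):
    """Symmetric contrastive loss: the i-th audio clip and i-th caption form the positive pair."""
    a = F.normalize(audio_emb, dim=-1)
    t = F.normalize(text_emb, dim=-1)
    logits = a @ t.T / temperature                       # (batch, batch) similarity matrix
    targets = torch.arange(a.size(0), device=a.device)
    return 0.5 * (F.cross_entropy(logits, targets) + F.cross_entropy(logits.T, targets))

print(audio_text_contrastive_loss(torch.randn(16, 512), torch.randn(16, 512)).item())
```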

PDF
Abstract

We propose a competitive Foley sound synthesis system based on available components and fine-tuned on a target dataset. We reuse a text-to-audio pre-trained model composed of a latent diffusion model (LDM), trained on AudioCaps, a variational auto-encoder (VAE), and a vocoder. We fine-tune the LDM on the development dataset of the DCASE 2023 Task 7 to output a latent representation conditioned on the target class number. The VAE and vocoder are then used to generate the waveform from the latent representation. To improve the quality of the generated samples, we utilize a postprocessing filter that selects a subset of generated sounds to match a distribution of target class sounds. In experiments, we found that our system achieved an average Fréchet audio distance (FAD) of 4.744, which is significantly better than 9.702 produced by the baseline system of the DCASE 2023 Challenge Task 7. In addition, we perform ablation studies to evaluate the performance of the system before fine-tuning and the effect of sampling rate on the FAD.

PDF
Abstract

Designing lightweight models that require limited computational resources and can operate on edge devices is a major trajectory in deep learning research. In the context of Acoustic Scene Classification (ASC), the DCASE community hosts an annual challenge on low-complexity ASC, contributing to research on Knowledge Distillation (KD), Model Pruning, Quantization and efficient neural network design. In this work, we propose a system that contributes to the latter by introducing CP-Mobile, a lightweight CNN architecture constructed from residual inverted bottleneck blocks and Global Response Normalization. Furthermore, we improve Knowledge Distillation by showing that ensembling CNNs and Audio Spectrogram Transformers forms strong teacher ensembles. Our proposed system improves the results on the TAU Urban Acoustic Scenes 2022 Mobile development dataset by around 5 percentage points in accuracy compared to the top-ranked submission for Task 1 of the DCASE 22 challenge and achieves the top rank in the DCASE 23 challenge.

PDF
Abstract

Acoustic Scene Classification poses a significant challenge on the DCASE Task 1 TAU22 dataset, with a sample length of only a single second. The best-performing model in the 2023 challenge achieves an accuracy of 62.7%, with a gap to unseen devices of approximately 10%. In this study, we propose a novel approach using an Inverse Contrastive Loss to ensure a device-class-invariant latent representation and better generalization to unseen devices. We evaluate the interaction of this contrastive learning approach with impulse response augmentation and show its effectiveness in suppressing device-related information in the encoder structure. Results indicate that both contrastive learning and impulse response augmentation improve generalization to unseen devices. Furthermore, the impulse response dataset should have a balanced frequency response to be effective. Combining contrastive learning and impulse response augmentation yields embeddings with the least device-related information, but does not improve scene classification accuracy compared to augmentation alone.

PDF
Abstract

Current audio classification models have small class vocabularies relative to the large number of sound event classes of interest in the real world. Thus, they provide a limited view of the world that may miss important yet unexpected or unknown sound events. To address this issue, open-set audio classification techniques have been developed to detect sound events from unknown classes. Although these methods have been applied to a multi-class context in audio, such as sound scene classification, they have yet to be investigated for polyphonic audio in which sound events overlap, requiring the use of multi-label models. In this study, we establish the problem of multi-label open-set audio classification by creating a dataset with varying unknown class distributions and evaluating baseline approaches built upon existing techniques.

PDF
Abstract

Slow or fast third-octave band representations (with a frame every 1 s and 125 ms, respectively) have been a de facto standard for urban acoustics, used for example in long-term monitoring applications. They have the advantages of requiring little storage and of preserving privacy. As most audio classification algorithms take Mel spectral representations with very fast time weighting (e.g. 10 ms) as input, very few studies have tackled classification tasks using other kinds of spectral representations of audio, such as slow or fast third-octave spectra. In this paper, we present a convolutional neural network architecture for transcoding fast third-octave spectrograms into Mel spectrograms, so that they can be used as input to robust pre-trained models such as YAMNet or PANN. Compared to training a model that would take fast third-octave spectrograms as input, this approach is more effective and requires less training effort. Even though a fast third-octave spectrogram is less precise in both the time and frequency dimensions, experiments show that the proposed method still allows for a classification accuracy of 62.4% on UrbanSound8k and 0.44 macro AUPRC on SONYC-UST.
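A minimal sketch of such a transcoder is a small 1D CNN over the band dimension with a transposed convolution for temporal upsampling from the 125 ms third-octave frame rate towards a Mel frame rate; the band counts, channel widths, and the 8x upsampling factor below are illustrative assumptions, not the paper's exact configuration.

```python
import torch
import torch.nn as nn

class ThirdOctaveToMel(nn.Module):
    """Map fast third-octave frames (n_thirds bands) to Mel frames (n_mels bands)
    at a higher frame rate via learned temporal upsampling."""
    def __init__(self, n_thirds=29, n_mels=64, upsample=8):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv1d(n_thirds, 128, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.ConvTranspose1d(128, 128, kernel_size=upsample, stride=upsample),
            nn.ReLU(),
            nn.Conv1d(128, n_mels, kernel_size=3, padding=1),
        )

    def forward(self, x):          # x: (batch, n_thirds, slow_frames)
        return self.net(x)         # (batch, n_mels, slow_frames * upsample)

# Trained with an L1/MSE loss against Mel spectrograms of the same audio, then fed to a
# frozen pretrained tagger such as PANNs or YAMNet.
model = ThirdOctaveToMel()
print(model(torch.randn(2, 29, 80)).shape)   # -> torch.Size([2, 64, 640])
```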

PDF
Abstract

We propose Audio Difference Captioning (ADC) as a new extension of audio captioning for describing the semantic differences between input pairs of similar but slightly different audio clips. ADC addresses the problem that conventional audio captioning sometimes generates similar captions for similar audio clips, failing to describe the difference in content. We also propose a cross-attention-concentrated transformer encoder to extract differences by comparing a pair of audio clips, and a similarity-discrepancy disentanglement to emphasize the difference in the latent space. To evaluate the proposed methods, we built the AudioDiffCaps dataset, consisting of pairs of similar but slightly different audio clips with human-annotated descriptions of their differences. Experiments with the AudioDiffCaps dataset showed that the proposed methods solve the ADC task effectively and improve the attention weights for extracting the difference, as visualized in the transformer encoder.

PDF
Abstract

Attention mechanisms have been widely used in a variety of sound event detection (SED) tasks, owing to their ability to build interdependencies among channels or spatial locations. The existing state-of-the-art (SOTA) architectures and attention modules incorporated in SED have a high computational cost in terms of the number of parameters. To address this issue, we propose a lightweight module utilizing triplet attention on an inverted residual network (IRN), referred to as an inverted residual triplet attention module (IRTAM), to replace the standard 2D convolutional neural network. The IRTAM captures cross-dimensional interdependencies using a rotation operation followed by residual transformations within a three-branch structure embedded in the IRN. On the DCASE 2022 Task 4 validation set, the proposed lightweight module improves the performance of the baseline by 34.1% in terms of polyphonic sound detection score and achieves SOTA results with only 27.6% of the baseline's parameters.

PDF
Abstract

The learning of sound events often depends on data that is manually labeled by human annotators. In this study, we explore the use of soft labels for sound event detection (SED), which takes into account the uncertainty and variability in human annotations. To address the challenges posed by uncertain or noisy labels, we propose a weighted soft label (WSL) loss function. This loss function effectively emphasizes reliable annotations while mitigating the influence of less confident or noisy labels. Additionally, we introduce auxiliary tasks into a multi-task learning (MTL) framework, which helps to leverage the shared information between the tasks and improves the overall performance of the model. Furthermore, we explore the usage of pretrained models and various front-end feature extraction methods. Experimental results on the MAESTRO-Real dataset introduced in the DCASE 2023 Task 4B demonstrate a significant improvement of 14.9% in the macro-average F1 score with optimum threshold per class compared to the challenge baseline model on the validation set, highlighting the effectiveness of our proposed system.
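One simple way to realize such a weighted soft-label objective is to weight a soft-target binary cross-entropy by annotator agreement, giving frames whose labels lie near 0 or 1 more influence than ambiguous frames near 0.5; the exact weighting used in the paper may differ, so the sketch below is illustrative.

```python
import torch

def weighted_soft_bce(pred, soft_target, eps=1e-7):
    """Soft-label BCE weighted by label confidence: weight is 0 at 0.5 and 1 at 0 or 1."""
    weight = (soft_target - 0.5).abs() * 2.0
    bce = -(soft_target * torch.log(pred.clamp(min=eps))
            + (1.0 - soft_target) * torch.log((1.0 - pred).clamp(min=eps)))
    return (weight * bce).mean()

pred = torch.rand(4, 100, 11)          # (batch, frames, classes) sigmoid outputs
soft = torch.rand(4, 100, 11)          # soft reference labels in [0, 1]
print(weighted_soft_bce(pred, soft).item())
```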

PDF
Abstract

Optical fiber sensing is a technology in which sounds, vibrations, and temperature are detected using an optical fiber; in particular, sound/vibration-aware sensing is called distributed acoustic sensing (DAS). DAS has the potential to capture various types of sounds and/or vibrations over wide areas, e.g., the ground, the sea, and a city area, in our everyday life. Precisely recognizing the various types of events, e.g., whale calls, car horns, and wind, with DAS, however, poses two problems. First, there is little publicly available data and few pretrained models for the various types of events. Second, the signal-to-noise ratio (SNR) of DAS data is lower than that of other sensor data, such as microphone data, because of optical noise and the low sensitivity of DAS. To tackle the lack of DAS data, we first demonstrate a DAS simulation method in which DAS observations are simulated by exploiting a microphone simulation. We then propose a method of event classification for DAS utilizing a pretrained audio recognition model, where none of the DAS data are used for training. Moreover, we advocate a class-level gated unit with the pretrained model to overcome the poor classification performance caused by the low SNR of the DAS data. In the proposed method, class probabilities, which are the output of the pretrained model, are employed for controlling priors of DAS, such as events of interest or optical noise. Directly controlling the class probabilities, which are non-black-box values, as priors enables us to utilize not only a pretrained model but also powerful human knowledge. To verify the performance of the proposed method, we conduct event classification in which DAS-observed signals are simulated using the ESC-50 dataset. Experimental results show that the accuracy of the proposed method is improved by 36.75 percentage points compared with that of conventional methods.

PDF
Abstract

This paper defines the new problem of "audio-change captioning," which describes what has changed between two audio samples. Conventional audio-captioning methods cannot be used to explain such changes, and conventional image-change-captioning methods cannot explain the differences between audio samples. To address these issues, we propose a neural-network model for generating sentences that explain how a machine's normal and anomalous sounds changed in relation to each other. We also created a dataset called MIMII-Change by annotating pairs of normal and anomalous samples extracted from MIMII-DG for each type of machine-operation sound. The experimental results indicate that our model with a spatial attention architecture is effective for stationary sounds because it is able to determine changes in global features, while our model with a Transformer encoder architecture is effective for periodic and sudden sounds because it is able to determine temporal dependencies.

PDF
Abstract

The well-being of animals holds significant importance in our society. Apart from the ethical concerns, recent studies have highlighted the correlation of animal growth, reproductive potential, and overall productivity with animal welfare. In this context, the vocalizations of cows have emerged as a valuable indicator of their well-being for veterinary researchers, but gathering and labelling the vocalizations for their in-depth study is time-consuming and labour-intensive. For this reason, in this work we present an acoustic event detection algorithm that has been trained and validated with different setups using acoustic data collected from two different farms. The experimental set-up consists of a Convolutional Neural Network followed by a post-processing stage for the detection of vocalizations, so veterinary researchers can easily analyze them. The experimental evaluation assesses the importance of selecting a suitable post-processing and overlapping acoustic window for finding new vocalizations. Furthermore, the study evaluates the significance of using data collected specifically from the same farm for acoustic event detection, as opposed to employing data from a different farm. Results show that by merging training data from different farms, including the farm being evaluated, an F1 score of 57.40% and a recall of 74.05% can be achieved.

PDF
Abstract

In this paper, a novel model training framework constituted by deep mutual learning (DML) and knowledge distillation (KD) fine-tuning is proposed for low-complexity acoustic scene classification (ASC). The model training phase consists of two stages. In the first stage, a ResNet38 teacher model pre-trained on AudioSet and three low-complexity BC-Res2Net student models with different widths and depths are involved in DML to enhance the teacher model performance, and attain a well-initialized student model. In the second stage, we utilize KD fine-tuning to teach this student model to learn from the high-performing teacher model while maintaining the predictive performance of the teacher model. Experimental results on TAU Urban Acoustic Scenes 2022 Mobile development dataset demonstrate the effectiveness of the proposed framework as well as its superiority over using KD alone under the same configurations.

PDF
Abstract

Sound event localization and detection (SELD) systems estimate both the direction-of-arrival (DOA) and the class of sound sources over time. In the DCASE 2022 SELD Challenge (Task 3), models are designed to operate in a 4-channel setting. While beneficial for furthering the development of SELD systems using a multichannel recording setup such as first-order Ambisonics (FOA), most consumer electronics devices are rarely able to record with more than two channels. For this reason, in this work we investigate the performance of the DCASE 2022 SELD baseline model using three audio input representations: FOA, binaural, and stereo. We perform a novel comparative analysis illustrating the effect of these audio input representations on SELD performance. Crucially, we show that binaural and stereo (i.e. 2-channel) audio-based SELD models are still able to localize and detect sound sources laterally quite well, despite overall performance degrading as less audio information is provided. Further, we segment our analysis by scenes containing varying degrees of sound source polyphony to better understand the effect of the audio input representation on localization and detection performance as scene conditions become increasingly complex.

PDF
Abstract

Sound Event Localization and Detection (SELD) involves detecting different types of sound events along with their temporal and spatial information, specifically class-level event detection and the corresponding directions of arrival at each frame. In practice, real-world sound scenes can have complex conditions; for instance, DCASE Challenge Task 3 contains simultaneous occurrences of up to three or even five events. In previous works, model sizes have become bigger and bigger, and some models are difficult to deploy at the edge of sensor networks. To reduce the number of parameters in models, in this paper we describe Probabilistic Localization and Detection of Independent Sound Events with Transformers (PLDISET), a novel solution for SELD that can be extended to sound target tracking. The solution consists of three stages: first, we generate several tracks from audio features; second, those tracks are input to two transformers for SED and localization, respectively; third, a linear Gaussian system is used to predict possible locations for the tracks. We show the improvements of our model compared with the baseline system on the DCASE development datasets.

PDF
Abstract

This paper explores grading text-based audio retrieval relevances with crowdsourcing assessments. Given a free-form text (e.g., a caption) as a query, crowdworkers are asked to grade audio clips using numeric scores (between 0 and 100) to indicate their judgements of how much the sound content of an audio clip matches the text, where 0 indicates no content match at all and 100 indicates perfect content match. We integrate the crowdsourced relevances into training and evaluating text-based audio retrieval systems, and evaluate the effect of using them together with binary relevances from audio captioning. Conventionally, these binary relevances are defined by captioning-based audio-caption pairs, where being positive indicates that the caption describes the paired audio, and being negative applies to all other pairs. Experimental results indicate that there is no clear benefit from incorporating crowdsourced relevances alongside binary relevances when the crowdsourced relevances are binarized for contrastive learning. Conversely, the results suggest that using only binary relevances defined by captioning-based audio-caption pairs is sufficient for contrastive learning.

PDF
Abstract

Foley sound generation aims to synthesise the background sound for multimedia content. Previous models usually employ a large development set with labels as input (e.g., single numbers or one-hot vectors). In this work, we propose a diffusion-based generation system for Foley sound generation with text-driven conditions. To alleviate the data scarcity issue, our model is initially pre-trained on large-scale datasets and fine-tuned to this task via transfer learning using the contrastive language-audio pretraining (CLAP) technique. We have observed that the feature embedding extracted by the text encoder can significantly affect the performance of the generation model. Hence, we introduce a trainable layer after the encoder to improve the text embedding produced by the encoder. In addition, we further refine the generated waveform by generating multiple candidate audio clips simultaneously and selecting the best one, determined by the similarity score between the embedding of the candidate clips and the embedding of the target text label. Using the proposed method, our submitted system ranks 1st in DCASE Challenge 2023 Task 7. The results of the ablation studies illustrate that the proposed techniques significantly improve sound generation performance. The code for implementing the proposed system is available at https://github.com/yyua8222/Dcase2023_task7.
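The candidate-selection step can be sketched independently of the generation model: given precomputed (CLAP-style) embeddings of the generated clips and of the target text, pick the clip with the highest cosine similarity. The embedding dimensionality and the random inputs below are placeholders; in practice the embeddings would come from a CLAP-type encoder.

```python
import numpy as np

def select_best_candidate(candidate_embeddings, text_embedding):
    """Return the index of the generated clip whose audio embedding has the highest
    cosine similarity to the target text embedding, plus all similarity scores."""
    c = candidate_embeddings / np.linalg.norm(candidate_embeddings, axis=1, keepdims=True)
    t = text_embedding / np.linalg.norm(text_embedding)
    similarities = c @ t
    return int(np.argmax(similarities)), similarities

# Toy usage with placeholder embeddings for 8 candidate clips of one class/prompt.
candidates = np.random.randn(8, 512)
text = np.random.randn(512)
best_index, scores = select_best_candidate(candidates, text)
print(best_index, scores.round(2))
```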

PDF