Poster Session I
CLAP4SED: Training-Free Multimodal Few-Shot Retrieval for Real-Time Sound Event Detection on Embedded Devices
|
Improving Language-Based Audio Retrieval Using Llm Augmentations
|
Prediction of Pleasantness and Eventfulness Perceptual Sound Qualities in Urban Soundscapes”
|
Does Paid Crowdsourcing Still Pay off? Sifting Through Annotator Noise in Crowdsourced Audio Labels
|
A Sound Description: Exploring Prompt Templates and Class Descriptions to Enhance Zero-Shot Audio Classification
|
Moflenet: A Low Complexity Model for Acoustic Scene Classification
|
SALT: Standardized Audio Event Label Taxonomy
|
Task 1 Data-Efficient Low-Complexity Acoustic Scene Classification in the DCASE 2024 Challenge
Florian Schmid (Johannes Kepler University), Paul Primus (Johannes Kepler University), Toni Heittola (Tampere University), Annamaria Mesaros (Tampere University), Irene Martin (Tampere University), Khaled Koutini (Johannes Kepler University), Gerhard Widmer (Johannes Kepler University) |
Task 2 Description and Discussion on DCASE 2024 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring
Tomoya Nishida (Hitachi, Ltd.), Noboru Harada (NTT Corporation), Daisuke Niizumi (NTT Corporation), Davide Albertini (ST Microelectronics), Roberto Sannino (STMicroelectronics N.V.), Simone Pradolini (STMicroelectronics N.V.), Filippo Augusti (STMicroelectronics ), Keisuke Imoto (Doshisha University), Kota Dohi (Hitachi Ltd.), Harsh Purohit (Hitachi Ltd.), Takashi Endo (Hitachi, Ltd.), Yohei Kawaguchi (Hitachi, Ltd.) |
Task 3 Baseline Models and Evaluation of Sound Event Localization and Detection with Distance Estimation in DCASE 2024 Challenge
David Diaz-Guerra (University of Zaragoza), Archontis Politis (Tampere University), Parthasaarathy Ariyakulam Sudarsanam (Tampere University), Kazuki Shimada (Sony), Daniel A. Krause (Tampere University), Kengo Uchida (Sony), Yuichiro Koyama (Sony), Naoya Takahashi (Sony Research), Shusuke Takahashi (Sony Group Corporation), Takashi Shibuya (Sony), Yuki Mitsufuji (Sony AI), Tuomas Virtanen (Tampere University) |
Task 4 DCASE 2024 Task 4: Sound Event Detection with Heterogeneous Data and Missing Labels
Samuele Cornell (Carnegie Mellon University), Janek Ebbers (MERL), Constance Douwes (Inria), Irene Martin (Tampere University), Manu Harju (Tampere University), Annamaria Mesaros (Tampere University), romain serizel (Université de Lorraine) |
Task 5 Few-shot Bioacoustic Event Detection
Burooj Ghani (Naturalis Biodiversity Center), Ines Nolasco (Queen Mary University of London), Jinhua Liang (Queen Mary University of London), Shubhr Singh (Queen Mary University of London), Vincent Lostanlen (Centre National de la Recherche Scientifique(CNRS) Laboratoire des Sciences du Numérique de Nantes (LS2N)), Ariana Strandburg-Peshkin (University of Konstanz Max Planck Institute of Animal Behavior), Emily Grout (University of Konstanz Max Planck Institute of Animal Behavior), Hanna Pamula (AGH University of Science and Technology), Helen Whitehead (University of Salford), Joe Morford (University of Oxford), Michael Emmerson (Queen Mary University of London), Frants Jensen (Syracuse University), Ester Vidana Vila (La Salle, Universitat Ramon Llull), Dan Stowell (Tilburg University) |