Poster Session III

Poster Session III

 

ToyADMOS2#: Yet Another Dataset for the DCASE2024 Challenge Task 2 First-Shot Anomalous Sound Detection
Daisuke Niizumi (NTT Corporation), Noboru Harada (NTT), Yasunori Ohishi (NTT Corporation), Daiki Takeuchi (NTT Corporation), Masahiro Yasuda (NTT Corporation)

WildDESED: An LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection System
Yang Xiao (Fortemedia Singapore), Rohan Kumar Das (Fortemedia)

IMAD-DS: A Dataset for Industrial Multi-Sensor Anomaly Detection Under Domain Shift Conditions
Davide Albertini (ST Microelectronics), Filippo Augusti (STMicroelectronics ), Kudret Esmer (Politecnico di Milano), Alberto Bernardini (Politecnico di Milano), Roberto Sannino (ST Microelectronics)

Heterogeneous Sound Classification with the Broad Sound Taxonomy and Dataset
Panagiota Anastasopoulou (Music Technology Group, Universitat Pompeu Fabra), Jessica Torrey (Music Technology Group, Universitat Pompeu Fabra), Frederic Font (Music Technology Group - Universitat Pompeu Fabra)

Audio Captioning in Finnish and English with Task-Dependent Output
Irene Martin (Tampere University), Manu Harju (Tampere University), Annamaria Mesaros (Tampere University)

EnCLAP++: Analyzing the EnCLAP Framework for Optimizing Automated Audio Captioning Performance
Jaeyeon Kim (MindsLab Inc.), Jaeyoon Jung (Soongsil University), Minjeong Jeon (maum.ai), Sang Hoon Woo (N/A), JINJOO LEE (MAUM.AI)

Pre-Trained Models, Datasets, Data Augmentation, and Inference Time Augmentation for Language-Based Audio Retrieval
Hokuto Munakata (LY Corporation), Taichi Nishimura (LY Corporation), Shota Nakada (LY Corporation), Tatsuya Komatsu (LY Corporation)

Representational Learning for an Anomalous Sound Detection System
Seunghyeon Shin (Kyungpook National University), Seokjin Lee (Kyungpook National University)

Towards Learning a Difference-aware General-purpose Audio Representation
Daiki Takeuchi (NTT Corporation), Masahiro Yasuda (NTT Corporation), Daisuke Niizumi (NTT Corporation), Noboru Harada (NTT)

Improving Query-by-Vocal Imitation with Contrastive Learning and Audio Pretraining
Jonathan Greif (Student at JKU), Florian Schmid (Johannes Kepler University), Paul Primus (Johannes Kepler University), Gerhard Widmer (Johannes Kepler University)

Machine Listening in a Neonatal Intensive Care Unit
Modan Tailleur (LS2N, École Centrale Nantes), Vincent Lostanlen (LS2N, CNRS), Jean-Philippe Rivière (LS2N, Nantes Université), Pierre Aumond (UMRAE, Université Gustave Eiffel)