| 0:10 | 
* Challenge spotlights | 
| 
 Reports on the 6 tasks of the DCASE 2020 challenge (Session Co-Chairs: Annamaria Mesaros and Romain Serizel) 
 | 
 
 
 | 
| 1:10 | 
Oral session II (1st play) | 
| 
 Sound Event Detection and Localization II (Session Co-Chairs: Sharath Adavanne and Yasunori Ohishi) 
| 
                              L05
                               | 
 
                              On multitask loss function for audio event detection and localization
                               
Huy Phan, Lam Pham, Philipp Koch, Ngoc Q. K. Duong, Ian McLoughlin, Alfred Mertins
 
 | 
 
| 
                              L06
                               | 
 
                               Event-independent network for polyphonic sound event localization and detection
                               
Yin Cao, Turab Iqbal, Qiuqiang Kong, Yue Zhong, Wenwu Wang, Mark D. Plumbley
 
 | 
 
| 
                              L07
                               | 
 
                              A multi-resolution approach to sound event detection in DCASE 2020 Task4
                               
Diego de Benito-Gorron, Daniel Ramos, Doroteo T. Toledano
 
 | 
 
| 
                              L08
                               | 
 
                              Training sound event detection on a heterogeneous dataset
                               
Nicolas Turpault, Romain Serizel
 
 | 
 
 
 | 
 
  | 
| 3:00 | 
Oral session I (2nd play) | 
| 
 Sound Event Detection and Localization I (Session Co-Chairs: Yin Cao and Keisuke Imoto) 
| 
                              L01
                               | 
 
                              Conformer-based sound event detection with semi-supervised learning and data augmentation
                               
Koichi Miyazaki, Tatsuya Komatsu, Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Kazuya Takeda
 
 | 
 
| 
                              L02
                               | 
 
                              On the effectiveness of spatial and multi-channel features for multi-channel polyphonic sound event detection
                               
Thi Ngoc Tho Nguyen, Douglas L. Jones, Woon Seng Gan
 
 | 
 
| 
                              L03
                               | 
 
                              Guided multi-branch learning systems for sound event detection with sound separation
                               
Yuxin Huang, Liwei Lin, Shuo Ma, Xiangdong Wang, Hong Liu, Yueliang Qian, Min Liu, Kazushige Ouchi
 
 | 
 
| 
                              L04
                               | 
 
                              Ensemble of sequence matching networks for dynamic sound event localization, detection, and tracking
                               
Thi Ngoc Tho Nguyen, Douglas L. Jones, Woon Seng Gan
 
 | 
 
 
 |  
 
 | 
| 4:30 | 
Poster highlights | 
| 
 The short highlight videos will be cast on virtual platform by the organizers. The 15-minute presentation videos and Q&A fora can be accessed at any time during the workshop. 
List of posters 
 | 
 
 
 | 
| 6:00 | 
Keynote I  (2nd play) | 
| 
 Mounya Elhilali Johns Hopkins University, Department of Electrical and Computer Engineering 
Active listening in everyday soundscapes 
Abstract & bio 
 | 
 
 
 | 
| 7:00 | 
* Oral session II (2nd play) | 
| 
 Sound Event Detection and Localization II (Session Co-Chairs: Yasunori Ohishi and Tuomas Virtanen) 
| 
                              L05
                               | 
 
                              On multitask loss function for audio event detection and localization
                               
Huy Phan, Lam Pham, Philipp Koch, Ngoc Q. K. Duong, Ian McLoughlin, Alfred Mertins
 
 | 
 
| 
                              L06
                               | 
 
                               Event-independent network for polyphonic sound event localization and detection
                               
Yin Cao, Turab Iqbal, Qiuqiang Kong, Yue Zhong, Wenwu Wang, Mark D. Plumbley
 
 | 
 
| 
                              L07
                               | 
 
                              A multi-resolution approach to sound event detection in DCASE 2020 Task4
                               
Diego de Benito-Gorron, Daniel Ramos, Doroteo T. Toledano
 
 | 
 
| 
                              L08
                               | 
 
                              Training sound event detection on a heterogeneous dataset
                               
Nicolas Turpault, Romain Serizel
 
 | 
 
 
 | 
 
  | 
 | 
End of day1 | 
| 
 The 15-minute presentation videos and Q&A fora can be accessed at any time during the workshop. 
 | 
 
 
 | 
 | 
 | 
Day2 | 
| 16:10 | 
* Welcome | 
| 
 Yohei Kawaguchi Hitachi, Ltd. 
 | 
 
 
 | 
| 16:20 | 
* Keynote II  (1st play) | 
| 
 Shin'ichi Satoh National Institute of Informatics 
How benchmarks work for visual recognition research? --   Historical review and future prospects 
Abstract & bio 
 | 
 
 
 | 
| 17:20 | 
* Oral session III (1st play) | 
| 
 Scene Classification and Anomalous Sound Detection I (Session Co-Chairs: Yuma Koizumi and Gordon Wichern) 
| 
                              L09
                               | 
 
                              Group masked autoencoder based density estimator for audio anomaly detection
                               
Ritwik Giri, Fangzhou Cheng, Karim Helwani, Srikanth V. Tenneti, Umut Isik, Arvindh Krishnaswamy
 
 | 
 
| 
                              L10
                               | 
 
                              Detection of anomalous sounds for machine condition monitoring using classification confidence
                               
Tadanobu Inoue, Phongtharin Vinayavekhin, Shu Morikuni, Shiqiang Wang, Tuan Hoang Trong, David Wood, Michiaki Tatsubori, Ryuki Tachibana
 
 | 
 
| 
                              L11
                               | 
 
                              Acoustic scene classification with spectrogram processing strategies
                               
Helin Wang, Yuexian Zou, Dading Chong
 
 | 
 
| 
                              L12
                               | 
 
                              Searching for efficient network architectures for acoustic scene classification
                               
Yuzhong Wu, Tan Lee
 
 | 
 
 
 | 
 
 
 | 
| 18:50 | 
Poster highlights | 
| 
 The short highlight videos will be cast on virtual platform by the organizers. The 15-minute presentation videos and Q&A fora can be accessed at any time during the workshop. 
List of posters 
 | 
 
 
 | 
| 21:00 | 
* Oral session IV (1st play) | 
| 
 Audio Captioning (Session Co-Chairs: Il-Young Jeong and Tatsuya Komatsu) 
| 
                              L16
                               | 
 
                              Effects of word-frequency based pre- and post- processings for audio captioning
                               
Daiki Takeuchi, Yuma Koizumi, Yasunori Ohishi, Noboru Harada, Kunio Kashino
 
 | 
 
| 
                              L17
                               | 
 
                              Multi-task regularization based on infrequent classes for audio captioning
                               
Emre Çakir, Konstantinos Drossos, Tuomas Virtanen
 
 | 
 
| 
                              L18
                               | 
 
                              Temporal sub-sampling of audio feature sequences for automated audio captioning
                               
Khoa Nguyen, Konstantinos Drossos, Tuomas Virtanen
 
 | 
 
 
 | 
 
 
 | 
| 22:10 | 
* Oral session V (1st play) | 
| 
 Scene Classification and Anomalous Sound Detection II (Session Co-Chairs: Sakiko Mishima and Nobutaka Ono) 
| 
                              L13
                               | 
 
                              Low-complexity models for acoustic scene classification based on receptive field regularization and frequency damping
                               
Khaled Koutini, Florian Henkel, Hamid Eghbal-zadeh, Gerhard Widmer
 
 | 
 
| 
                              L14
                               | 
 
                              ID-Conditioned auto-encoder for unsupervised anomaly detection
                               
Sławomir Kapka
 
 | 
 
| 
                              L15
                               | 
 
                              Anomalous sound detection as a simple binary classification problem with careful selection of proxy outlier examples
                               
Paul Primus, Verena Haunschmid, Patrick Praher, Gerhard Widmer
 
 | 
 
 
 | 
 
 
 | 
| 23:20 | 
* Virtual tour II | 
| 
 Tokyo: A city of old and new
  
 | 
 
 
 |