| 0:00 | 
Poster highlights | 
| 
 The short highlight videos will be cast on virtual platform by the organizers. The 15-minute presentation videos and Q&A fora can be accessed at any time during the workshop. 
List of posters 
 | 
 
 
 | 
| 2:00 | 
* Virtual tour I | 
| 
 Japan: Is Japan an ideal destination for your holidays?
  
 | 
 
 
 | 
| 3:10 | 
* Challenge spotlights | 
| 
 Reports on the 6 tasks of the DCASE 2020 challenge (Session Co-Chairs: Annamaria Mesaros and Romain Serizel) 
 | 
 
 
 | 
| 4:10 | 
Oral session II (1st play) | 
| 
 Sound Event Detection and Localization II (Session Co-Chairs: Sharath Adavanne and Yasunori Ohishi) 
| 
                              L05
                               | 
 
                              On multitask loss function for audio event detection and localization
                               
Huy Phan, Lam Pham, Philipp Koch, Ngoc Q. K. Duong, Ian McLoughlin, Alfred Mertins
 
 | 
 
| 
                              L06
                               | 
 
                               Event-independent network for polyphonic sound event localization and detection
                               
Yin Cao, Turab Iqbal, Qiuqiang Kong, Yue Zhong, Wenwu Wang, Mark D. Plumbley
 
 | 
 
| 
                              L07
                               | 
 
                              A multi-resolution approach to sound event detection in DCASE 2020 Task4
                               
Diego de Benito-Gorron, Daniel Ramos, Doroteo T. Toledano
 
 | 
 
| 
                              L08
                               | 
 
                              Training sound event detection on a heterogeneous dataset
                               
Nicolas Turpault, Romain Serizel
 
 | 
 
 
 | 
 
  | 
| 6:00 | 
Oral session I (2nd play) | 
| 
 Sound Event Detection and Localization I (Session Co-Chairs: Yin Cao and Keisuke Imoto) 
| 
                              L01
                               | 
 
                              Conformer-based sound event detection with semi-supervised learning and data augmentation
                               
Koichi Miyazaki, Tatsuya Komatsu, Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Kazuya Takeda
 
 | 
 
| 
                              L02
                               | 
 
                              On the effectiveness of spatial and multi-channel features for multi-channel polyphonic sound event detection
                               
Thi Ngoc Tho Nguyen, Douglas L. Jones, Woon Seng Gan
 
 | 
 
| 
                              L03
                               | 
 
                              Guided multi-branch learning systems for sound event detection with sound separation
                               
Yuxin Huang, Liwei Lin, Shuo Ma, Xiangdong Wang, Hong Liu, Yueliang Qian, Min Liu, Kazushige Ouchi
 
 | 
 
| 
                              L04
                               | 
 
                              Ensemble of sequence matching networks for dynamic sound event localization, detection, and tracking
                               
Thi Ngoc Tho Nguyen, Douglas L. Jones, Woon Seng Gan
 
 | 
 
 
 |  
 
 | 
| 7:30 | 
Poster highlights | 
| 
 The short highlight videos will be cast on virtual platform by the organizers. The 15-minute presentation videos and Q&A fora can be accessed at any time during the workshop. 
List of posters 
 | 
 
 
 | 
| 9:00 | 
Keynote I  (2nd play) | 
| 
 Mounya Elhilali Johns Hopkins University, Department of Electrical and Computer Engineering 
Active listening in everyday soundscapes 
Abstract & bio 
 | 
 
 
 | 
| 10:00 | 
* Oral session II (2nd play) | 
| 
 Sound Event Detection and Localization II (Session Co-Chairs: Yasunori Ohishi and Tuomas Virtanen) 
| 
                              L05
                               | 
 
                              On multitask loss function for audio event detection and localization
                               
Huy Phan, Lam Pham, Philipp Koch, Ngoc Q. K. Duong, Ian McLoughlin, Alfred Mertins
 
 | 
 
| 
                              L06
                               | 
 
                               Event-independent network for polyphonic sound event localization and detection
                               
Yin Cao, Turab Iqbal, Qiuqiang Kong, Yue Zhong, Wenwu Wang, Mark D. Plumbley
 
 | 
 
| 
                              L07
                               | 
 
                              A multi-resolution approach to sound event detection in DCASE 2020 Task4
                               
Diego de Benito-Gorron, Daniel Ramos, Doroteo T. Toledano
 
 | 
 
| 
                              L08
                               | 
 
                              Training sound event detection on a heterogeneous dataset
                               
Nicolas Turpault, Romain Serizel
 
 | 
 
 
 | 
 
  | 
 | 
End of day1 | 
| 
 The 15-minute presentation videos and Q&A fora can be accessed at any time during the workshop. 
 | 
 
 
 | 
 | 
 | 
Day2 | 
| 19:10 | 
* Welcome | 
| 
 Yohei Kawaguchi Hitachi, Ltd. 
 | 
 
 
 | 
| 19:20 | 
* Keynote II  (1st play) | 
| 
 Shin'ichi Satoh National Institute of Informatics 
How benchmarks work for visual recognition research? --   Historical review and future prospects 
Abstract & bio 
 | 
 
 
 | 
| 20:20 | 
* Oral session III (1st play) | 
| 
 Scene Classification and Anomalous Sound Detection I (Session Co-Chairs: Yuma Koizumi and Gordon Wichern) 
| 
                              L09
                               | 
 
                              Group masked autoencoder based density estimator for audio anomaly detection
                               
Ritwik Giri, Fangzhou Cheng, Karim Helwani, Srikanth V. Tenneti, Umut Isik, Arvindh Krishnaswamy
 
 | 
 
| 
                              L10
                               | 
 
                              Detection of anomalous sounds for machine condition monitoring using classification confidence
                               
Tadanobu Inoue, Phongtharin Vinayavekhin, Shu Morikuni, Shiqiang Wang, Tuan Hoang Trong, David Wood, Michiaki Tatsubori, Ryuki Tachibana
 
 | 
 
| 
                              L11
                               | 
 
                              Acoustic scene classification with spectrogram processing strategies
                               
Helin Wang, Yuexian Zou, Dading Chong
 
 | 
 
| 
                              L12
                               | 
 
                              Searching for efficient network architectures for acoustic scene classification
                               
Yuzhong Wu, Tan Lee
 
 | 
 
 
 | 
 
 
 | 
| 21:50 | 
Poster highlights | 
| 
 The short highlight videos will be cast on virtual platform by the organizers. The 15-minute presentation videos and Q&A fora can be accessed at any time during the workshop. 
List of posters 
 | 
 
 
 |