|
|
Day2 |
9:10 |
* Welcome |
Yohei Kawaguchi Hitachi, Ltd.
|
|
9:20 |
* Keynote II (1st play) |
Shin'ichi Satoh National Institute of Informatics
How benchmarks work for visual recognition research? -- Historical review and future prospects
Abstract & bio
|
|
10:20 |
* Oral session III (1st play) |
Scene Classification and Anomalous Sound Detection I (Session Co-Chairs: Yuma Koizumi and Gordon Wichern)
L09
|
Group masked autoencoder based density estimator for audio anomaly detection
Ritwik Giri, Fangzhou Cheng, Karim Helwani, Srikanth V. Tenneti, Umut Isik, Arvindh Krishnaswamy
|
L10
|
Detection of anomalous sounds for machine condition monitoring using classification confidence
Tadanobu Inoue, Phongtharin Vinayavekhin, Shu Morikuni, Shiqiang Wang, Tuan Hoang Trong, David Wood, Michiaki Tatsubori, Ryuki Tachibana
|
L11
|
Acoustic scene classification with spectrogram processing strategies
Helin Wang, Yuexian Zou, Dading Chong
|
L12
|
Searching for efficient network architectures for acoustic scene classification
Yuzhong Wu, Tan Lee
|
|
|
11:50 |
Poster highlights |
The short highlight videos will be cast on virtual platform by the organizers. The 15-minute presentation videos and Q&A fora can be accessed at any time during the workshop.
List of posters
|
|
14:00 |
* Oral session IV (1st play) |
Audio Captioning (Session Co-Chairs: Il-Young Jeong and Tatsuya Komatsu)
L16
|
Effects of word-frequency based pre- and post- processings for audio captioning
Daiki Takeuchi, Yuma Koizumi, Yasunori Ohishi, Noboru Harada, Kunio Kashino
|
L17
|
Multi-task regularization based on infrequent classes for audio captioning
Emre Çakir, Konstantinos Drossos, Tuomas Virtanen
|
L18
|
Temporal sub-sampling of audio feature sequences for automated audio captioning
Khoa Nguyen, Konstantinos Drossos, Tuomas Virtanen
|
|
|
15:10 |
* Oral session V (1st play) |
Scene Classification and Anomalous Sound Detection II (Session Co-Chairs: Sakiko Mishima and Nobutaka Ono)
L13
|
Low-complexity models for acoustic scene classification based on receptive field regularization and frequency damping
Khaled Koutini, Florian Henkel, Hamid Eghbal-zadeh, Gerhard Widmer
|
L14
|
ID-Conditioned auto-encoder for unsupervised anomaly detection
Sławomir Kapka
|
L15
|
Anomalous sound detection as a simple binary classification problem with careful selection of proxy outlier examples
Paul Primus, Verena Haunschmid, Patrick Praher, Gerhard Widmer
|
|
|
16:20 |
* Virtual tour II |
Tokyo: A city of old and new
|
|
17:30 |
* Sponsor Event |
Short presentation by platinum and gold sponsors (Hitachi, LINE, and NEC)
|
|
18:00 |
Keynote II (2nd play) |
Shin'ichi Satoh National Institute of Informatics
How benchmarks work for visual recognition research? -- Historical review and future prospects
Abstract & bio
|
|
19:00 |
Poster highlights |
The short highlight videos will be cast on virtual platform by the organizers. The 15-minute presentation videos and Q&A fora can be accessed at any time during the workshop.
List of posters
|
|
21:20 |
Oral session IV (2nd play) |
Audio Captioning (Session Co-Chairs: Tatsuya Komatsu and Robin Scheibler)
L16
|
Effects of word-frequency based pre- and post- processings for audio captioning
Daiki Takeuchi, Yuma Koizumi, Yasunori Ohishi, Noboru Harada, Kunio Kashino
|
L17
|
Multi-task regularization based on infrequent classes for audio captioning
Emre Çakir, Konstantinos Drossos, Tuomas Virtanen
|
L18
|
Temporal sub-sampling of audio feature sequences for automated audio captioning
Khoa Nguyen, Konstantinos Drossos, Tuomas Virtanen
|
|
|
22:30 |
Oral session III (2nd play) |
Scene Classification and Anomalous Sound Detection I (Session Co-Chairs: Yuma Koizumi and Gordon Wichern)
L09
|
Detection of anomalous sounds for machine condition monitoring using classification confidence
Tadanobu Inoue, Phongtharin Vinayavekhin, Shu Morikuni, Shiqiang Wang, Tuan Hoang Trong, David Wood, Michiaki Tatsubori, Ryuki Tachibana
|
L10
|
Acoustic scene classification with spectrogram processing strategies
Helin Wang, Yuexian Zou, Dading Chong
|
L11
|
Searching for efficient network architectures for acoustic scene classification
Yuzhong Wu, Tan Lee
|
L12
|
Group masked autoencoder based density estimator for audio anomaly detection
Ritwik Giri, Fangzhou Cheng, Karim Helwani, Srikanth V. Tenneti, Umut Isik, Arvindh Krishnaswamy
|
|
|
0:00 |
Oral session V (2nd play) |
Scene Classification and Anomalous Sound Detection II (Session Co-Chairs: Nobutaka Ono and Mark Plumbley)
L13
|
Low-complexity models for acoustic scene classification based on receptive field regularization and frequency damping
Khaled Koutini, Florian Henkel, Hamid Eghbal-zadeh, Gerhard Widmer
|
L14
|
ID-Conditioned auto-encoder for unsupervised anomaly detection
Sławomir Kapka
|
L15
|
Anomalous sound detection as a simple binary classification problem with careful selection of proxy outlier examples
Paul Primus, Verena Haunschmid, Patrick Praher, Gerhard Widmer
|
|
|
|
End of day2 |
The 15-minute presentation videos and Q&A fora can be accessed at any time during the workshop.
|
|