Portfolio item number 1
Short description of portfolio item number 1
Short description of portfolio item number 1
Short description of portfolio item number 2
Published in Journal 1, 2010
This paper is about the number 2. The number 3 is left for future work.
Recommended citation: Your Name, You. (2010). "Paper Title Number 2." Journal 1. 1(2). http://academicpages.github.io/files/paper2.pdf
Published in Journal 1, 2015
This paper is about the number 3. The number 4 is left for future work.
Recommended citation: Your Name, You. (2015). "Paper Title Number 3." Journal 1. 1(3). http://academicpages.github.io/files/paper3.pdf
Published in IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022
In this paper, we first develop a multi-modal Mandarin corpus, which contains air- and bone-conducted synchronized speech (ABCS). Then, we propose a multi-modal conformer ASR system based on a novel multi-modal transducer.
Recommended citation: M. Wang, J. Chen, X. -L. Zhang and S. Rahardja, "End-to-End Multi-Modal Speech Recognition on an Air and Bone Conducted Speech Corpus," in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 31, pp. 513-524, 2023, doi: 10.1109/TASLP.2022.3224305. https://ieeexplore.ieee.org/document/9961873
Published in IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
In this work, we propose a speaker-dependent smoothed frame-level SINR estimation method for sensor selection in multi-speaker scenarios, specifically addressing source movement within DASN. Additionally, we devise an approach for similarity measurement to generate dynamic speaker embeddings resilient to variations in reference speech levels. Furthermore, we introduce a novel loss function that integrates classification and ordinal regression within a unified framework.
Recommended citation: S. Guan, M. Wang, Z. Bai, J. Wang, J. Chen and J. Benesty, "Smoothed Frame-Level SINR and Its Estimation for Sensor Selection in Distributed Acoustic Sensor Networks," in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 32, pp. 4554-4568, 2024, 10.1109/TASLP.2024.3477277. https://ieeexplore.ieee.org/document/10711254
Published:
This is a description of your talk, which is a markdown files that can be all markdown-ified like any other post. Yay markdown!
Published:
This is a description of your conference proceedings talk, note the different field in type. You can put anything in this field.
Undergraduate course, University 1, Department, 2014
This is a description of a teaching experience. You can use markdown like any other post.
Workshop, University 1, Department, 2015
This is a description of a teaching experience. You can use markdown like any other post.