[1] Wang, DeLiang. "Deep learning reinvents the hearing aid." IEEE spectrum 54.3 (2017): 32-37.
[2] Weninger, Felix, et al. "Speech enhancement with LSTM recurrent neural networks and its application to noise-robust ASR." International conference on latent variable analysis and signal separation. Springer, Cham, 2015.
[3] Shon, Suwon, Hao Tang, and James Glass. "Voiceid loss: Speech enhancement for speaker verification." arXiv preprint arXiv:1904.03601 (2019).
[4] Loizou, Philipos C. Speech enhancement: theory and practice. CRC press, 2007.
[5] Boll, Steven. "Suppression of acoustic noise in speech using spectral subtraction." IEEE Transactions on acoustics, speech, and signal processing 27.2 (1979): 113-120.
[6] Lim, Jae Soo, and Alan V. Oppenheim. "Enhancement and bandwidth compression of noisy speech." Proceedings of the IEEE 67.12 (1979): 1586-1604.
[7] Ephraim, Yariv, and David Malah. "Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator." IEEE Transactions on acoustics, speech, and signal processing 32.6 (1984): 1109-1121.
[8] Ephraim, Yariv, and David Malah. "Speech enhancement using a minimum mean-square error log-spectral amplitude estimator." IEEE transactions on acoustics, speech, and signal processing 33.2 (1985): 443-445.
[9] Xu, Yong, et al. "A regression approach to speech enhancement based on deep neural networks." IEEE/ACM Transactions on Audio, Speech, and Language Processing 23.1 (2014): 7-19.
[10] Zhao, Yan, Zhong-Qiu Wang, and DeLiang Wang. "Two-stage deep learning for noisy-reverberant speech enhancement." IEEE/ACM transactions on audio, speech, and language processing 27.1 (2018): 53-62.
[11] Peng, Chiang-Jen, et al. "Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenario." 2021 IEEE International Symposium on Circuits and Systems (ISCAS). IEEE, 2021.
[12] Chuang, Fu-Kai, et al. "Speaker-Aware Deep Denoising Autoencoder with Embedded Speaker Identity for Speech Enhancement." Interspeech. 2019.
[13] Upadhyay, Navneet, and Rahul Kumar Jaiswal. "Single channel speech enhancement: using Wiener filtering with recursive noise estimation." Procedia Computer Science 84 (2016): 22-30.
[14] Bees, Duncan, Maier Blostein, and Peter Kabal. "Reverberant speech enhancement using cepstral processing." Acoustics, Speech, and Signal Processing, IEEE International Conference on. IEEE Computer Society, 1991.
[15] Veisi, Hadi, and Hossein Sameti. "Speech enhancement using hidden Markov models in Mel-frequency domain." Speech Communication 55.2 (2013): 205-220.
[16] Boucheron, Laura E., and Phillip L. De Leon. "On the inversion of mel-frequency cepstral coefficients for speech enhancement applications." 2008 International Conference on Signals and Electronic Systems. IEEE, 2008.
[17] Fukushima, Kunihiko, Sei Miyake, and Takayuki Ito. "Neocognitron: A neural network model for a mechanism of visual pattern recognition." IEEE transactions on systems, man, and cybernetics 5 (1983): 826-834.
[18] LeCun, Yann, et al. "Backpropagation applied to handwritten zip code recognition." Neural computation 1.4 (1989): 541-551.
[19] Hochreiter, Sepp, et al. "Gradient flow in recurrent nets: the difficulty of learning long-term dependencies." (2001).
[20] Lu, Xugang, et al. "Speech enhancement based on deep denoising autoencoder." Interspeech. Vol. 2013. 2013.
[21] Caruana, Rich. "Multitask learning." Machine learning 28.1 (1997): 41-75.
[22] Hochreiter, Sepp, and Jürgen Schmidhuber. "Long short-term memory." Neural computation 9.8 (1997): 1735-1780.
[23] https://colah.github.io/posts/2015-08-Understanding-LSTMs/
[24] Huang, M. "Development of taiwan mandarin hearing in noise test." Department of speech language pathology and audiology, National Taipei University of Nursing and Health science (2005).
[25] Hu, Guoning, and DeLiang Wang. "A tandem algorithm for pitch estimation and voiced speech segregation." IEEE Transactions on Audio, Speech, and Language Processing 18.8 (2010): 2067-2079.
[26] Hu, Yi, and Philipos C. Loizou. "Evaluation of objective quality measures for speech enhancement." IEEE Transactions on audio, speech, and language processing 16.1 (2007): 229-238.
[27] Taal, Cees H., et al. "An algorithm for intelligibility prediction of time–frequency weighted noisy speech." IEEE Transactions on Audio, Speech, and Language Processing 19.7 (2011): 2125-2136.
[28] http://www.pal-acoustics.com/index.php?a=services&id=143&lang=cn