References

BPR+05

Felix Burkhardt, Astrid Paeschke, Miriam Rolfes, Walter F Sendlmeier, and Benjamin Weiss. A database of German emotional speech. In Ninth European Conference on Speech Communication and Technology. 2005.

NCZ17

Arsha Nagrani, Joon Son Chung, and Andrew Zisserman. VoxCeleb: a large-scale speaker identification dataset. In Proc. Interspeech 2017, 2616–2620. 2017. URL: https://arxiv.org/abs/1706.08612, doi:10.21437/Interspeech.2017-950.

RPS18

Dario Rethage, Jordi Pons, and Xavier Serra. A Wavenet for speech denoising. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 5069–5073. 2018. URL: https://arxiv.org/abs/1706.07162, doi:10.1109/ICASSP.2018.8462417.