December 1, 2017
Learning Neural Audio Embeddings for Grounding Semantics in Auditory Perception
Journal of Artificial Intelligence Research, Vol. 60
In this paper, we examine grounding semantic representations in raw auditory data, using standard evaluations for multi-modal semantics. Having established the quality of such auditorily grounded representations, we show how they can be applied to tasks where auditory perception is relevant, including two unsupervised categorization experiments, and provide further analysis.
By: Douwe Kiela, Stephen Clark