Quote
M. Liu, Z. Lu, X. Wang, J. P. J. da Cost, and T. Fei, "Sound source localization via distance metric learning with regularization," Signal Processing, p. 109721, 2024 [Online]. Available: https://www.sciencedirect.com/science/article/pii/S0165168424003414
Content
Sound source localization (SSL) or simply direction of arrival (DOA) classification is an important ingredient in acoustic applications. Traditional model-based algorithms are susceptible to the effects of noise and reverberation, while data-driven deep learning algorithms maintain strong performance across a variety of acoustic circumstances, but typically require a large amount of labeled data. Nevertheless, the existing datasets for SSL are not sufficiently big and diverse to achieve the full potential of deep learning algorithms. Then, it is an imperative work to develop a non-data-hungry algorithm of SSL using small or medium data volume. To this end, we propose a regularized distance metric learning algorithm, that is, by means of the kernel method, we design a nonlinear feature transformation from two aspects: feature points and feature distributions. It transforms the data into a new feature space that brings features of the same class as close as possible and removes features of different classes as far away as possible, which can significantly improve the output of a DOA classifier that follows. Experimental results show that the proposed algorithm outperforms deep learning algorithms in diverse acoustic conditions.