Abstract:
Simultaneous vocalization by multiple birds produces overlapping bird sounds. This paper proposes a bird sound separation method that integrates spatial features. Both spectral and spatial features of the overlapped sound signals are used as input, and a U-Conformer serves as the separation model that predicts a spectral magnitude mask (SMM). The sound source signal is then recovered from the mixed sound signal using the estimated SMM. Experiments on generated multi-channel bird sound data show that the method achieves better bird sound separation performance than existing methods.
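
The masking step can be illustrated with a minimal sketch, which is not the authors' implementation. It assumes the common definition of the SMM as the ratio of source magnitude to mixture magnitude in the time-frequency domain, clipped to [0, 1], and it assumes the mixture phase is reused for reconstruction. The function names `ideal_smm` and `apply_smm` are hypothetical. The mask here is computed from a known source (an oracle mask) purely for illustration; in the proposed method it would be predicted by the U-Conformer from spectral and spatial features, which this sketch omits.

```python
import numpy as np

def ideal_smm(source_stft, mixture_stft, eps=1e-8, max_value=1.0):
    """Oracle spectral magnitude mask |S| / |Y|, clipped to [0, max_value].

    The clipping range and eps are assumptions of this sketch.
    """
    mask = np.abs(source_stft) / (np.abs(mixture_stft) + eps)
    return np.clip(mask, 0.0, max_value)

def apply_smm(mask, mixture_stft):
    """Scale the mixture magnitude by the mask and keep the mixture phase."""
    return mask * mixture_stft

# Toy complex spectrograms (2 frames x 4 frequency bins) for two sources
# that occupy disjoint frequency bins, so masking can separate them exactly.
s1 = np.array([[1 + 1j, 2j, 0, 0],
               [0.5, 1 - 1j, 0, 0]], dtype=complex)
s2 = np.array([[0, 0, 3, 1j],
               [0, 0, -2, 1 + 1j]], dtype=complex)
mixture = s1 + s2

mask1 = ideal_smm(s1, mixture)     # stand-in for the network's estimated SMM
est1 = apply_smm(mask1, mixture)   # estimated spectrogram of source 1
```

When the sources overlap in the same time-frequency bins, the masked estimate is only approximate, because the mixture phase differs from the source phase. An inverse STFT of the masked spectrogram would then give the time-domain estimate.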