Monoaural Audio Source Separation Using Deep Convolutional Neural Networks

详细信息查看全文

关键词：Convolutional autoencoder ; Music source separation ; Deep learning ; Convolutional Neural Networks ; Low ; latency
刊名：Lecture Notes in Computer Science
出版年：2017
出版时间：2017
年：2017
卷：10169
期：1
页码：258-266
丛书名：Latent Variable Analysis and Signal Separation
ISBN：978-3-319-53547-0
卷排序：10169

文摘

In this paper we introduce a low-latency monaural source separation framework using a Convolutional Neural Network (CNN). We use a CNN to estimate time-frequency soft masks which are applied for source separation. We evaluate the performance of the neural network on a database comprising of musical mixtures of three instruments: voice, drums, bass as well as other instruments which vary from song to song. The proposed architecture is compared to a Multilayer Perceptron (MLP), achieving on-par results and a significant improvement in processing time. The algorithm was submitted to source separation evaluation campaigns to test efficiency, and achieved competitive results.

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700