论文标题
一维多项式神经网络,用于音频信号相关问题
1-Dimensional polynomial neural networks for audio signal related problems
论文作者
论文摘要
除了极其非线性外,现代问题还需要数百万(如果不是数十亿个参数)来解决或至少要获得溶液的近似值,并且已知神经网络可以通过加深和扩大拓扑来吸收这种复杂性,以提高获得更好近似值所需的非线性性水平。但是,紧凑的拓扑始终比更深的拓扑提供了使用较少的计算单元和更少参数的优势。这种综合性是基于降低的非线性性,因此是有限的解决方案搜索空间的代价。我们提出了一维多项式神经网络(1DPNN)模型,该模型使用自动多项式内核来完成1维卷积神经网络(1DCNN),并引入了第一层的高度非线性,可以补偿对深层和/或/或宽拓扑的需求。我们表明,这种非线性使该模型能够在与音频信号相关的各种分类和回归问题上的常规1DCNN产生更好的计算和空间复杂性结果,即使它在神经元水平上引入了更多的计算和空间复杂性。实验是在三个可公开可用的数据集上进行的,并证明,在解决的问题上,所提出的模型可以从数据中提取更多相关信息,而不是在更少的时间内,记忆力较少。
In addition to being extremely non-linear, modern problems require millions if not billions of parameters to solve or at least to get a good approximation of the solution, and neural networks are known to assimilate that complexity by deepening and widening their topology in order to increase the level of non-linearity needed for a better approximation. However, compact topologies are always preferred to deeper ones as they offer the advantage of using less computational units and less parameters. This compacity comes at the price of reduced non-linearity and thus, of limited solution search space. We propose the 1-Dimensional Polynomial Neural Network (1DPNN) model that uses automatic polynomial kernel estimation for 1-Dimensional Convolutional Neural Networks (1DCNNs) and that introduces a high degree of non-linearity from the first layer which can compensate the need for deep and/or wide topologies. We show that this non-linearity enables the model to yield better results with less computational and spatial complexity than a regular 1DCNN on various classification and regression problems related to audio signals, even though it introduces more computational and spatial complexity on a neuronal level. The experiments were conducted on three publicly available datasets and demonstrate that, on the problems that were tackled, the proposed model can extract more relevant information from the data than a 1DCNN in less time and with less memory.