论文标题
关于信号分解的有效性,特征提取和选择肺部声音分类
On the effectiveness of signal decomposition, feature extraction and selection on lung sound classification
论文作者
论文摘要
肺部声音是指通过呼吸系统流动的空气产生的声音。这些声音,因为大多数生物医学信号是非线性和非平稳性的。使用肺部声音检测的重要部分是歧视正常的肺声和异常肺声。在本文中,探索了几种分类的方法,用于探索无裂缝和crack肺声音。分解方法,例如经验模式分解,集合经验模式分解和离散小波转换以及多种功能提取技术(例如主成分分析和自动编码器),以探索各种分类器在给定任务中的性能。从Kaggle下载的开源数据集,其中包含质量不同的胸部听诊来确定使用不同分解和特征提取组合的结果。据发现,当高阶统计和光谱特征以及MEL频率Cepstral系数被馈送到Classier时,我们可以通过KNN分类器获得最佳性能。此外,还证明,使用特征选择方法的组合可以显着减少输入功能的数量,而不会不利地影响分类器的准确性。
Lung sounds refer to the sound generated by air moving through the respiratory system. These sounds, as most biomedical signals, are non-linear and non-stationary. A vital part of using the lung sound for disease detection is discrimination between normal lung sound and abnormal lung sound. In this paper, several approaches for classifying between no-crackle and crackle lung sounds are explored. Decomposition methods such as Empirical Mode Decomposition, Ensemble Empirical Mode Decomposition, and Discrete Wavelet Transform are used along with several feature extraction techniques like Principal Component Analysis and Autoencoder, to explore how various classifiers perform for the given task. An open-source dataset downloaded from Kaggle, containing chest auscultation of varying quality is used to determine the results of using the different decomposition and feature extraction combinations. It is found that when higher-order statistical and spectral features along with the Mel-frequency cepstral coefficients are fed to the classier we get the best performance with the kNN classifier giving the best accuracy. Furthermore, it is also demonstrated that using a combination of feature selection methods one can significantly reduce the number of input features without adversely affecting the accuracy of the classifiers.