模型压缩对面部表达识别中公平性的影响

论文标题

模型压缩对面部表达识别中公平性的影响

The Effect of Model Compression on Fairness in Facial Expression Recognition

论文作者

Stoychev, Samuil, Gunes, Hatice

论文摘要

事实证明，深层神经网络已取得了巨大的成功，在各种任务上取得了类似人类的表现。但是，它们在计算上也很昂贵，这激发了模型压缩技术的开发，这些技术减少了与深度学习模型相关的资源消耗。然而，最近的研究表明，模型压缩可能会对算法公平产生不利影响，从而扩大机器学习模型中的现有偏见。在这个项目中，我们旨在将这些研究扩展到面部表达识别的背景。为此，我们设置了一个神经网络分类器，以执行面部表达识别并在其上实现多种模型压缩技术。然后，我们在两个面部表达数据集上进行实验，即扩展的Cohn-Kanade数据集（CK+DB）和现实世界情感面部数据库（RAF-DB），以检查压缩技术对模型的大小，准确性和公平性具有的单个和组合效果。我们的实验结果表明：（i）压缩和定量的模型大小显着降低，对CK+DB和RAF-DB的总体准确性的影响最小；（ii）就模型精度而言，与CK+ DB相比，在RAF-DB上训练和测试的分类器似乎更强大；（iii）对于RAF-DB，不同的压缩策略似乎并没有增加性别，种族和年龄敏感属性的预测性能差距，这与CK+DB的结果形成鲜明对比，在CK+DB上，压缩似乎会扩大对性别的现有偏见。我们分析结果并讨论我们发现的潜在原因。

Deep neural networks have proved hugely successful, achieving human-like performance on a variety of tasks. However, they are also computationally expensive, which has motivated the development of model compression techniques which reduce the resource consumption associated with deep learning models. Nevertheless, recent studies have suggested that model compression can have an adverse effect on algorithmic fairness, amplifying existing biases in machine learning models. With this project we aim to extend those studies to the context of facial expression recognition. To do that, we set up a neural network classifier to perform facial expression recognition and implement several model compression techniques on top of it. We then run experiments on two facial expression datasets, namely the Extended Cohn-Kanade Dataset (CK+DB) and the Real-World Affective Faces Database (RAF-DB), to examine the individual and combined effect that compression techniques have on the model size, accuracy and fairness. Our experimental results show that: (i) Compression and quantisation achieve significant reduction in model size with minimal impact on overall accuracy for both CK+DB and RAF-DB; (ii) in terms of model accuracy, the classifier trained and tested on RAF-DB seems more robust to compression compared to the CK+ DB; (iii) for RAF-DB, the different compression strategies do not seem to increase the gap in predictive performance across the sensitive attributes of gender, race and age which is in contrast with the results on the CK+DB, where compression seems to amplify existing biases for gender. We analyse the results and discuss the potential reasons for our findings.

下载PDF全文

下载文献需遵守相关版权规定

论文标题