论文标题
通过意识到公开演讲的口头和手势质量,评级预测的公平性
Fairness in Rating Prediction by Awareness of Verbal and Gesture Quality of Public Speeches
论文作者
论文摘要
数十年来,言语和非语言提示在伟大的公开演讲中的作用一直是探索的话题。我们确定了跨渠道或交流方式中“多样性或异质性”的元素(例如,诉诸故事,科学事实,情感联系,面部表情等)的共同点,这对于有效传达信息至关重要。我们使用此观察结果将新型的异质性度量标准正式化,该指标量化了口头和非语言领域(成绩单和面部手势)中谈话的质量。我们将TED演讲用作公开演讲的输入存储库,因为它由来自多元化社区的演讲者组成,除了进行广泛的宣传。我们表明,下摆与观众对演讲者的TED演讲的评分之间存在有趣的关系。它强调下摆固有而成功地代表了基于“品种或异质性”的演讲质量。此外,我们还发现,下摆成功地捕获了种族和性别评级中普遍的偏见,我们称之为敏感属性(因为基于这些的预测可能会导致不公平的结果)。我们将HEM指标纳入神经网络的损失函数,目的是减少对种族和性别的评级预测的不公平性。我们的结果表明,修改后的损失函数可以提高预测的公平性,而不会极大地影响神经网络的预测准确性。我们的作品将一个新颖的指标与言语和非语言领域的公开演讲联系在一起,并与神经网络的计算能力一起设计,以设计扬声器的公平预测系统。
The role of verbal and non-verbal cues towards great public speaking has been a topic of exploration for many decades. We identify a commonality across present theories, the element of "variety or heterogeneity" in channels or modes of communication (e.g. resorting to stories, scientific facts, emotional connections, facial expressions etc.) which is essential for effectively communicating information. We use this observation to formalize a novel HEterogeneity Metric, HEM, that quantifies the quality of a talk both in the verbal and non-verbal domain (transcript and facial gestures). We use TED talks as an input repository of public speeches because it consists of speakers from a diverse community besides having a wide outreach. We show that there is an interesting relationship between HEM and the ratings of TED talks given to speakers by viewers. It emphasizes that HEM inherently and successfully represents the quality of a talk based on "variety or heterogeneity". Further, we also discover that HEM successfully captures the prevalent bias in ratings with respect to race and gender, that we call sensitive attributes (because prediction based on these might result in unfair outcome). We incorporate the HEM metric into the loss function of a neural network with the goal to reduce unfairness in rating predictions with respect to race and gender. Our results show that the modified loss function improves fairness in prediction without considerably affecting prediction accuracy of the neural network. Our work ties together a novel metric for public speeches in both verbal and non-verbal domain with the computational power of a neural network to design a fair prediction system for speakers.