论文标题
从规则中学习概括标记的示例
Learning from Rules Generalizing Labeled Exemplars
论文作者
论文摘要
在许多应用中,标记的数据不容易获得,需要通过痛苦的人类监督来收集。我们提出了一种规则执行方法,用于收集人类监督,以将规则的效率与实例标签的质量相结合。监督的耦合使得它对人类既自然又对学习是协同作用。我们提出了一种培训算法,该算法通过潜在覆盖变量共同确定规则,并通过对覆盖范围和标签变量的软含义损失进行训练。被授予的规则和训练有素的模型共同用于推断。对五个不同任务的经验评估表明,(1)我们的算法比从干净和嘈杂的监督的混合中学习的几种现有方法更准确,并且(2)耦合规则 - 任命监督有效地是有效的。
In many applications labeled data is not readily available, and needs to be collected via pain-staking human supervision. We propose a rule-exemplar method for collecting human supervision to combine the efficiency of rules with the quality of instance labels. The supervision is coupled such that it is both natural for humans and synergistic for learning. We propose a training algorithm that jointly denoises rules via latent coverage variables, and trains the model through a soft implication loss over the coverage and label variables. The denoised rules and trained model are used jointly for inference. Empirical evaluation on five different tasks shows that (1) our algorithm is more accurate than several existing methods of learning from a mix of clean and noisy supervision, and (2) the coupled rule-exemplar supervision is effective in denoising rules.