论文标题
法律沉积的方面分类
Aspect Classification for Legal Depositions
论文作者
论文摘要
律师和其他人对拥有合适的服务(例如,总结,搜索和浏览)的数字图书馆有浓厚的兴趣,以帮助他们使用大量的法律证词。他们的需求通常涉及了解此类文档的语义。这部分取决于诉讼的作用,例如原告,被告,执法人员,专家等。在与财产和伤亡保险索赔相关的侵权诉讼的情况下,例如与伤害有关的诉讼,不仅要了解责任,而且还了解事件,事件,事件,身体状况和治疗。 我们假设法律证词由作为Deponent证词的一部分讨论的各个方面组成。因此,我们开发了有关事故和伤害案件法律证词中方面的本体论。使用它,我们开发了一个分类器,可以为各个方面识别部分文本。这种类型的特殊性(例如,沉积转录本通常由问题解答(QA)对形式组成的数据)使这样做很复杂。因此,我们的自动化系统从预处理开始,然后将质量检查对转换为由声明性句子组成的规范形式。然后,根据这一方面生成的声明性句子可以帮助完成下游任务,例如摘要,细分,提问和信息检索。 我们的方法已达到0.83的分类F1得分。将各个方面的精度分类为良好,将有助于选择可以用作候选摘要句子的质量检查对,并为法律专业人士或保险索赔代理人提供信息的摘要。我们的方法可以扩展到其他类型的法律证词,并提供诸如搜索之类的服务。
Attorneys and others have a strong interest in having a digital library with suitable services (e.g., summarizing, searching, and browsing) to help them work with large corpora of legal depositions. Their needs often involve understanding the semantics of such documents. That depends in part on the role of the deponent, e.g., plaintiff, defendant, law enforcement personnel, expert, etc. In the case of tort litigation associated with property and casualty insurance claims, such as relating to an injury, it is important to know not only about liability, but also about events, accidents, physical conditions, and treatments. We hypothesize that a legal deposition consists of various aspects that are discussed as part of the deponent testimony. Accordingly, we developed an ontology of aspects in a legal deposition for accident and injury cases. Using that, we have developed a classifier that can identify portions of text for each of the aspects of interest. Doing so was complicated by the peculiarities of this genre, e.g., that deposition transcripts generally consist of data in the form of question-answer (QA) pairs. Accordingly, our automated system starts with pre-processing, and then transforms the QA pairs into a canonical form made up of declarative sentences. Classifying the declarative sentences that are generated, according to the aspect, can then help with downstream tasks such as summarization, segmentation, question-answering, and information retrieval. Our methods have achieved a classification F1 score of 0.83. Having the aspects classified with a good accuracy will help in choosing QA pairs that can be used as candidate summary sentences, and to generate an informative summary for legal professionals or insurance claim agents. Our methodology could be extended to legal depositions of other kinds, and to aid services like searching.