论文标题

修订文档图像中的键值检测的FUNSD数据集

Revising FUNSD dataset for key-value detection in document images

论文作者

Vu, Hieu M., Nguyen, Diep Thi-Ngoc

论文摘要

FUNSD是有限的公开数据集之一,可从文档IMAGE中提取信息。 FUNSD数据集中的信息由四个类别的文本区域(“键”,“值”,“标题”,“其他”和“背景”)定义,以及作为键值关系之间的连接性。在镜头上,我们发现标签上有几种不一致的不一致性,这阻碍了其对key-value提取问题的适用性。在本报告中,我们描述了FUNSD和数据集中的一些标签问题。我们还报告了使用UNET模型作为基线结果以及具有Channel-InvariantDeformable卷积的改进的UNET模型的键值检测实施。

FUNSD is one of the limited publicly available datasets for information extraction from document im-ages. The information in the FUNSD dataset is defined by text areas of four categories ("key", "value", "header", "other", and "background") and connectivity between areas as key-value relations. In-specting FUNSD, we found several inconsistency in labeling, which impeded its applicability to thekey-value extraction problem. In this report, we described some labeling issues in FUNSD and therevision we made to the dataset. We also reported our implementation of for key-value detection onFUNSD using a UNet model as baseline results and an improved UNet model with Channel-InvariantDeformable Convolution.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源