论文标题

对OCR和文档理解的深度学习方法的调查

A Survey of Deep Learning Approaches for OCR and Document Understanding

论文作者

Subramani, Nishant, Matton, Alexandre, Greaves, Malcolm, Lam, Adrian

论文摘要

文件是许多领域的许多业务的核心部分,例如法律,金融和技术。自动理解诸如发票,合同和简历之类的文件是有利可图的,这开辟了许多新的业务途径。自然语言处理和计算机视觉领域通过发展深度学习的发展取得了巨大的进步,使得这些方法开始在当代文档理解系统中注入。在本调查文件中,我们回顾了用英语编写的文档的文档理解的不同技术,并巩固了文献中存在的方法,以作为研究人员探索该领域的研究人员的起点。

Documents are a core part of many businesses in many fields such as law, finance, and technology among others. Automatic understanding of documents such as invoices, contracts, and resumes is lucrative, opening up many new avenues of business. The fields of natural language processing and computer vision have seen tremendous progress through the development of deep learning such that these methods have started to become infused in contemporary document understanding systems. In this survey paper, we review different techniques for document understanding for documents written in English and consolidate methodologies present in literature to act as a jumping-off point for researchers exploring this area.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源