论文标题
NSTM:彭博社的实时查询驱动的新闻概述构图
NSTM: Real-Time Query-Driven News Overview Composition at Bloomberg
论文作者
论文摘要
全球成千上万来源的数百万新闻报道每天都在新闻集合商中出现。消费这样的新闻几乎带来了几乎无法克服的挑战。例如,一位读者在彭博系统中搜索有关英国的新闻的读者将在典型的一天中找到10,000篇文章。苹果公司(Apple Inc. 我们意识到需要一种新型的摘要引擎,它将大量新闻浓缩为简短,易于吸收的点。该系统将过滤噪音和重复,以识别和总结有关公司,国家或市场的关键新闻。 当给出用户查询,彭博的解决方案,关键新闻主题(或NSTM)时,请利用最先进的语义聚类技术和新颖的摘要方法来产生全面而简洁的消化,以极大地简化新闻消费过程。 NSTM可供全球成千上万的读者使用,每天都有次秒延迟的要求。在ACL 2020,我们将展示NSTM的演示。
Millions of news articles from hundreds of thousands of sources around the globe appear in news aggregators every day. Consuming such a volume of news presents an almost insurmountable challenge. For example, a reader searching on Bloomberg's system for news about the U.K. would find 10,000 articles on a typical day. Apple Inc., the world's most journalistically covered company, garners around 1,800 news articles a day. We realized that a new kind of summarization engine was needed, one that would condense large volumes of news into short, easy to absorb points. The system would filter out noise and duplicates to identify and summarize key news about companies, countries or markets. When given a user query, Bloomberg's solution, Key News Themes (or NSTM), leverages state-of-the-art semantic clustering techniques and novel summarization methods to produce comprehensive, yet concise, digests to dramatically simplify the news consumption process. NSTM is available to hundreds of thousands of readers around the world and serves thousands of requests daily with sub-second latency. At ACL 2020, we will present a demo of NSTM.