论文标题

安全:可扩展的自动功能工程框架用于工业任务

SAFE: Scalable Automatic Feature Engineering Framework for Industrial Tasks

论文作者

Shi, Qitao, Zhang, Ya-Lin, Li, Longfei, Yang, Xinxing, Li, Meng, Zhou, Jun

论文摘要

机器学习技术已被广泛应用于互联网公司,用于各种任务,充当基本的驱动力,并且在构建机器学习系统时,特征工程通常被认为是至关重要的Tache。最近,为开发自动功能工程方法做出了越来越多的努力,以便可以解放出巨大而乏味的手动努力。但是,对于工业任务,这些方法的效率和可扩展性仍然远非令人满意。在本文中,我们提出了一种名为Safe(可扩展的自动功能工程)的分阶段方法,该方法可以提供出色的效率和可扩展性,以及必要的可解释性和有希望的性能。进行了广泛的实验,结果表明,在与其他方法进行比较时,提出的方法可以提供明显的效率和竞争效率。更重要的是,所提出的方法的适当可伸缩性可确保将其部署到大规模的工业任务中。

Machine learning techniques have been widely applied in Internet companies for various tasks, acting as an essential driving force, and feature engineering has been generally recognized as a crucial tache when constructing machine learning systems. Recently, a growing effort has been made to the development of automatic feature engineering methods, so that the substantial and tedious manual effort can be liberated. However, for industrial tasks, the efficiency and scalability of these methods are still far from satisfactory. In this paper, we proposed a staged method named SAFE (Scalable Automatic Feature Engineering), which can provide excellent efficiency and scalability, along with requisite interpretability and promising performance. Extensive experiments are conducted and the results show that the proposed method can provide prominent efficiency and competitive effectiveness when comparing with other methods. What's more, the adequate scalability of the proposed method ensures it to be deployed in large scale industrial tasks.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源