论文标题
Apache软件基金会在大数据项目中的作用
Role of Apache Software Foundation in Big Data Projects
论文作者
论文摘要
随着每年生成的大数据数量的增加,为存储,处理和分析大数据而开发和使用的工具和技术也有所改善。开源软件一直是大数据领域成功和创新的重要因素,而Apache Software Foundation(ASF)通过提供许多最先进的项目,免费并向公众开放,在这一成功和创新中发挥了至关重要的作用。 ASF已将其项目分为不同的类别。在本报告中,对大数据类别列出的项目进行了深入分析和讨论,并参考定义的七个子类别。我们的调查表明,许多Apache Big Data项目都是自主的,但有些是基于其他Apache项目而构建的,并且有些工作与其他项目结合在一起,以改善和简化大数据领域的开发。
With the increase in amount of Big Data being generated each year, tools and technologies developed and used for the purpose of storing, processing and analyzing Big Data has also improved. Open-Source software has been an important factor in the success and innovation in the field of Big Data while Apache Software Foundation (ASF) has played a crucial role in this success and innovation by providing a number of state-of-the-art projects, free and open to the public. ASF has classified its project in different categories. In this report, projects listed under Big Data category are deeply analyzed and discussed with reference to one-of-the seven sub-categories defined. Our investigation has shown that many of the Apache Big Data projects are autonomous but some are built based on other Apache projects and some work in conjunction with other projects to improve and ease development in Big Data space.