论文标题
Stacerbot:堆栈搜索引擎用于堆栈溢出
StacerBot: A Stacktrace Search Engine for Stack Overflow
论文作者
论文摘要
作为软件开发人员或研究人员,我们通常在尝试编写一些代码或安装一些软件包时会收到StackTrace错误消息。很多时候,这些错误消息非常晦涩和详细。对我们没有多大意义。其他人也很有可能在各种在线开发人员的论坛上共享类似的问题。但是,传统的Google搜索或其他搜索引擎对于找到具有类似stackTraces的网页并不是很有帮助。为了解决此问题,我们已经开发了一个Web界面。一个更好的搜索引擎:作为该研究项目的结果,用户可以通过提交整个StackTrace错误消息找到适当的堆栈溢出帖子。当前开发的解决方案可以使用带有300GB RAM的服务器在50秒内使用顶部匹配的堆栈溢出帖子提供实时并行用户查询。这项研究提供了本研究中使用的NLP技术的全面概述以及研究管道的广泛概述。未来的研究人员和软件开发人员可以使用本研究中提到的全面结果,局限性和计算间接费用,以解决相同问题或类似的大规模文本匹配相关任务的更好解决方案。
We as software developers or researchers very often get stacktrace error messages while we are trying to write some code or install some packages. Many times these error messages are very obscure and verbose; do not make much sense to us. There is a good chance that someone else has also faced similar issues probably shared similar stacktrace in various online developers' forums. However traditional google searches or other search engines are not very helpful to find web pages with similar stacktraces. In order to address this problem, we have developed a web interface; a better search engine: as an outcome of this research project where users can find appropriate stack overflow posts by submitting the whole stacktrace error message. The current developed solution can serve real-time parallel user queries with top-matched stack overflow posts within 50 seconds using a server with 300GB RAM. This study provides a comprehensive overview of the NLP techniques used in this study and an extensive overview of the research pipeline. This comprehensive result, limitations, and computational overhead mentioned in this study can be used by future researchers and software developers to build a better solution for this same problem or similar large-scale text matching-related tasks.