论文标题

基于文本挖掘的课程推荐

Curriculum Vitae Recommendation Based on Text Mining

论文作者

Alanoca, Honorio Apaza, Vidal, Americo A. Rubin de Celis, Saire, Josimar Edinson Chire

论文摘要

在过去的几年中,与计算机科学和互联网有关的不同领域的发展允许在选择州和私人公司的人员中为决策提供新的替代方案。为了优化此选择过程,推荐系统最适合与雇主或最终用户的喜欢和不喜欢的明确信息一起工作,因为此信息允许根据协作或内容的相似性生成建议列表。因此,这项研究以课程和工作报价数据库中包含的这些特征为基础,这些特征与秘鲁范围相对应,秘鲁范围凸显了每个候选人的经验,知识和技能,这些经验,知识和技能是用文本术语或单词描述的。这项研究的重点是问题:我们如何从有关工作优惠的非结构化信息和不同网站上的课程中利用,以供简历推荐。因此,我们使用文本挖掘和自然语言处理的技术。然后,作为本研究的相关技术,我们强调了术语术语的技术频率 - 文档的反频率(TF-IDF),该技术允许通过平均值(TF-IDF)识别与网站工作机会相关的最相关的CV。因此,加权值可以用作相关课程Vitae的资格值。

During the last years, the development in diverse areas related to computer science and internet, allowed to generate new alternatives for decision making in the selection of personnel for state and private companies. In order to optimize this selection process, the recommendation systems are the most suitable for working with explicit information related to the likes and dislikes of employers or end users, since this information allows to generate lists of recommendations based on collaboration or similarity of content. Therefore, this research takes as a basis these characteristics contained in the database of curricula and job offers, which correspond to the Peruvian ambit, which highlights the experience, knowledge and skills of each candidate, which are described in textual terms or words. This research focuses on the problem: how we can take advantage from the growth of unstructured information about job offers and curriculum vitae on different websites for CV recommendation. So, we use the techniques from Text Mining and Natural Language Processing. Then, as a relevant technique for the present study, we emphasize the technique frequency of the Term - Inverse Frequency of the documents (TF-IDF), which allows identifying the most relevant CVs in relation to a job offer of website through the average values (TF-IDF). So, the weighted value can be used as a qualification value of the relevant curriculum vitae for the recommendation.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源