• English

  • 北京外国语大学官网

秦颖

信息科学技术学院教授,博士生导师,主要研究方向:计算语言学,自然语言处理,机器生成内容评价等。

Applying Frequency and Location Information to Keyword Extraction in Single Document

发布时间:2025-01-03 点击次数:

  • 所属单位:信息科学技术学院
  • 发表刊物:Proceeding of CCIS
  • 项目来源:国家社科基金项目
  • 摘要:Keyword extraction from single document is not same to the task of text classification, in which a collection of texts can be compared and referred to. The paper focuses on the keyword extraction based on statistical information of words, that is, self features of keywords in the single document. Besides of general features such as word frequency and POS of a word, location features of a keyword are deep investigated and applied to select the candidate words. Experimental results of the extraction approach based on this method outperform TFIDF, TextRank and other unsupervised methods by comparing with them on the same corpus.
  • 论文类型:论文集
  • 是否译文:
  • 发表时间:2012-10-18
  • 第一作者:秦颖