Automatic Keyphrase Extraction for Multi Document

Abstract

Keyphrase extraction only consider the connections between words in a document, ignoring the impact of the sentence. Motivated by the fact that a word must be important if it appears in many important sentences, we propose to take full advantage of the reinforcement between words and sentences by melting three kinds of relationships between them. Moreover, a document is grouped with many topics. The extracted keyphrases should be synthetic in the sense that they should deal with all the main topics in a document. Inspired by this, we take topic model into consider. Experimental results show that our approach performs better than state-of-the-art keyphrase extraction method on two datasets under three evaluation metrics.

Country : India

1 Aluri brahma reddy2 Dr.Vaka Muralimohan3 Dr.Kanaka Durga Returi

  1. 1Research Scholar, Chaudhary Charan Singh University Meerut, Uttar Pradesh, India
  2. 3Professor, Department of CSE, Malla Reddy College of Engineering for Women, Hyderabad, India
  3. 3Professor, Department of CSE, Malla Reddy College of Engineering for Women, Hyderabad, India

IRJIET, Volume 3, Issue 11, November 2019 pp. 110-112

.

References

  1. C.D. Manning and H. Schutze. Foundations of statistical natural language processing. MIT Press. 2000.
  2. KaziSaidul Hasan and Vincent Ng. Automatic Keyphrase Extraction: A survey of the State of the Art. (2015).
  3. Mihalcea and P. Tarau. 2004. TextRank: Bringing order into texts. In Proceedings of EMNLP(2004)
  4. 255Yan Ying et al. / Procedia Computer Science 107 (2017) 248 – 255.
  5. Xiaojun Wan and Jianguo Xiao. CollabRank: Towards a collaborative approach to single-document keyphrase extraction. In Proceedings of the 22nd International Conference on Computational Linguistics, pages 969–976, (2008a).
  6. Liu, Z., Li, P., Zheng, Y., Sun, M.: Clustering to find exemplar terms for keyphrase extraction. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing 257–266 (2009).
  7. Eibe Frank, Gordon W. Paynter, Ian H. Witten, Carl Gutwin, and Craig G. Nevill-Manning. Domain-specific keyphrase extraction. In Proceedings of 16th International Joint Conference on Artificial Intelligence, pages 668–673(1999).
  8. Gerard Salton and Christopher Buckley. Term weighting approaches in automatic text retrieval. Information Processing and Management, 24(5):513–523)(1988).
  9. Zhiyuan Liu, Wenyi Huang, Yabin Zheng, and Maosong Sun. Automatic keyphrase extraction via topic decomposition. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pages 366–376.(2010).
  10. Sergey Brin and Lawrence Page. The anatomy of a large-scale hyper textual Web search engine, Computer Networks, 30(1–7):107–117(1998).
  11. HongyuanZha. Generic summarization and keyphrase extraction using mutual reinforcement principle and sentence clustering. In Proceedings of 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 113–120(2002).
  12. R. Collobert, J. Weston, L. Bottou, M. Karlen, K. Kavukcuoglu and P. Kuksa. Natural Language Processing (Almost) from Scratch, Journal of Machine Learning Research (JMLR), (2011).
  13. R. Collobert. Deep Learning for Efficient Discriminative Parsing, in International Conference on Artificial Intelligence and Statistics (AISTATS), (2011).
  14. Hartigan J A, Wong M A. Algorithm AS 136: A K-Means Clustering Algorithm[J]. Applied Statistics, 28(1):100-108. (1979).
  15. Anette Hulth. Improved automatic keyword extraction given more linguistic knowledge. In Proceedings of EMNLP, pages 216–223, (2003).