Improving Results of TF-IDF based Retrieval System using Co-reference Resolution and Pronoun Substitution

Authors

  • S. Srihari, SCOPE, Vellore Institute of Technology, Chennai, Tamilnadu, India
  • Dr. M. Premalatha, Associate Professor, SCOPE, Vellore Institute of Technology, Chennai, Tamilnadu, India
October 24, 2020

Information Retrieval systems retrieve relevant information in response to user queries. TF-IDF is one of the most popular Information Retrieval techniques; it is widely used and has been successful in retrieving relevant information, but it still has some disadvantages. In this paper we propose a method to improve the performance of TF-IDF based systems using Co-reference Resolution and Pronoun Substitution. The system is found to be effective: the ranking order of the retrieved documents changes significantly, because a relatively larger amount of content is taken into consideration during the retrieval process. The observed improvement is analysed graphically through visualizations of TF-IDF, Cosine Similarity, and effective improvement in rank for various documents before and after the change of algorithm.
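
The idea can be illustrated with a minimal sketch; it is not the implementation from the paper. It assumes a hand-written pronoun-to-antecedent map (the hypothetical `antecedents_per_doc`) in place of a real co-reference resolver, a toy two-document corpus, and one common smoothed IDF variant. All function names are illustrative. Replacing "he" with "einstein" raises the term frequency of "einstein" in the first document, which raises its cosine similarity to the query.

```python
import math
from collections import Counter

def tokenize(text):
    # Lowercase, split on whitespace, and strip basic punctuation.
    return [w.strip(".,;:!?").lower() for w in text.split() if w.strip(".,;:!?")]

def substitute_pronouns(tokens, antecedents):
    # Replace each pronoun by its antecedent. `antecedents` is a hand-written
    # stand-in (an assumption) for the output of a co-reference resolver.
    return [antecedents.get(t, t) for t in tokens]

def idf_table(docs_tokens):
    # Smoothed inverse document frequency: log((1 + N) / (1 + df)) + 1.
    n = len(docs_tokens)
    df = Counter(t for toks in docs_tokens for t in set(toks))
    return {t: math.log((1 + n) / (1 + d)) + 1.0 for t, d in df.items()}

def vectorize(tokens, idf):
    # TF-IDF weights with length-normalised term frequency;
    # terms outside the corpus vocabulary are ignored.
    tf = Counter(tokens)
    return {t: (c / len(tokens)) * idf[t] for t, c in tf.items() if t in idf}

def cosine(a, b):
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def score(query, docs_tokens):
    # Cosine similarity between the query and every document.
    idf = idf_table(docs_tokens)
    q = vectorize(tokenize(query), idf)
    return [cosine(q, vectorize(toks, idf)) for toks in docs_tokens]

# Toy corpus (illustrative only).
docs = [
    "Einstein developed relativity. He won the Nobel Prize. He lived in Princeton.",
    "Newton studied gravity and optics.",
]
# Hypothetical resolver output: one pronoun map per document.
antecedents_per_doc = [{"he": "einstein"}, {}]

plain = [tokenize(d) for d in docs]
resolved = [substitute_pronouns(t, m) for t, m in zip(plain, antecedents_per_doc)]

before = score("einstein", plain)
after = score("einstein", resolved)
print("before substitution:", before)
print("after substitution: ", after)
```

In a real system the pronoun map would come from a co-reference resolution step run over each document before indexing; the TF-IDF and cosine similarity computations themselves are unchanged.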