INFORMATION RETRIEVAL OF TEXT DOCUMENT WITH WEIGHTING TF-IDF AND LCS
Abstract: Information retrieval of text documents requires a method that can
return the documents most relevant to the user's request. One important step in
the process is term weighting, which determines how the text is represented.
Using the longest common subsequence (LCS) to adjust Tf-Idf weights takes into
account words that appear in the same order in the query and in the document
text. However, a very long but irrelevant document can receive a weight that
does not represent its actual relevance. This research proposes using LCS to
weight word order while taking into account the length of each document
relative to the average document length in the corpus. This method is able to
return text documents effectively. Adding the word-order feature, normalized by
the ratio of a document's length to the average length of the documents in the
corpus, yields precision and recall values as good as those of the method of
Tasi et al.
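
The idea described in the abstract can be illustrated with a minimal sketch.
The abstract does not give the weighting formula, so everything specific here
is an assumption made for illustration: the function names lcs_length and
rank, the parameter alpha, whitespace tokenization, the plain tf * log(N/df)
form of Tf-Idf, and the way the LCS length is divided by the ratio of document
length to average document length. It is not the authors' implementation.

```python
import math
from collections import Counter


def lcs_length(a, b):
    """Length of the longest common subsequence of two token lists."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rank(query, docs, alpha=1.0):
    """Score each document: Tf-Idf match plus a length-normalized LCS bonus.

    Assumed combination (not taken from the paper):
        score = tfidf + alpha * LCS(query, doc) / (len(doc) / avg_len)
    """
    if not docs:
        return []
    q = query.lower().split()
    tokens = [d.lower().split() for d in docs]
    n = len(tokens)
    avg_len = sum(len(t) for t in tokens) / n
    # Document frequency: number of documents containing each term.
    df = Counter(term for t in tokens for term in set(t))
    scores = []
    for t in tokens:
        if not t:
            scores.append(0.0)  # an empty document cannot match anything
            continue
        tf = Counter(t)
        tfidf = sum(tf[w] * math.log(n / df[w]) for w in q if df[w])
        # Word-order bonus shrinks for documents longer than average,
        # so a long document is not rewarded just for its length.
        order_bonus = lcs_length(q, t) / (len(t) / avg_len)
        scores.append(tfidf + alpha * order_bonus)
    return scores


docs = [
    "information retrieval of text",
    "text of retrieval information",
    "cooking recipes for dinner",
]
scores = rank("information retrieval", docs)
```

In this toy corpus the first two documents contain the same words and have the
same length, so their Tf-Idf parts are equal. Only the first preserves the
query's word order, so under the assumed formula it scores higher.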
Author: Munjiah Nur Saadah, Rigga Widar Atmagi, Dyah S. Rahayu, Agus Zainal Arifin
Journal Code: jptkomputergg130010