題名: Improving the Syntax-based Retrieval System Using Collocation Indexing
作者: Chen, Ruey-Jinng
Kuo, Chin-Hwa
Tsao, Nai-Lung
Hung, Tsung-Fu
關鍵字: POS tagging
Lemmatizing
Collocation
k-gram
Indexing
期刊名/會議名稱: 2008 ICS會議
摘要: The purpose of this paper is to design a syntax search system and to apply it to a movie search system. The concepts applied include those in the field of linguistics and collocation, to increase the speed of the syntax search system. First, we must process the keywords in the database by labeling them according to their part of speech. From the results of the process, we will construct a K-gram index and Collocation index.In this proposal we bring out a few examples of common English syntax rules and sentence structures as test models. After the run through, the K-gram index and the Collocation index are compared. We have found that part of the sentence, after having gone through the Collocation index search, has a far smaller sample space that the K-gram index alone, which is to say that the Collocation index is able to find the most correct result from fewer samples, thus minimizing the time cost in Query Match.
日期: 2009-01-03T09:13:49Z
分類:2008年 ICS 國際計算機會議

文件中的檔案:
檔案 描述 大小格式 
ce07ics002008000003.pdf264.4 kBAdobe PDF檢視/開啟


在 DSpace 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。