Latent Semantic Indexing (LSI) is a method used for text categorization as an improvement on the vector space model (VSM). The idea behind LSI is to map each document vector and test vector into a lower-dimensional space associated with concepts, and to compare the documents in that space.
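To make the idea concrete for myself, here is a minimal sketch of LSI using a truncated SVD in NumPy. The tiny term-document matrix, the term list, and the helper names (`lsi_fit`, `fold_in`, `cosine`) are all my own assumptions for illustration; they do not come from the courseware or the papers.

```python
import numpy as np

# Hypothetical toy corpus: rows are terms, columns are documents.
# d0 = "car engine", d1 = "automobile engine",
# d2 = "flower petal", d3 = "garden petal"
terms = ["car", "automobile", "engine", "flower", "garden", "petal"]
A = np.array([
    [1, 0, 0, 0],  # car
    [0, 1, 0, 0],  # automobile
    [1, 1, 0, 0],  # engine
    [0, 0, 1, 0],  # flower
    [0, 0, 0, 1],  # garden
    [0, 0, 1, 1],  # petal
], dtype=float)


def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom > 0 else 0.0


def lsi_fit(A, k):
    """Truncated SVD A ~ U_k diag(s_k) V_k^T; row j of V_k is document j."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    return U[:, :k], s[:k], Vt[:k, :].T


def fold_in(q, U_k, s_k):
    """Map a term-space vector into the k-dimensional concept space."""
    return (q @ U_k) / s_k


U_k, s_k, docs_k = lsi_fit(A, k=2)

# Test vector containing only the term "car".
q = np.zeros(len(terms))
q[terms.index("car")] = 1.0
q_k = fold_in(q, U_k, s_k)

n_docs = A.shape[1]
vsm_sims = [cosine(q, A[:, j]) for j in range(n_docs)]      # plain VSM
lsi_sims = [cosine(q_k, docs_k[j]) for j in range(n_docs)]  # concept space

print("VSM:", np.round(vsm_sims, 3))
print("LSI:", np.round(lsi_sims, 3))
```

In this toy case the VSM cosine between the "car" test vector and d1 ("automobile engine") is 0 because they share no term, while in the 2-dimensional concept space their cosine is close to 1, since "car" and "automobile" both co-occur with "engine". The number of dimensions `k` is a free choice here; how to pick it for a real collection is something I still need to check.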
In Dr. Tliu's courseware on Information Retrieval, I saw some examples of it. After reading some papers on LSI that I downloaded from CNKI, I believe it could be used in my evaluation task. I should discuss this with Dr. Tliu and my senior.
There is also some homework for our courses. I must get back and prepare for it now.