In recent years, Plone(Contents Management System) is attracting attention in Japan. It has rich function and very security, many businesses and public institution use to creating website. However, it
... [More]
has search system is don’t understand the East Asian languages (Chinese, Japanese, Korean), so we use it unfavorable for the state. Therefore, I develop search system for East Asian languages of Plone3.x.x. If development is successful, it is more effective use can be in East Asian.
The present time, search system of Plone compare the query and with the word of the pages. In the case of English and European, words is in the between space. But, CJK languages have no space in the between each word. So, this is the cause of the failure.
I think that Search of CJK languages for a way to achieve is N-gram. It method is at regular intervals to share the string, and make up the index by it, the occurrence frequency of search terms from search method. This method is Korean and Chinese which can be applied. This project is editing source of Plone using by Python language.
After implementation, verify search system operation. 1 Operating environment of Plone. 2 Chinese, Japanese, Korean languages can be properly search. 3 English and other languages to verify the impact. This is maintenance of recall and precision.
Thank you very much for reading. [Less]