The Smart Common Input Method (SCIM) platform is a development platform that significantly reduces the difficulty of input method development. SCIM splits an input method into three parts: FrontEnd, which handles user interface and communication with client application
This project aims to develop the most complete, standards-compliant, high-quality Chinese (and CJKV) fonts and resources, including bitmap and outline fonts in various styles. We also develop web-based tools to facilitate online collaboration on font development.
Cjklib provides language routines for Han characters (characters based on Chinese characters, named Hanzi, Kanji, Hanja and chu Han respectively) as used in writing Chinese, Japanese, infrequently Korean, and formerly Vietnamese. Functionality is included for ...
Eclectus is a small Han character dictionary designed especially for learners of languages written with Chinese characters, such as Mandarin Chinese or Japanese.
A library for manipulating Chinese and Japanese scripts using Python. The API includes methods for script detection, reading alternations, and common dictionary formats, as well as general enhancements for working with iterators, sequences and Python objects.
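As a rough illustration of what script detection involves, the sketch below classifies characters by Unicode block. It is not the library's API: the names `detect_script` and `scripts_in` are hypothetical, and the ranges cover only the main hiragana, katakana, CJK Unified Ideographs and Hangul Syllables blocks.

```python
def detect_script(ch: str) -> str:
    """Classify one character by Unicode block (hypothetical helper, illustrative only)."""
    cp = ord(ch)
    if 0x3040 <= cp <= 0x309F:      # Hiragana block
        return "hiragana"
    if 0x30A0 <= cp <= 0x30FF:      # Katakana block
        return "katakana"
    if 0x4E00 <= cp <= 0x9FFF:      # CJK Unified Ideographs (main block only)
        return "han"
    if 0xAC00 <= cp <= 0xD7A3:      # Hangul Syllables
        return "hangul"
    if cp < 0x80:                   # Basic Latin
        return "ascii"
    return "other"


def scripts_in(text: str) -> set:
    """Return the set of script labels that occur in the text."""
    return {detect_script(ch) for ch in text}
```

A real library would also need to handle extension blocks, half-width forms and punctuation, which this sketch lumps together as "other".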
Font meta-family, multiple styles, for Japanese, English, and Korean, made with Metafont. Full coverage of hiragana, katakana, hangul, and Latin. Partial coverage of grade-school kanji. Also includes IDSgrep, a tool for querying kanji databases by partial layout, like a more advanced version of the ...
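Paoding Analysis summary: Paoding's Knives Chinese word segmentation offers very high efficiency and high extensibility. It introduces a metaphor, adopts a fully object-oriented design, and is advanced in conception. High efficiency: on a PIII personal machine with 1 GB of memory, it can accurately segment 1 million Chinese characters per second. It is based on an unlimited number of ...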
The CJK Decomposition File is a graphical analysis of the most common 20,934 Chinese/Japanese characters in Unicode (the 20,922 characters in the Unicode CJK common ideograph block, plus the 12 unique characters from the CJK compatibility block). For each character, I've ...
pymmseg-cpp is a Python port of the rmmseg-cpp project. rmmseg-cpp is an implementation of the MMSEG Chinese word segmentation algorithm, written in C++ with a Ruby interface.
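For readers unfamiliar with the approach, here is a minimal sketch of greedy forward maximum matching, the simplest form of the matching that MMSEG builds on. It is not pymmseg-cpp's API or implementation: the function name `max_match`, the `max_len` parameter and the toy lexicon are assumptions made for illustration.

```python
def max_match(text, lexicon, max_len=4):
    """Greedy forward maximum matching (illustrative sketch, not the pymmseg-cpp API).

    At each position, take the longest lexicon word that starts there;
    fall back to a single character when nothing in the lexicon matches.
    """
    words = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, shrinking down to one character.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in lexicon:
                words.append(candidate)
                i += size
                break
    return words


# Toy lexicon, invented for this example.
lexicon = {"研究", "研究生", "生命", "起源"}
print(max_match("研究生命起源", lexicon))
```

With this lexicon the greedy pass picks 研究生 first and strands 命 as a single character, which is the kind of ambiguity that MMSEG's more elaborate chunk-based rules are meant to resolve.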
Font Industry industrializes the production of big-charset fonts. It moves big-charset font creation out of the artist's studio and into the average John's basement. The font market will be flooded with huge numbers of cheap, low-quality, big-charset, handwritten-script fonts in no time. Be scared! Be ...