Projects tagged ‘chinese’ and ‘python’


Jump to tag:

Projects tagged ‘chinese’ and ‘python’

Filtered by Project Tags chinese python

Refine results Project Tags cjk (5) django (3) 中文 (3) appengine (2) xml (2) japanese (2) xbmc (2) segment (2) nlp (2) language (2) book (2) gae (2)

[43 total ]

1 Users
   

Make Python language could be written in Chinese. Zhpy is the full feature python language with fully tested chinese keywords, variables, and parameters support, independent on python version ... [More] , bundle with command line tool, interpreter, pluggable keyword system and great document. [Less]
Created about 1 year ago.

1 Users

Cjklib provides language routines related to Han characters (characters based on Chinese characters named Hanzi, Kanji, Hanja and chu Han respectively) used in writing of the Chinese, the Japanese ... [More] , infrequently the Korean and formerly the Vietnamese language(s). Functionality is included for character pronunciations, radicals, glyph components, stroke decomposition and variant information. Cjklib is implemented in Python. [Less]
Created 12 months ago.

0 Users

基于python的中文分词项目。 ... [More] 第一个版本实现了基于的MMSEG中文分词算法Python实现。MMSEG实际上是一个正向最大匹配+多个规则的分词算法。链接给出的几个网站写的很清楚了。在开发过程中我增加了一个规则来处理原来的算法中有可能出现的冲突问题。当所有的规则都无法唯一的确定一个chunk时,优先选择后面比较长的词。开发过程中参照了MMSEG的Java实现和ruby实现。并且对性能进行了初步的优化。 目前的性能数据:在Pentium D 2.8G的CPU下处理2.9MB的文本数据,全切分的复杂算法不开启pysco的情况下104s,开启pysco的情况下90s,能达到32KB/s。简单算法可以达到64KB/s。经测试速度能达到Java版本MMSEG的1/3,未来如果要进一步优化速度的话应该是把关键的算法的实现移植到c语言中。 实现了简单的余弦相似度计算的算法。 TODO: 实现NLTK兼容的接口。(目前已经增加了tokenizer接口) C语言级别的优化 (测试中,增加了is_basic_latin的c实现,考虑字典用c语言优化) 实现其他算法,目前考虑一个ICTCLAS的python实现,要看有没有时间。 支持停用词,支持unicode的字母数字检测等。 与分词有关的其他想法 研究一下ferret/cferret,能否实现一个python binding并且结合进去。(研究发现ferret的实现非常复杂,ruby绑定的接口部分的c代码都有上万行,放弃了,还是用solr吧) 与nlp/datamining的进一步结合 [Less]
Created about 1 year ago.

0 Users

It only opened for Google App Engine. 本页的开通主要是为Google的一些编程爱好者们提供一个交流和互相学习的地方,为共同开发出大家期待而又充满活力的Google ... [More] App Engine应用程序努力! 同时,本站上传的程序和作品仅供大家学习之用,若涉及到您的合法权益请告诉我们(在本页留言),我们会慎重处理! 若大家要将本站的程序用于商业用途,请大家自觉联系作者! [Less]
Created 9 months ago.

0 Users

A python language binding for SCIM
Created about 1 year ago.

0 Users

a simple experimental python web based chinese lunar solar calendar.
Created 4 months ago.

0 Users

If you intertest in Baidu Space(http://hi.baidu.com)which is owned by China's Search Engine called Baidu,you will want to control it.It is so easy to use this class.Maybe just Chinese will be fond of ... [More] it! You can see the read.txt for further devolopment. [Less]
Created 12 months ago.

0 Users

Mayavi2 is a general purpose, cross-platform tool for 3-D scientific data visualization. Its features include: Visualization of scalar, vector and tensor data in 2 and 3 dimensions. Easy ... [More] scriptability using Python. Easy extendability via custom sources, modules, and data filters. Reading several file formats: VTK (legacy and XML), PLOT3D, etc. Saving of visualizations. Saving rendered visualization in a variety of image formats. Convenient functionality for rapid scientific plotting via mlab This project tries to translate its documentation into Chinese, and to perform localization in the future if possible Mayavi2 是一个通用的、跨平台 3D 科学数据可视化工具。其特点如下: 标量、向量、张量的 2D 和 3D 的可视化。 可方便地使用Python编写脚本。 可方便地修改源代码,模块和数据过滤器进行扩展。 可读入多种数据格式:VTK (legacy 和 XML), PLOT3D 等。 保存可视化操作。 以多种图形格式保存渲染的可视化结果。 通过 mlab 为快速科学绘图提供方便的功能。 本项目致力于翻译 Mayavi2 的文档,希望以后进行其本地化工作。 如果您希望加入或碰到任何问题,请移步 Mayavi2-cn论坛,或致函 esnmlt@gmail.com。 此致。 [Less]
Created about 1 year ago.

0 Users

asddasd asds adsdas
Created 11 months ago.

0 Users

SEO 跟踪 Pagerank、Alexa Rank、Google 收录数等。
Created 4 months ago.