Projects tagged ‘python’ and ‘unicode’


[12 total ]

19 Users
 

NLTK — the Natural Language Toolkit — is a suite of open source Python modules, linguistic data and documentation for research and development in natural language processing, supporting dozens of ... [More] NLP tasks, with distributions for Windows, Mac OSX and Linux. [Less]
Created over 3 years ago.

4 Users

Unknown Horizons is a 2D realtime strategy simulation with an emphasis on economy and city building. Expand your small settlement to a strong and wealthy colony, collect taxes and supply your ... [More] inhabitants with valuable goods. Increase your power with a well balanced economy and with strategic trade and diplomacy. Unknown Horizons is loosely oriented on Sunflowers Anno series however not a clone of these comercial games nor an engine to play the original content. [Less]
Created 8 months ago.

1 Users

python-elinks installs an encoding error handler that uses the same ASCII replacements as ELinks does.
Created about 1 year ago.

1 Users

PyICU is a python extension wrapping IBM's ICU C++ API.
Created over 3 years ago.

1 Users

The Link Grammar Parser is a syntactic parser of English, based on link grammar, an original theory of English syntax. Given a sentence, the system assigns to it a syntactic structure, which consists ... [More] of a set of labeled links connecting pairs of words. The parser also produces a "constituent" (Penn tree-bank style phrase tree) representation of a sentence (showing noun phrases, verb phrases, etc.). [Less]
Created about 1 year ago.

0 Users

There are many utilities that allow the input of any unicode characters, given their (hexa)decimal codes. But that's really cumbersome. This software allows searching for a character by its name, then ... [More] copies it to the clipboard in order to be pasted anywhere. InstallationOn Windows: there is no setup yet, just unzip the file. Run uibn.exe. Others: download the source code, install wxPython 2.8.x.x. Run uibn.pyw (you may need to set its permission first). Tested on Ubuntu; I have no idea if it works on Mac OS. UsageThe program automatically hides itself. Right click the notification area icon (all systems) or press Alt+"+" (numpad +) to show it (only on Windows). Type the character name in order to search for it. Press up/down arrow to navigate the list. Press enter or click the Copy button to copy the character to the clipboard; the program will hide itself again. Press esc or the Hide button to hide the program. In order to close it, close the window. The characters are sorted by use (the most copied will appear first) Help wantedIf you know a standard way to add a global shortcut to show the program window on Linux (or for specific window managers) and on Mac OS, please drop me a line. To DoMake it customizable (notification icon on/off, custom shortcut, sort by usage on/off, etc) AuthorConrado PLG (conradoplg at gmail dot com) [Less]
Created 12 months ago.

0 Users

Unibabel allows you to: 1) browse the rich visual universe of Unicode glyphs 2) understand the relationship between legacy encodings and Unicode 3) convert from legacy encodings to Unicode and ... [More] back Unibabel runs on Google App Engine. Check it out: http://unibabel.appspot.com . [Less]
Created 4 months ago.

0 Users

wxPyDictDictionary Lookup Program, based on wxWidgets, wxPython, Python, sqlite3, cburglish, and Myanmar Text Tools from Burglish Systems, currently mainly support for Myanmar Language based on Zawgyi ... [More] Encoding. Download All-in-One InstallerDownload wxPyDict All-in-One Installer with English->Myanmar(ornagai), Myanmar->English(saing dictionary), and Myanmar Villages Directory(from Ko Nyi Lynn Seck) included Features (Quick)+ ျမန္မာလိုေရာ၊ အဂၤလိပ္လုိေရာ အသံုးျပဳျပီး ရွာေဖြနိုင္သည္။ အဂၤလိပ္-ျမန္မာ (Ornagai), ျမန္မာ-အဂၤလိပ္ (စိုင္း ျမန္မာ-အဂၤလိပ္), ကိုညီလင္းဆက္ ျပုစုထားေသာ ျမန္မာေက်းရြာနာမည္မ်ားအဘိဓာန္ ကို အသံုးျပဳထားသည္။ + ျမန္မာစာ၏ ဝဏၰျဖတ္စနစ္ကို အသံုးျပဳျပီး ရွာေဖြျခင္းကို ပိုမိုတိက်ေအာင္ ျပဳလုပ္ထားသည္။ ဥပမာ - ကံ ကိုရွာစဥ္တြင္ ၾကံ ကို ရွာေဖြမိျခင္းမရွိေစရ။ + စာလံုးေပါင္း အတိုင္းအတာတစ္ခု အထိျပင္ေပးႏိုင္ေသာ အလိုအေလွ်ာက္ စစ္စတမ္ တစ္ခုပါရွိသည္။ ဥပမာ - ကေျခာ္ကျခြတ္ လို႕ရိုက္ျပီး Enter ကီးသံုးေပးပါက အလုိအေလ်ာက္ ကေခ်ာ္ကခြၽတ္ သို႕ေျပာင္းေပးသြားမည္။ Features (Detailed and more technical)+ custom dictionary ဖိုင္ေတြကို tsv (tabs seperated value) format အေနနဲ႕ ထည့္ေပးနိုင္သည္။ (wxPyDict.ini file ကို edit လုပ္ေပးျခင္းအားျဖင့္) + built-in dictionary builder (custom dictionary ထည့္ျပီးလွ်င္ Update->Rebuild Database ျပန္လုပ္ေပးရမည္) (if you have sqlite3.exe in same folder, dictionary building will get around 30% faster speed) + custom dictionary ဖိုင္ သည္ encoding utf8 သို႕ utf16 ၾကိဳက္တာျဖစ္နိုင္သည္။ + dictionary data should be in Zawgyi 2008 or Zawgyi 2009 Encoding, It will store in DB as, my encoding ျဖင့္ sqlite3 အေနျဖင့္ wxPyDict.db ဖိုင္တြင္ သိမ္းသြားမည္ ျဖစ္သည္။ + support reverse lookup, အဂၤလိပ္-ျမန္မာ အဘိဓာန္ ကို ျမန္မာစာျဖင့္ျပန္ရွာနိုင္သည္။ (output is not perfect but still usable feature) + support syllable breaker real-time, ကံ ကိုရွာတဲ့ အခ်ိန္မွာ ၾကံ ေတြဘာေတြမပါလာဘူး။ + built-in Normalization of Zawgyi to Zawgyi 2009, က--ိ-ု ပဲျဖစ္ျဖစ္ က--ု--ိ အတူတူပါပဲ၊ မွဳ ပဲရိုက္ရိုက္ မွု ပဲ ရိုက္ရိုက္ မႈ ပဲရို္က္ရိုက္အတူတူပါပဲ + built-in spell autocorrection for Myanmar Language, for eg, if you type "ကေျခာ္ကျခြတ္" it will automatically correct to "ကေခ်ာ္ကခၽြတ္" (need spell dictionary from thanlwinsoft) Download Separate Binaries (Don't need if All-in-One Installer is used)Main program only wxPyDict.7z + If you don't have dependencies files, like python25.dll, wx*28*.dll, MSVCR7.dll, and MSVCP7.dll, Download all files from download list Installations of Separate Binaries (Don't need if All-in-One Installer is used)There is no installations needs, just need to run wxPyDict.exe file, but make sure you already have python25.dll, wx*28*.dll, MSVCR7.dll, MSVCP7.dll Spell dictionary (my_MM.dic) (Don't need if All-in-One Installer is used)ျမန္မာစာ စာလံုးေပါင္းစစ္ေပးရန္ အတြက္ my-MMDict.oxt thanlwinsoft (Ko Wunna Ko Ko & Keith Stribley) မွ ထုတ္ထားေသာ openoffice အတြက္သံုးထားေသာ spell dictionary ကို 7z စတာတို႕ျဖင့္ my_MM.dic ကို extract လုပ္ျပီး wxPyDict.exe နွင့္ ေနရာအတူတူတြင္ ထားေပးရန္လိုအပ္။ Prebuilt Dictionaries (Don't need if All-in-One Installer is used)အဂၤလိပ္-ျမန္မာ ornagai dict folder - from mysteryzillion (Saturngod) ျမန္မာ-အဂၤလိပ္ saing dictionary.7z - from Saing Khan Tun ကိုညီလင္းဆက္ ျပုစုထားေသာ ျမန္မာေက်းရြာနာမည္မ်ားအဘိဓာန္ Dictionary File building (For Dictionary Creators & Advanced Users)There is built-in dictionary builder, from Menu-Update->Rebuild Database, will load all dictionaries from wxPyDict.ini for eg., here is sample of wxPyDict.ini [DICTS] ornagai.tsv,1 saidict.tsv,0 userdict.tsv villages.tsv [/DICTS]ornagai.tsv is tsv (tabs separated value) format file, 1 - means content searchable, for reverse lookup feature, 0 is not reverse searchable (default) File Details (For some people, who want to know which files are doing what)wxPyDict.exe သည္ အဓိက အလုပ္လုပ္မည့္ ဖိုင္ျဖစ္သည္။ အင္စေတာ လုပ္ျပီးခါစ run လွ်င္ wxPyDict.db ကို တည္ေဆာက္မည္ျဖစ္ေသာေၾကာင့္ စကၠန္႔ အနည္းငယ္ ၾကာမည္ျဖစ္သည္။ (အဘိဓာန္အရြယ္အစားနဲ႕ အသံုးျပဳမည့္စက္ေပၚမူတည္သည္။) wxPyDict.ini သည္ custom dictionary ထပ္ထည့္ႏိုင္ရန္ အသံုးျပဳနိုင္ပါသည္။ wxPyDict.ini မရွိေနပါက အလုိအေလ်ာက္ create လုပ္သြားမည္ ျဖစ္ပါသည္။ wxPyDict.db သည္ sqlite3 db ျဖစ္ျပီး database build လုပ္သည့္အခ်ိန္တြင္ tsv format ဖိုင္မ်ားမွ ဖတ္ျပီး သိမ္းယူသြားမည္။ wxPyDict.spell သည္ spell dictionary (my_MM.dic) မွ ေဒတာကို ဆြဲထုတ္ျပီး ပိုင္သြန္က နားလည္ေသာ format ျဖင့္ သိမ္းထားေသာဖိုင္ျဖစ္သည္။ (loading ျမန္ရန္အတြက္) Screenshots AuthorwxPyDict Demo is done by Soe Min (Mark) - soemin AT my HYPHEN MM DOT org ChargesThis Program is Free of Charges, If you paid for it, get a refund!!! LicenseAnd It is by My Special License Not Allowed to use for Commercial or Ads related things Not Allowed for Redistributions (Program must be get from my sites or from me directly) Not Allowed from Embeding / Linking / Calling / Any Kinds of Usage from Another programs Not Allowed for Modification / Reverse Engineering of the Program for any reason Above License Statements may change anytime for any reason WarningsThis is NOT Opensource application, but it is Free as in Beer. googlecode need to set a license, so I set it as GPL, its just a dummy, nobody is allowed to claim my proprietary things to those license [Less]
Created 4 months ago.

0 Users

Loquacious Etymologist is a simple program for generating made-up words that sound and look like they could be part of an actual language. The words are generated based on word lists. For example, a ... [More] list of English words would allow the Etymologist to generate words that look and sound like English words. Loquacious Etymologist is written in Python 2.5 with full unicode support. It has a cross-platform GUI front end written using wxWidgets. While there's more that the front end could do, it supports all basic program operations. Chris Pound's word/name lists work well with Loquacious Etymologist. [Less]
Created 12 months ago.

0 Users

AboutMindTree is an information outliner application designed for recording and organizing notes and publishing these notes to the web. PyEnchant must be installed separately if you want to use ... [More] the spell checker tool. FeaturesGeneralFull UNICODE support. All tree entires and Articles can contain UNICODE characters. Pluggable architecture. tools and importers/exporters are all plugins. New plugins can be easily added without changes to the main application. The current release runs on Python 2.5 -- This version has reached end-of-life. Future releases will run on Python 2.6 with PyQt. Outline TreeDrag-N-Drop. Tree nodes may be moved around by dragging them with your mouse. Mouse-less tree editing. If using the mouse is tedious for you move tree entires using tab, shift-tab and shift arrow keys. In-place-editing. Tree entries can be edited in-place Article EditingText Styling. Articles support a wide range of styling capabilities. Embedded images. Articles support embedding .png images. ToolsBuild Web sites. Export an entire outline to a web site complete with side-by-side panes and fully interactive tree. The generated website looks like a read-only version of MindTree. Spell checking Search and replace Customizable On-Screen Keyboards for characters not on the computer keyboard. CompatibilityThe current version runs on Python 2.5. Soon to be released, updated version to run on Python 2.6. [Less]
Created about 1 year ago.