NumWord: Difference between revisions
imported>Gfis from documentation.html |
imported>Gfis No edit summary |
||
Line 9: | Line 9: | ||
* 1..4 to words for seasons (spring, summer, autumn, winter) | * 1..4 to words for seasons (spring, summer, autumn, winter) | ||
== | ===Languages === | ||
<code>ara</code> - Arabic (??????) <code>cze</code> - Czech (ceština) <code>chi</code> - Chinese ?? (zhongwén) <code>dan</code> - Danish (Dansk) <code>deu</code> - German (Deutsch) <code>eng</code> - English <code>epo</code> - Esperanto <code>est</code> - Estonian <code>fin</code> - Finnish <code>fra</code> - French (Français) <code>gle</code> - Irish (Gaeilge) <code>geo</code> - Georgian <code>gre</code> - Greek (????????) <code>hun</code> - Hungarian (Magyar) <code>ice</code> - Icelandic (íslenska) <code>ita</code> - Italian (Italiano) <code>jpn</code> - Japanese <code>kor</code> - Korean <code>lat</code> - Latin (Latinum) <code>lav</code> - Latvian (Latviešu) <code>lit</code> - Lithuanian (Lietuviu) <code>nld</code> - Dutch (Nederlands) <code>nor</code> - Norwegian (Norsk) <code>pol</code> - Polish (Polski) <code>por</code> - Portuguese (Português) <code>roh</code> - Rumantsch Grischun <code>ron</code> - Romanian (Româna) <code>rus</code> - Russian (???????) <code>slo</code> - Slovak <code>slv</code> - Slovenian (Slovenšcina) <code>spa</code> - Spanish (Español) <code>swe</code> - Swedish (Svenska) <code>tha</code> - Thai (???) <code>tlh</code> - Klingon (tlhIngan-Hol) <code>tur</code> - Turkish (Türkçe) <code>vie</code> - Vietnamese (ti?ng Vi?t) <code>braille</code> - Braille Code <code>morse</code> - Morse Code <code>roman</code> - Roman Numbers | |||
===Categories which can be spelled=== | |||
* Digits as Word | |||
* Word as Digits | |||
* Month | |||
* Month's Abbreviation | |||
* Weekday | |||
* Weekday's Abbreviation | |||
* Season | |||
* Greeting | |||
* Time of Day | |||
** offical Time - variant 1 | |||
** Time - variant 2 | |||
** Time - variant 3 | |||
* Cardinal Direction | |||
* Planet | |||
=== Applications === | |||
* Fun for children | * Fun for children | ||
* eLearning | * eLearning | ||
Line 16: | Line 33: | ||
* Internationalized calendar | * Internationalized calendar | ||
* Extraction of numbers and dates from long texts like the bible | * Extraction of numbers and dates from long texts like the bible | ||
=== Possible Future Extensions === | |||
== Possible Future Extensions == | |||
* angle to compass direction (north, east ...) | * angle to compass direction (north, east ...) | ||
* Ordinal numbers (second, third, fourth ...) | * Ordinal numbers (second, third, fourth ...) | ||
Line 45: | Line 60: | ||
* reference to Wikipedia article for the number, in specified language (done) | * reference to Wikipedia article for the number, in specified language (done) | ||
* show on Abacus | * show on Abacus | ||
=== Problems === | |||
== Problems == | |||
* For right-to-left languages (Arabic) the lists are not always displayed properly. | * For right-to-left languages (Arabic) the lists are not always displayed properly. | ||
* Testing is rather difficult if you are not a native speaker of the language. | * Testing is rather difficult if you are not a native speaker of the language. |
Revision as of 10:10, 2 September 2016
Overview
NumWord deals with the number words in natural languages. You can enter a sequence of digits, and the program will write the number word as it is spelled in the desired language. Likewise, a number word can be entered and the program will write the corresponding number as a sequence of digits.
Furthermore, the module maps numbers to a few calendar-related sets of words in natural languages:
- 1..7 to weekday names (Monday ... Sunday) and their abbreviations
- 1..12 to month names (January ... December)
- 1..4 to words for seasons (spring, summer, autumn, winter)
Languages
ara
- Arabic (??????) cze
- Czech (ceština) chi
- Chinese ?? (zhongwén) dan
- Danish (Dansk) deu
- German (Deutsch) eng
- English epo
- Esperanto est
- Estonian fin
- Finnish fra
- French (Français) gle
- Irish (Gaeilge) geo
- Georgian gre
- Greek (????????) hun
- Hungarian (Magyar) ice
- Icelandic (íslenska) ita
- Italian (Italiano) jpn
- Japanese kor
- Korean lat
- Latin (Latinum) lav
- Latvian (Latviešu) lit
- Lithuanian (Lietuviu) nld
- Dutch (Nederlands) nor
- Norwegian (Norsk) pol
- Polish (Polski) por
- Portuguese (Português) roh
- Rumantsch Grischun ron
- Romanian (Româna) rus
- Russian (???????) slo
- Slovak slv
- Slovenian (Slovenšcina) spa
- Spanish (Español) swe
- Swedish (Svenska) tha
- Thai (???) tlh
- Klingon (tlhIngan-Hol) tur
- Turkish (Türkçe) vie
- Vietnamese (ti?ng Vi?t) braille
- Braille Code morse
- Morse Code roman
- Roman Numbers
Categories which can be spelled
- Digits as Word
- Word as Digits
- Month
- Month's Abbreviation
- Weekday
- Weekday's Abbreviation
- Season
- Greeting
- Time of Day
- offical Time - variant 1
- Time - variant 2
- Time - variant 3
- Cardinal Direction
- Planet
Applications
- Fun for children
- eLearning
- Printing of the amount in words on checks
- Internationalized calendar
- Extraction of numbers and dates from long texts like the bible
Possible Future Extensions
- angle to compass direction (north, east ...)
- Ordinal numbers (second, third, fourth ...)
- Fractions (half, third, quarter ...)
- Multiples (twice, three times ...), or threefold
- Prefixes for positive (deca, hecto, kilo ...) and negative powers of 10 (dezi, centi, milli ...)
- Genealogical hierarchy
- More languages
- Braille output
- Declination of number words (especially ordinal numbers)
- Evangelists/Gospels
- Apostel/Disciples
- Bible books
- Astrological periods (Sagittarius, Cancer ...), link to churchcal
- Planets of the sun (Mercure = 1, Venus = 2, Earth = 3 ...)
- Euro countries
- Unicode characters
- RGB color values
- Nebulae (M-number) with their galactical positions
- International telephone prefix numbers and corresponding country names (49 = Germany)
- numerical country codes
- Towns by their postal codes
- ISBN book seller prefixes
- Hexadecimal, binary, octal converter
- Text -> SMS digits (multiple or T9)
- reference to Wikipedia article for the number, in specified language (done)
- show on Abacus
Problems
- For right-to-left languages (Arabic) the lists are not always displayed properly.
- Testing is rather difficult if you are not a native speaker of the language.
- Some languages declinate words for small numbers (1,2,3). The current implementation doesn't attempt to handle variations for gender, numerus and case properly, except for singular/plural of million, milliard, billion etc. Dual is also not yet supported.
- It is an interesting fact that the numbering schemes in most languages use powers of 1000, but there are exceptions where special words for (powers of) 10000 are used (Klingon, SinoSpeller).
- For the languages based on powers of 1000, a fixed set of Latin prefixes is used for numbers >= 1 million (c.f. BaseSpeller.wordN000). For some languages these prefixes must be modified (cyrillic character set, c -> k in German).
- Some languages still use different number word sets for living entities, cattle, money etc. There is currently no attempt to handle these variations.