NumWord: Difference between revisions

From tehowiki
Jump to navigation Jump to search
imported>Gfis
List of languages and categories
imported>Gfis
Unicode for native language denotation
Line 7: Line 7:
* 1..4 to words for seasons (spring, summer, autumn, winter)
* 1..4 to words for seasons (spring, summer, autumn, winter)


===Languages ===
===Implemented Languages ===
<code>ara</code> - Arabic (??????) <code>cze</code> - Czech (ceština) <code>chi</code> - Chinese ?? (zhongwén) <code>dan</code> - Danish (Dansk) <code>deu</code> - German (Deutsch) <code>eng</code> - English <code>epo</code> - Esperanto <code>est</code> - Estonian <code>fin</code> - Finnish <code>fra</code> - French (Français) <code>gle</code> - Irish (Gaeilge) <code>geo</code> - Georgian <code>gre</code> - Greek (????????) <code>hun</code> - Hungarian (Magyar) <code>ice</code> - Icelandic (íslenska) <code>ita</code> - Italian (Italiano) <code>jpn</code> - Japanese <code>kor</code> - Korean <code>lat</code> - Latin (Latinum) <code>lav</code> - Latvian (Latviešu) <code>lit</code> - Lithuanian (Lietuviu) <code>nld</code> - Dutch (Nederlands) <code>nor</code> - Norwegian (Norsk) <code>pol</code> - Polish (Polski) <code>por</code> - Portuguese (Português) <code>roh</code> - Rumantsch Grischun <code>ron</code> - Romanian (Româna) <code>rus</code> - Russian (???????) <code>slo</code> - Slovak <code>slv</code> - Slovenian (Slovenšcina) <code>spa</code> - Spanish (Español) <code>swe</code> - Swedish (Svenska) <code>tha</code> - Thai (???) <code>tlh</code> - Klingon (tlhIngan-Hol) <code>tur</code> - Turkish (Türkçe) <code>vie</code> - Vietnamese (ti?ng Vi?t) <code>braille</code> - Braille Code <code>morse</code> - Morse Code <code>roman</code> - Roman Numbers
<code>'''ara'''</code> - Arabic (ديسمبر) <code>'''cze'''</code> - Czech (ceština) <code>'''chi'''</code> - Chinese (中文 (zhōngwén)) <code>'''dan'''</code> - Danish (Dansk) <code>'''deu'''</code> - German (Deutsch) <code>'''eng'''</code> - English <code>'''epo'''</code> - Esperanto <code>'''est'''</code> - Estonian <code>'''fin'''</code> - Finnish <code>'''fra'''</code> - French (Français) <code>'''gle'''</code> - Irish (Gaeilge) <code>'''geo'''</code> - Georgian <code>'''gre'''</code> - Greek (Ελληνικά) <code>'''hun'''</code> - Hungarian (Magyar) <code>'''ice'''</code> - Icelandic (íslenska) <code>'''ita'''</code> - Italian (Italiano) <code>'''jpn'''</code> - Japanese <code>'''kor'''</code> - Korean <code>'''lat'''</code> - Latin (Latinum) <code>'''lav'''</code> - Latvian (Latviešu) <code>'''lit'''</code> - Lithuanian (Lietuviu) <code>'''nld'''</code> - Dutch (Nederlands) <code>'''nor'''</code> - Norwegian (Norsk) <code>'''pol'''</code> - Polish (Polski) <code>'''por'''</code> - Portuguese (Português) <code>'''roh'''</code> - Rumantsch Grischun <code>'''ron'''</code> - Romanian (Româna) <code>'''rus'''</code> - Russian (Русский) <code>'''slo'''</code> - Slovak <code>'''slv'''</code> - Slovenian (Slovenšcina) <code>'''spa'''</code> - Spanish (Español) <code>'''swe'''</code> - Swedish (Svenska) <code>'''tha'''</code> - Thai (ไทย) <code>'''tlh'''</code> - Klingon (tlhIngan-Hol) <code>'''tur'''</code> - Turkish (Türkçe) <code>'''vie'''</code> - Vietnamese (ti?ng Vi?t) <code>'''braille'''</code> - Braille Code <code>'''morse'''</code> - Morse Code <code>'''roman'''</code> - Roman Numbers


===Categories which can be spelled===
===Categories which can be spelled===

Revision as of 10:21, 2 September 2016

NumWord deals with the number words in natural languages. You can enter a sequence of digits, and the program will write the number word as it is spelled in the desired language. Likewise, a number word can be entered and the program will write the corresponding number as a sequence of digits.

Furthermore, the module maps numbers to a few calendar-related sets of words in natural languages:

  • 1..7 to weekday names (Monday ... Sunday) and their abbreviations
  • 1..12 to month names (January ... December)
  • 1..4 to words for seasons (spring, summer, autumn, winter)

Implemented Languages

ara - Arabic (ديسمبر) cze - Czech (ceština) chi - Chinese (中文 (zhōngwén)) dan - Danish (Dansk) deu - German (Deutsch) eng - English epo - Esperanto est - Estonian fin - Finnish fra - French (Français) gle - Irish (Gaeilge) geo - Georgian gre - Greek (Ελληνικά) hun - Hungarian (Magyar) ice - Icelandic (íslenska) ita - Italian (Italiano) jpn - Japanese kor - Korean lat - Latin (Latinum) lav - Latvian (Latviešu) lit - Lithuanian (Lietuviu) nld - Dutch (Nederlands) nor - Norwegian (Norsk) pol - Polish (Polski) por - Portuguese (Português) roh - Rumantsch Grischun ron - Romanian (Româna) rus - Russian (Русский) slo - Slovak slv - Slovenian (Slovenšcina) spa - Spanish (Español) swe - Swedish (Svenska) tha - Thai (ไทย) tlh - Klingon (tlhIngan-Hol) tur - Turkish (Türkçe) vie - Vietnamese (ti?ng Vi?t) braille - Braille Code morse - Morse Code roman - Roman Numbers

Categories which can be spelled

  • Digits as Word
  • Word as Digits
  • Month
  • Month's Abbreviation
  • Weekday
  • Weekday's Abbreviation
  • Season
  • Greeting
  • Time of Day
    • offical Time - variant 1
    • Time - variant 2
    • Time - variant 3
  • Cardinal Direction
  • Planet

Applications

  • Fun for children
  • eLearning
  • Printing of the amount in words on checks
  • Internationalized calendar
  • Extraction of numbers and dates from long texts like the bible

Possible Future Extensions

  • angle to compass direction (north, east ...)
  • Ordinal numbers (second, third, fourth ...)
  • Fractions (half, third, quarter ...)
  • Multiples (twice, three times ...), or threefold
  • Prefixes for positive (deca, hecto, kilo ...) and negative powers of 10 (dezi, centi, milli ...)
  • Genealogical hierarchy
  • More languages
  • Braille output
  • Declination of number words (especially ordinal numbers)
  • Evangelists/Gospels
  • Apostel/Disciples
  • Bible books
  • Astrological periods (Sagittarius, Cancer ...), link to churchcal
  • Planets of the sun (Mercure = 1, Venus = 2, Earth = 3 ...)
  • Euro countries
  • Unicode characters
  • RGB color values
  • Nebulae (M-number) with their galactical positions
  • International telephone prefix numbers and corresponding country names (49 = Germany)
  • numerical country codes
  • Towns by their postal codes
  • ISBN book seller prefixes
  • Hexadecimal, binary, octal converter
  • Text -> SMS digits (multiple or T9)
  • reference to Wikipedia article for the number, in specified language (done)
  • show on Abacus

Problems

  • For right-to-left languages (Arabic) the lists are not always displayed properly.
  • Testing is rather difficult if you are not a native speaker of the language.
  • Some languages declinate words for small numbers (1,2,3). The current implementation doesn't attempt to handle variations for gender, numerus and case properly, except for singular/plural of million, milliard, billion etc. Dual is also not yet supported.
  • It is an interesting fact that the numbering schemes in most languages use powers of 1000, but there are exceptions where special words for (powers of) 10000 are used (Klingon, SinoSpeller).
  • For the languages based on powers of 1000, a fixed set of Latin prefixes is used for numbers >= 1 million (c.f. BaseSpeller.wordN000). For some languages these prefixes must be modified (cyrillic character set, c -> k in German).
  • Some languages still use different number word sets for living entities, cattle, money etc. There is currently no attempt to handle these variations.

See also the hints and bugs for developers.