The following characters are allowed in the transcription system using the ancient Greek section of Unicode.
For the twenty-four individual alphabetic characters of the Attic-Ionic alphabet, the
twenty-four upper-case Unicode code points 0391 - 03A1 and 03A3 - 03A9 may be used. For equivalent lower-case characters, the twenty-five lower case Unicode code points from 03B1 - 03C9 may be used. Either 03C2 (final sigma) or 03C3 (sigma) may be used for the character sigma; both are mapped to ASCII s
.
The mapping of ASCII to Unicode Greek transcriptions can be illustrated by creating a GreekString from a Unicode source, and then converting the GreekString to the corresponding ASCII-only transcription.
GreekString | Source String |
---|---|
a | α |
b | β |
g | γ |
d | δ |
e | ε |
z | ζ |
h | η |
q | θ |
i | ι |
k | κ |
l | λ |
m | μ |
n | ν |
c | ξ |
o | ο |
p | π |
r | ρ |
s | σ |
t | τ |
u | υ |
f | φ |
x | χ |
y | ψ |
w | ω |
A GreekString may be individual Greek vowel characters together with the combining Unicode codepoint for smooth or rough breathing, the three accent characters, iota subscript and diaeresis. Alternatively, a GreekString may use the equivalent Unicode precombined characters.
Combining/precombined | Source String | GreekString |
---|---|---|
combining | μῆνιν | mh=nin |
combining | Μῆνιν | *mh=nin |
precombined | μῆνιν | mh=nin |
precombined | Μῆνιν | *mh=nin |
The following punctuation characters are allowed:
The comma and period characters are identical in Unicode Greek transcription and ASCII transcription:
ASCII only transcription | Unicode transcription |
---|---|
. | . |
, | , |
The Greek question mark character converts to a semicolon in ASCII transcription, and the high stop character converts to a colon.
ASCII only transcription | Unicode transcription |
---|---|
; | ; |
: | · |
The elision character is transcribed in both ASCII and Unicode Greek transcriptions with the apostrophe character '
(= \u0027).
ASCII only transcription | Unicode transcription |
---|---|
' | ' |
Whether constructed from beta-code or unicode source string, Greek Strings can be converted to Unicode in NFC form, except that two code points are maintained without normalization: Greek high stop and Greek question mark.
The ASCII string *mh=nin converts to the NFC Unicode string Μῆνιν.
The Unicode string ἐπίρρημα converts to ASCII string e)pi/rrhma and NFC Unicode string ἐπίρρημα