The following characters are allowed in the transcription system using the ancient Greek section of Unicode.
For the twenty-four individual alphabetic characters of the Attic-Ionic alphabet, the
twenty-four upper-case Unicode code points 0391 - 03A1 and 03A3 - 03A9 may be used. For equivalent lower-case characters, the twenty-five lower case Unicode code points from 03B1 - 03C9 may be used. Either 03C2 (final sigma) or 03C3 (sigma) may be used for the character sigma; both are mapped to ASCII s.
The mapping of ASCII to Unicode Greek transcriptions can be illustrated by creating a GreekString from a Unicode source, and then converting the GreekString to the corresponding ASCII-only transcription.
| GreekString | Source String |
|---|---|
| a | α |
| b | β |
| g | γ |
| d | δ |
| e | ε |
| z | ζ |
| h | η |
| q | θ |
| i | ι |
| k | κ |
| l | λ |
| m | μ |
| n | ν |
| c | ξ |
| o | ο |
| p | π |
| r | ρ |
| s | σ |
| t | τ |
| u | υ |
| f | φ |
| x | χ |
| y | ψ |
| w | ω |
A GreekString may be individual Greek vowel characters together with the combining Unicode codepoint for smooth or rough breathing, the three accent characters, iota subscript and diaeresis. Alternatively, a GreekString may use the equivalent Unicode precombined characters.
| Combining/precombined | Source String | GreekString |
|---|---|---|
| combining | μῆνιν | mh=nin |
| combining | Μῆνιν | *mh=nin |
| precombined | μῆνιν | mh=nin |
| precombined | Μῆνιν | *mh=nin |
The following punctuation characters are allowed:
The comma and period characters are identical in Unicode Greek transcription and ASCII transcription:
| ASCII only transcription | Unicode transcription |
|---|---|
| . | . |
| , | , |
The Greek question mark character converts to a semicolon in ASCII transcription, and the high stop character converts to a colon.
| ASCII only transcription | Unicode transcription |
|---|---|
| ; | ; |
| : | · |
The elision character is transcribed in both ASCII and Unicode Greek transcriptions with the apostrophe character ' (= \u0027).
| ASCII only transcription | Unicode transcription |
|---|---|
| ' | ' |
Whether constructed from beta-code or unicode source string, Greek Strings can be converted to Unicode in NFC form, except that two code points are maintained without normalization: Greek high stop and Greek question mark.
The ASCII string *mh=nin converts to the NFC Unicode string Μῆνιν.
The Unicode string ἐπίρρημα converts to ASCII string e)pi/rrhma and NFC Unicode string ἐπίρρημα