This is not currently part of the peer-reviewed material of the project. Do not cite as a research publication.
Odd Einar Haugen, 16 December 2015
This list is based on the meeting held in Bergen, 8–9 September 2015, and additional mails.
It contains 36 new characters. Of these, only 1 character is in Andron Scriptor Web, but 22 are in Andron Mega Corpus. See the columns for ASW and AMC in the table below, as well as the column for the LINCUA tables (LINCUA overview). A total of 13 characters need to be composed (of existing characters) or drawn from scratch.
Of the 36 new characters, 4 are already in the Unicode Standard. They have been listed with codepoints in black & bold below. Another 4 are already in the Private Use Area (PUA) of TITUS. They have been listed with codepoints in black & regular below. Furthermore, 18 characters were allocated codepoints by Jost Gippert in 2011, and have been listed in blue below.
Finally, 10 characters have no code points in TITUS, and have been listed in red below. In agreement with Jost Gippert, code points have been allocated by Odd Einar Haugen. When allocating these codepoints, the latest version of the TITUS font has been checked, as well as the LINCUA resource set up by Andreas Stötzner
Also, see the overview of new codepoints on these pages:
http://folk.uib.no/hnooh/mufi/pipeline/for-v3.html
http://folk.uib.no/hnooh/mufi/pipeline/for-v4.html
One character should be decommissioned from the MUFI PUA and moved to a new codepoint in Latin Extended-E, i.e. LATIN LETTER SMALL X WITH LONG LEFT LEG: From F232 to AB57
Code | ASW | AMC | LIN | Chart | Descriptive name |
019C | — | Ɯ | n/a | LatExtB | LATIN CAPITAL LETTER TURNED M |
026F | — | ɯ | n/a | IPAExt | LATIN SMALL LETTER TURNED M |
2056 | — | ⁖ | n/a | GenPunct | THREE DOT PUNCTUATION |
2E40 | — | — | n/a | SupplPunct | DOUBLE HYPHEN |
E262 | — | — | yes | PUA-22 | LATIN CAPITAL LIGATURE OE WITH OGONEK |
E268 | — | — | yes | PUA-17 | LATIN CAPITAL LETTER P WITH DOUBLE ACUTE |
E34E | — | # | yes | PUA-30 | LATIN CAPITAL LETTER V WITH VERTICAL LINE ABOVE |
E662 | — | $ | yes | PUA-22 | LATIN SMALL LIGATURE OE WITH OGONEK |
E668 | — | — | yes | PUA-17 | LATIN SMALL LETTER P WITH DOUBLE ACUTE |
E74F | — | % | yes | PUA-30 | LATIN SMALL LETTER V WITH VERTICAL LINE ABOVE |
E8A1 | | & | yes | PUA-5 | LATIN SMALL LETTER I WITH TWO STROKES (has a decommissioned character in Andron Scriptor Web; the code point E8A1 should now be used for LATIN SMALL LETTER I WITH TWO STROKES) |
E8A2 | | ' | yes | PUA-5 | LATIN SMALL LETTER J WITH TWO STROKES (has a decommissioned character in Andron Scriptor Web; the code point E8A2 should now be used for LATIN SMALL LETTER J WITH TWO STROKES) |
E8A3 | | ( | yes | PUA-4 | LATIN ABBREVIATION SIGN AUTEM (has a decommissioned character in Andron Scriptor Web; the code point E8A3 should now be used for LATIN ABBREVIATION SIGN AUTEM) |
E8BB | — | ) | yes | PUA-5 | LATIN SMALL LETTER V WITH SHORT SLASH ABOVE RIGHT |
E8C6 | — | * | yes | PUA-1 | LATIN CAPITAL LIGATURE UU |
E8C7 | — | + | yes | PUA-1 | LATIN SMALL LIGATURE UU |
E8C8 | — | , | yes | PUA-1 | LATIN CAPITAL LIGATURE UE |
E8C9 | — | - | yes | PUA-1 | LATIN SMALL LIGATURE UE (equivalent to 1D6B in PhonExt) |
E8BC | — | . | yes | PUA-5 | LATIN SMALL LETTER V WITH TWO SHORT SLASHES ABOVE RIGHT |
E8CE | — | / | yes | PUA-5 | LATIN SMALL LETTER X WITH TWO SHORT SLASHES BELOW RIGHT |
E8DD | — | — | no | PUA-22 | LATIN SMALL LETTER DOTLESS I WITH OGONEK |
E8DE | — | — | no | PUA-1 | LATIN SMALL LIGATURE O R ROTUNDA |
E8DF | — | — | no | PUA-1 | LATIN SMALL LIGATURE LONG S L WITH STROKE |
EFD8 | — | — | no | PUA-17 | LATIN SMALL LIGATURE UU WITH DOUBLE ACUTE |
EFD9 | — | — | no | PUA-17 | LATIN CAPITAL LIGATURE UU WITH DOUBLE ACUTE |
EFDB | — | — | no | PUA-32 | LATIN CAPITAL LETTER AE WITH DOT ABOVE AND ACUTE |
EFDC | — | — | no | PUA-32 | LATIN SMALL LETTER AE WITH DOT ABOVE AND ACUTE |
F1BB | | 0 | yes | PUA-1 | LATIN SMALL LIGATURE CH |
F1D2 | — | 1 | yes | PUA-8 | TRIPLE DAGGER SIGN |
F23C | — | — | no | PUA-51 | LATIN SMALL LETTER M UNCIAL FORM [having the same x height as ordinary ‘m’] (F225 should be retained, but renamed LATIN MEDIUSCULE LETTER M UNCIAL FORM) |
F23D | — | — | no | PUA-51 | LATIN SMALL LETTER M UNCIAL FORM WITH RIGHT DESCENDER [same x height as ‘m’] (F226 should be retained, but renamed LATIN MEDIUSCULE LETTER M UNCIAL FORM WITH RIGHT DESCENDER) |
F23E | — | — | no | PUA-16 | LATIN SMALL LETTER M UNCIAL FORM WITH ACUTE ACCENT [ same x height as ‘m’] (Based on F23C) |
F2F A | — | 2 | yes | PUA-12 | KRONE SIGN |
F2FB | — | 3 | yes | PUA-12 | HELBELING SIGN |
F2FE | — | 4 | yes | PUA-11 | ROMAN NUMERAL CAPITAL C WITH TWO BARS |
F2FF | — | 5 | yes | PUA-11 | ROMAN NUMERAL SMALL C WITH TWO BARS |