This is not currently part of the peer-reviewed material of the project. Do not cite as a research publication.
This month I made a major change to the database in the form of changing the encoding to be compatible with Unicode's supplementary multilingual planes (i.e. converting the encoding to utf8mb4, or 4-byte compatible utf8). This allows for a much greater variety of characters and symbols.
I also did the preliminary development of a new editing interface in the JQuery Mobile (JQM) framework (rather than the home-made Skaldic Project framework that forms previously have been restricted to). This is not a simple task!
The new interface uses JQM form fields and highly minimalist information sent back to server — basically only fields changed are sent back to the server.
The new code:
Various problems with passing and processing data have been debugged, but a number of form field types are not implemented (esp. more complex ajax types, images, files and maps).
Other changes to interface:
I also created a new lemmatisation form in the new interface. This requires fewer updates because of ajax lookups, and has more information in ajax popups. Likewise, I also adapted the lexical variants form for the new interface with a simpler layout and information closer at hand thanks to ajax popups, and lookups for lemmas (previously if nothing was matched for a variant when the form loaded, then lemmatising would have to be done separately).
Stanzas in corpus: | 5791 | |
Stanzas entered in database: | 4776 | (82.5%) |
Variant readings: | 46473 | |
Words in corpus: | 150368 | |
Words lemmatised: | 96463 | (64.2%) |
Stanzas with lexical variants: | 602 | (10.4%) |
Lexical variants added: | 3687 | |
Lexical variants lemmatised: | 3012 | (81.7%) |
Headwords linked to corpus: | 13314 |