This is not currently part of the peer-reviewed material of the project. Do not cite as a research publication.
This month I made some minor updates to the variant lemmatising form, including a button to look up word forms in ONP that have not previously been recorded in the database.
I also worked a bit with ONP's data on compounds and links to dictionaries/glosses with a view to incorporating and / or using the information in LP.
Progress at 28/8/17
Stanzas in corpus: | 5797 | |
Stanzas entered in database: | 4845 | (83.6%) |
Variant readings: | 46654 | |
Words in corpus: | 150501 | |
Words lemmatised: | 110416 | (73.4%) |
Stanzas with lexical variants: | 752 | (13.0%) |
Lexical variants added: | 6120 | |
Lexical variants lemmatised: | 5010 | (81.9%) |
Headwords linked to corpus: | 14268 |