Published in Proceedings of the 4th Celtic Language Technology Workshop, 2022
This paper introduces a Universal Dependencies treebank covering a range of Irish dialects and time periods since 1600. We also establish baselines for lemmatization, tagging, and dependency parsing on this corpus by experimenting with a variety of machine learning approaches.
Recommended citation: Kevin P. Scannell. Diachronic Parsing of Pre-Standard Irish. In Proceedings of the 4th Celtic Language Technology Workshop at LREC 2022, pages 7–13, Marseille, France. European Language Resources Association. https://kevinscannell.com/files/dppsi.pdf