Crynodeb
The advancement of Natural Language Processing (NLP) allows the process of deriving information from large volumes of text to be automated, making text-based resources more discoverable and useful. The attention is turned to one of the most important, but traditionally difficult to access resources in archaeology; the largely unpublished reports generated by commercial or “rescue” archaeology, commonly known as “grey literature”. The paper presents the development and evaluation of a Named Entity Recognition system of Dutch archaeological grey literature targeted at extracting mentions of artefacts, archaeological features, materials, places and time entities. The role of domain vocabulary is discussed for the development of a KOS-driven NLP pipeline which is evaluated against a Gold Standard, human-annotated corpus.
Iaith wreiddiol | Saesneg |
---|---|
Teitl | Communications in Computer and Information Science |
Is-deitl | Metadata and Semantic Research. MTSR 2020 |
Golygyddion | Emmanouel Garoufallou, María-Antonia Ovalle-Perandones |
Man cyhoeddi | Cham |
Cyhoeddwr | Springer |
Tudalennau | 53-64 |
Nifer y tudalennau | 12 |
Cyfrol | 1335 |
ISBN (Electronig) | 978-3-030-71903-6 |
ISBN (Argraffiad) | 978-3-030-71902-9978-3-030-71903-6 |
Dynodwyr Gwrthrych Digidol (DOIs) | |
Statws | Cyhoeddwyd - 21 Maw 2021 |
Digwyddiad | Metadata and Semantic Research, 14th International Conference (2020) - Madrid, Sbaen Hyd: 2 Rhag 2020 → 4 Rhag 2020 Rhif y gynhadledd: 14 |
Cyfres gyhoeddiadau
Enw | Communications in Computer and Information Science |
---|---|
Cyhoeddwr | Springer |
Cyfrol | 1355 |
ISSN (Argraffiad) | 1865-0929 |
ISSN (Electronig) | 1865-0937 |
Cynhadledd
Cynhadledd | Metadata and Semantic Research, 14th International Conference (2020) |
---|---|
Teitl cryno | MTSR 2020 |
Gwlad/Tiriogaeth | Sbaen |
Dinas | Madrid |
Cyfnod | 2/12/20 → 4/12/20 |