• Achille Felicetti
  • Daniel Williams
  • Ilenia Galluccio
  • Douglas Tudhope
  • Franco Niccolucci
This paper deals with the development of advanced tools and technologies for creating relevant information and suitable metadata out of textual documentation produced by Italian archaeological research. A set of Natural Language Processing tools were developed to recognize and annotate various archaeological entities in Italian language textual reports. The CIDOC CRM is the ontology chosen for encoding resulting output, allowing for a maximum degree of standardisation of the produced metadata to guarantee interoperability with archaeological information already existing in other semantically enabled digital archives. The work took place as part of the development for the TEXTCROWD platform for the European Open Science Cloud for Research Pilot Project.
Original languageEnglish
Title of host publicationProceedings 3rd International Congress on Digital Heritage
PublisherInstitute of Electrical and Electronics Engineers
Number of pages8
StatePublished - 11 Dec 2018
EventDigital Heritage 2018 - 3rd International Congress & Expo - San Francisco , United States
Duration: 26 Oct 201830 Oct 2018


ConferenceDigital Heritage 2018 - 3rd International Congress & Expo
Abbreviated titleDH2018
CountryUnited States
CitySan Francisco

    Research areas

  • NLP, NER, Italian language archaeology, textual documents, Grey Literature, Metadata integration, Standards, CIDOC CRM

ID: 2929492