Participation is now being invited in the second edition of the HIPE shared task on named entity processing in historical documents, as part of CLEF 2022 Evaluation Labs.
The HIPE evaluation lab series is organized by researchers from the EPFL Digital Humanities Lab (DHLAB), the University of Lausanne, the University of Zurich and the University of La Rochelle. It is part of the ongoing efforts of the natural language processing and digital humanities communities to adapt and develop appropriate technologies to efficiently retrieve and explore information from historical texts. On such material, however, named entity processing techniques face the challenges of domain and document type heterogeneity, input noisiness, dynamics of language, and lack of resources.
Following the first CLEF-HIPE-2020 evaluation lab on historical newspapers in three languages, HIPE-2022 confronts systems with the challenges of dealing with more languages, learning domain-specific entities, and adapting to diverse annotation tag sets. The objective is to gain new insights into the transferability of named entity processing approaches across languages, time periods, document types, and annotation tag sets.
Compared to the first edition, HIPE-2022 features a new document type and additional languages, and proposes three challenges (newspapers, commentaries, global adaptation), with 35 tracks.
Tasks: NERC and Entity Linking in Multilingual Historical Documents.
Registration: https://clef2022-labs-registration.dei.unipd.it/ (until 22 April 2022)
Training data: 15 February 2022
Evaluation period: 26 April – 6 May 2022
Workshop venue: during CLEF conference, 5-8 September 2022, Bologna, Italy.
Twitter: #HIPE2022 / @clef_initiative / #clef2022