The wave of digitization has begun. Organizations deal with huge amounts of data, such as logs, websites, and documents. A common way to make the information contained in these sources machine-accessible for automated processing is to first extract the information and then store it in a knowledge graph. A key task in this approach is to recognize entities. While common named entity recognition (NER) models work well for common entity types, they typically fail to recognize custom entities. Custom entity recognition requires data to be manually annotated and custom NER models to be trained. To efficiently extract the information, this paper proposes an innovative solution: Our Gazetteer approach uses a knowledge graph to create a coarse and fast NER component, reducing the need for manual annotation and saving human effort. Focusing on a university use case, our Gazetteer is integrated into a chatbot for entity recognition. In addition, data can be annotated using the Gazetteer and an NER model can be trained. Subsequently, the NER model can be used to recognize unseen custom entities, which are then added to the knowledge graph. This will improve the knowledge graph and make it self-extending.
| Titel | Knowledge-Grounded and Self-Extending NER |
|---|---|
| Medien | In: Stephanidis, C., Antona, M., Ntoa, S., Salvendy, G. (eds) HCI International 2023 Posters. HCII 2023. Communications in Computer and Information Science, Springer, Cham |
| Verlag | Springer, Cham |
| Band | 1836 |
| ISBN | 978-3-031-36003-9 |
| Verfasser | Sudarshan Kamath Barkur, Prof. Dr. Sigurd Schacht, Carsten Lanquillon |
| Seiten | 439–446 |
| Veröffentlichungsdatum | 09.07.2023 |
| Projekttitel | DIAS |
| Zitation | Kamath Barkur, Sudarshan; Schacht, Sigurd; Lanquillon, Carsten (2023): Knowledge-Grounded and Self-Extending NER . In: Stephanidis, C., Antona, M., Ntoa, S., Salvendy, G. (eds) HCI International 2023 Posters. HCII 2023. Communications in Computer and Information Science, Springer, Cham 1836, 439–446. DOI: 10.1007/978-3-031-36004-6_60 |