Supervisor: Romana Pernisch (r.pernisch@vu.nl)
When ontologies and knowledge graphs are engineered, they capture the domain at a specific moment in time. However, domains do not remain static and it is important to keep ontologies up to date.
In this project, you will investigate how to detect weather an ontology is need of changes because of new source data You can make use of an existing pipeline to investigate the coverage of a specific domain ontology against a corpus of text which could in turn help you determin where updates are needed. You can also take a different approach. You can work on two specific domains (or choose a third one yourself): Clinical Trial Outcomes or Companion Planting. Pretraining of a language model might be necessary
In the case of Clinical Trial Outcomes, the text are actual clinical trials, from clinicaltrails.gov. The outcome measures which are captured in the ontology, can be extrasted easily as the clinical trial inforamtion is provided in a semi-structured way.
In the case of Companion Planting, the texts against which this evaluation could be conducted are general literature on companion planting or websites. For this, first a collection of documents would need to be done, before one can apply OntoEval.
In the case of a MSc thesis, this project can be extended into different directions.
“OntoEval: an Autoamted Ontology Evaluation System” [link]