Title: |
Enhancing Biomedical Knowledge Discovery for Diseases: An Open-Source Framework Applied on Rett Syndrome and Alzheimer's Disease |
Authors: |
Theodoropoulos, Christos, Coman, Andrei Catalin, Henderson, James, Moens, Marie-Francine |
Publication Year: |
2024 |
Collection: |
Computer Science |
Subject Terms: |
Computer Science - Computation and Language, Computer Science - Artificial Intelligence |
More Details: |
The ever-growing volume of biomedical publications creates a critical need for efficient knowledge discovery. In this context, we introduce an open-source end-to-end framework designed to construct knowledge around specific diseases directly from raw text. To facilitate research in disease-related knowledge discovery, we create two annotated datasets focused on Rett syndrome and Alzheimer's disease, enabling the identification of semantic relations between biomedical entities. Extensive benchmarking explores various ways to represent relations and entities, offering insights into optimal modeling strategies for semantic relation detection and highlighting language models' competence in knowledge discovery. We also conduct probing experiments using different layer representations and attention scores to explore transformers' ability to capture semantic relations. Comment: Published in IEEE Access, doi: 10.1109/ACCESS.2024.3509714 |
Document Type: |
Working Paper |
DOI: |
10.1109/ACCESS.2024.3509714 |
Access URL: |
http://arxiv.org/abs/2407.13492 |
Accession Number: |
edsarx.2407.13492 |
Database: |
arXiv |