Enhancing text categorization with semantic-enriched representation and training data augmentation.
Acquiring and representing biomedical knowledge is an increasingly important component of contemporary bioinformatics. A critical step in this process is to efficiently identify and retrieve relevant documents from the vast volume of modern biomedical literature. In practice, many information retrieval tasks are difficult because of high data dimensionality and a lack of annotated examples for training a retrieval algorithm. Under such a scenario, the performance of information retrieval [...]
Author(s): Lu, Xinghua; Zheng, Bin; Velivelli, Atulya; Zhai, Chengxiang
DOI: 10.1197/jamia.M2051