Quantitative assessment of dictionary-based protein named entity tagging.
Natural language processing (NLP) approaches have been explored to manage and mine information recorded in biological literature. A critical step in biological literature mining is biological named entity tagging (BNET), which identifies names mentioned in text and normalizes them to entries in biological databases. The aim of this study was to provide a quantitative assessment of the complexity of BNET on protein entities through BioThesaurus, a thesaurus of gene/protein names for [...]
Author(s): Liu, Hongfang; Hu, Zhang-Zhi; Torii, Manabu; Wu, Cathy; Friedman, Carol
DOI: 10.1197/jamia.M2085