Annotation and initial evaluation of a large annotated German oncological corpus.
We present the Berlin-Tübingen-Oncology corpus (BRONCO), a large and freely available corpus of shuffled sentences from German oncological discharge summaries annotated with diagnosis, treatments, medications, and further attributes including negation and speculation. The aim of BRONCO is to foster reproducible and openly available research on Information Extraction from German medical texts.
Author(s): Kittner, Madeleine, Lamping, Mario, Rieke, Damian T, Götze, Julian, Bajwa, Bariya, Jelas, Ivan, Rüter, Gina, Hautow, Hanjo, Sänger, Mario, Habibi, Maryam, Zettwitz, Marit, de Bortoli, Till, Ostermann, Leonie, Ševa, Jurica, Starlinger, Johannes, Kohlbacher, Oliver, Malek, Nisar P, Keilholz, Ulrich, Leser, Ulf
DOI: 10.1093/jamiaopen/ooab025