Mining long-COVID symptoms from Reddit: characterizing post-COVID syndrome from patient reports.
Our objective was to mine Reddit to discover long-COVID symptoms self-reported by users, compare symptom distributions across studies, and create a symptom lexicon. We retrieved posts from the /r/covidlonghaulers subreddit and extracted symptoms via approximate matching using an expanded meta-lexicon. We mapped the extracted symptoms to standard concept IDs, compared their distributions with those reported in recent literature, and analyzed how those distributions changed over time. From 42 995 posts by 4249 users [...]
Author(s): Sarker, Abeed; Ge, Yao
DOI: 10.1093/jamiaopen/ooab075
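
The abstract describes extracting symptoms by approximate matching against a lexicon and then mapping them to concept IDs. The following is a minimal sketch of that general idea, not the authors' implementation: it assumes a sliding-window comparison using Python's standard difflib, and the lexicon entries, the CID_* identifiers, the function names, and the 0.85 threshold are all hypothetical placeholders chosen for illustration.

```python
import re
from difflib import SequenceMatcher

# Hypothetical mini-lexicon: symptom expression -> placeholder concept ID.
# These IDs are illustrative only and are NOT real standard concept IDs.
LEXICON = {
    "fatigue": "CID_FATIGUE",
    "tired": "CID_FATIGUE",
    "brain fog": "CID_BRAIN_FOG",
    "shortness of breath": "CID_DYSPNEA",
    "headache": "CID_HEADACHE",
}


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens only."""
    return re.findall(r"[a-z']+", text.lower())


def extract_symptoms(post, threshold=0.85):
    """Return {concept_id: best matching span} for one post.

    Each lexicon expression is compared with every token window of the
    same word count; a window counts as a match when its similarity
    ratio reaches the (assumed) threshold.
    """
    tokens = tokenize(post)
    best = {}  # concept_id -> (span, score)
    for expression, concept_id in LEXICON.items():
        width = len(expression.split())
        for start in range(len(tokens) - width + 1):
            window = " ".join(tokens[start:start + width])
            score = SequenceMatcher(None, window, expression).ratio()
            if score >= threshold and score > best.get(concept_id, (None, 0.0))[1]:
                best[concept_id] = (window, score)
    return {cid: span for cid, (span, _) in best.items()}


if __name__ == "__main__":
    example = "Still have brian fog and fatgue, plus shortnes of breath after 3 months"
    print(extract_symptoms(example))
```

Grouping several expressions under one concept ID is what lets colloquial variants and misspellings be counted as the same symptom. A lower threshold would catch more misspellings at the cost of more false positives.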