Identifying relationships between drugs and medical conditions: winning experience in the Challenge 2 of the OMOP 2010 Cup
Vladimir Nikulin

TL;DR
This paper introduces a novel ensemble method using random re-sampling to identify hidden drug-condition associations in large unlabelled observational data, achieving top performance in the OMOP Cup challenge.
Contribution
The study presents a new ensemble approach based on homogeneous bagging for detecting drug safety signals in large unlabelled datasets, outperforming previous methods.
Findings
Achieved top performance in OMOP Cup Challenge N2
Effectively identified hidden drug-condition associations
Demonstrated the utility of ensemble methods in unlabelled data
Abstract
There is a growing interest in using a longitudinal observational databases to detect drug safety signal. In this paper we present a novel method, which we used online during the OMOP Cup. We consider homogeneous ensembling, which is based on random re-sampling (known, also, as bagging) as a main innovation compared to the previous publications in the related field. This study is based on a very large simulated database of the 10 million patients records, which was created by the Observational Medical Outcomes Partnership (OMOP). Compared to the traditional classification problem, the given data are unlabelled. The objective of this study is to discover hidden associations between drugs and conditions. The main idea of the approach, which we used during the OMOP Cup is to compare the numbers of observed and expected patterns. This comparison may be organised in several different ways,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Statistical Methods in Clinical Trials · Statistical Methods and Inference
