Improving Negative Sampling in Graph Neural Networks for Predicting Drug-Drug Interactions

Abstract

Predicting drug-drug interactions (DDIs) becomes an increasingly important problem in the computational domain, due to the acceleration of drug discovery and the high costs of solving this task through in vitro experiments. It can be modelled as a link prediction task on a DDIs network, where each node represents a drug and each link represents a meaningful interaction between a pair of drugs. Graph Neural Networks (GNNs) have achieved state-of-the-art results in node classification and graph classification and consequently there is an increasing interest into their application on link prediction. The well-established GNN-based encoder-decoder framework requires negative links (i.e. non-interactions) for training, as it is performed in a supervised fashion. Most of the current DDI datasets do not provide laboratory-confirmed negative interactions (i.e. non-interacting pairs of drugs) and so negative sampling from the unobserved links becomes an exclusively computational task.There is a lack of research investigating the quality and importance of negative sampling techniques when predicting on homogeneous graphs. This paper focuses on improving negative sampling in GNNs on a homogeneous drug-drug interaction graph. An analysis into the effect of negative sampling on predicting DDIs in such a network is carried out and current approaches are formalised and tested. Following this, LANS, a novel Loss-based Adaptive Negative Sampling technique, is introduced. The lightweight LANS method achieves an increase in performance of 16.49% in the Hits@20 metric without adding any trainable parameters to the architecture. Finally, some of its limitations are identified and potential improvements are proposed.

Publication
IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
Alexandra Herghelegiu
Alexandra Herghelegiu
BSc Student (now at Apple)
Haiping Lu
Haiping Lu
Professor of Machine Learning, Head of AI Research Engineering, and Turing Academic Lead

I am a Professor of Machine Learning. I develop translational AI technologies for better analysing multimodal data in healthcare and beyond.