Motivation

Side effects of drugs could cause severe health problems and the failure of drug development. Drug–target interactions are the basis for side effect production and are important for side effect prediction. However, the information on the known targets of drugs is incomplete. Furthermore, there could be also some missing data in the existing side effect profile of drugs. As a result, new methods are needed to deal with the missing features and missing labels in the problem of side effect prediction.

Results

We propose a novel computational method based on transductive matrix co-completion and leverage the low-rank structure in the side effects and drug–target data. Positive-unlabelled learning is incorporated into the model to handle the impact of unobserved data. We also introduce graph regularization to integrate the drug chemical information for side effect prediction. We collect the data on side effects, drug targets, drug-associated proteins and drug chemical structures to train our model and test its performance for side effect prediction. The experiment results show that our method outperforms several other state-of-the-art methods under different scenarios. The case study and additional analysis illustrate that the proposed method could not only predict the side effects of drugs but also could infer the missing targets of drugs.

Availability and implementation

The data and the code for the proposed method are available at https://github.com/LiangXujun/GTMCC.

Supplementary information

Supplementary data are available at Bioinformatics online.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.