TY - JOUR
T1 - DeepCLIP: predicting the effect of mutations on protein–RNA binding with deep learning
AU - Grønning, Alexander Gulliver Bjørnholt
AU - Doktor, Thomas Koed
AU - Larsen, Simon Jonas
AU - Petersen, Ulrika Simone Spangsberg
AU - Holm, Lise Lolle
AU - Bruun, Gitte Hoffmann
AU - Hansen, Michael Birkerod
AU - Hartung, Anne-Mette
AU - Baumbach, Jan
AU - Andresen, Brage Storstein
N1 - Publisher Copyright: © The Author(s) 2020.
PY - 2020/7/27
Y1 - 2020/7/27
N2 - Nucleotide variants can cause functional changes by altering protein–RNA binding in various ways that are not easy to predict. This can affect processes such as splicing, nuclear shuttling, and stability of the transcript. Therefore, correct modeling of protein–RNA binding is critical when predicting the effects of sequence variations. Many RNA-binding proteins recognize a diverse set of motifs and binding is typically also dependent on the genomic context, making this task particularly challenging. Here, we present DeepCLIP, the first method for context-aware modeling and predicting protein binding to RNA nucleic acids using exclusively sequence data as input. We show that DeepCLIP outperforms existing methods for modeling RNA-protein binding. Importantly, we demonstrate that DeepCLIP predictions correlate with the functional outcomes of nucleotide variants in independent wet lab experiments. Furthermore, we show how DeepCLIP binding profiles can be used in the design of therapeutically relevant antisense oligonucleotides, and to uncover possible position-dependent regulation in a tissue-specific manner. DeepCLIP is freely available as a stand-alone application and as a webtool at http://deepclip.compbio.sdu.dk.
AB - Nucleotide variants can cause functional changes by altering protein–RNA binding in various ways that are not easy to predict. This can affect processes such as splicing, nuclear shuttling, and stability of the transcript. Therefore, correct modeling of protein–RNA binding is critical when predicting the effects of sequence variations. Many RNA-binding proteins recognize a diverse set of motifs and binding is typically also dependent on the genomic context, making this task particularly challenging. Here, we present DeepCLIP, the first method for context-aware modeling and predicting protein binding to RNA nucleic acids using exclusively sequence data as input. We show that DeepCLIP outperforms existing methods for modeling RNA-protein binding. Importantly, we demonstrate that DeepCLIP predictions correlate with the functional outcomes of nucleotide variants in independent wet lab experiments. Furthermore, we show how DeepCLIP binding profiles can be used in the design of therapeutically relevant antisense oligonucleotides, and to uncover possible position-dependent regulation in a tissue-specific manner. DeepCLIP is freely available as a stand-alone application and as a webtool at http://deepclip.compbio.sdu.dk.
UR - http://www.scopus.com/inward/record.url?scp=85088495170&partnerID=8YFLogxK
U2 - 10.1093/nar/gkaa530
DO - 10.1093/nar/gkaa530
M3 - Journal article
SN - 0305-1048
VL - 48
SP - 7099
EP - 7118
JO - Nucleic Acids Research
JF - Nucleic Acids Research
IS - 13
ER -