Construction of Complex Features for Computational Predicting ncRNA-Protein Interaction

Dai, Qiguo and Guo, Maozu and Duan, Xiaodong and Teng, Zhixia and Fu, Yueyue (2019) Construction of Complex Features for Computational Predicting ncRNA-Protein Interaction. Frontiers in Genetics, 10. ISSN 1664-8021

[thumbnail of pubmed-zip/versions/1/package-entries/fgene-10-00018/fgene-10-00018.pdf] Text
pubmed-zip/versions/1/package-entries/fgene-10-00018/fgene-10-00018.pdf - Published Version

Download (1MB)

Abstract

Non-coding RNA (ncRNA) plays important roles in many critical regulation processes. Many ncRNAs perform their regulatory functions by the form of RNA-protein complexes. Therefore, identifying the interaction between ncRNA and protein is fundamental to understand functions of ncRNA. Under pressures from expensive cost of experimental techniques, developing an accuracy computational predictive model has become an indispensable way to identify ncRNA-protein interaction. A powerful predicting model of ncRNA-protein interaction needs a good feature set of characterizing the interaction. In this paper, a novel method is put forward to generate complex features for characterizing ncRNA-protein interaction (named CFRP). To obtain a comprehensive description of ncRNA-protein interaction, complex features are generated by non-linear transformations from the traditional k-mer features of ncRNA and protein sequences. To further reduce the dimensions of complex features, a group of discriminative features are selected by random forest. To validate the performances of the proposed method, a series of experiments are carried on several widely-used public datasets. Compared with the traditional k-mer features, the CFRP complex features can boost the performances of ncRNA-protein interaction prediction model. Meanwhile, the CFRP-based prediction model is compared with several state-of-the-art methods, and the results show that the proposed method achieves better performances than the others in term of the evaluation metrics. In conclusion, the complex features generated by CFRP are beneficial for building a powerful predicting model of ncRNA-protein interaction.

Item Type: Article
Subjects: OA STM Library > Medical Science
Depositing User: Unnamed user with email support@oastmlibrary.com
Date Deposited: 25 Feb 2023 12:03
Last Modified: 24 May 2024 06:23
URI: http://geographical.openscholararchive.com/id/eprint/230

Actions (login required)

View Item
View Item