Sequential protein-GDP binding residues prediction

Abstract

Abstract: Accurately identifying the protein-GDP binding sites is of significant importance for both protein function analysis and drug design. Protein-GDP binding residues prediction is a typical imbalanced learning problem. Directly applying the traditional machine learning approach for this task is not suitable as the learning results will be severely biased towards the majority class. To circumvent this problem, on the basis of position specific scoring matrix feature based on sparse representation, weighted under-sampling is developed to make samples balanced. Finally support vector machine is used for prediction. Experimental results show that the proposed method achieves higher prediction performances.

Key words: protein-GDP binding prediction, position specific scoring matrix, sparse representation, weighted under-sampling, support vector machine

摘要： 正确地识别蛋白质-二磷酸鸟苷（Guanosine Diphosphate，GDP）绑定位点对于蛋白质功能分析和药物设计有非常重要的意义。蛋白质-GDP绑定位点预测是一个典型的不平衡学习问题。直接应用传统的机器学习方法是不合适的，而且会使预测结果偏向大多数类。为了解决这个问题，在基于稀疏表示的位置特异性得分矩阵特征基础上，提出了加权下采样方法来使得样本平衡，采用支持向量机算法来预测。实验结果表明提出的方法能获得更高的预测性能。

关键词: 蛋白质-GDP绑定预测, 位置特异性得分矩阵, 稀疏表示, 加权下采样, 支持向量机

SHI Dahong, HE Xue. Sequential protein-GDP binding residues prediction[J]. Computer Engineering and Applications, 2016, 52(13): 55-59.

石大宏，何雪. 序列蛋白质-GDP绑定位点预测[J]. 计算机工程与应用, 2016, 52(13): 55-59.

[1]	ZHANG Xiaowen, REN Yongfeng. Image Matching Algorithm Combining Sparse Representation and Topological Similarity [J]. Computer Engineering and Applications, 2021, 57(8): 198-203.
[2]	GAO Yikai, PENG Li, XU Longzhuang. Flame Recognition Method Using TWSVM with Improved Artificial Fish Swarm Algorithm [J]. Computer Engineering and Applications, 2021, 57(8): 204-213.
[3]	HAN Weiyu, CHENG Longsheng. Research on Roling Bearing Failure Mode Classification Based on MTS and SVM [J]. Computer Engineering and Applications, 2021, 57(6): 239-246.
[4]	LEI Henglin, Gulanbaier Tuerhong, Mairidan Wushouer, ZHANG Dongmei. Review of Novelty Detection [J]. Computer Engineering and Applications, 2021, 57(5): 47-55.
[5]	WEN Jiebin, YANG Wenzhong, MA Guoxiang, ZHANG Zhihao, LI Hailei. Micro-expression Recognition Based on Apex Frame Optical Flow and Convolutional Autoencoder [J]. Computer Engineering and Applications, 2021, 57(4): 127-133.
[6]	TAO Tiwei, LIU Mingxia, WANG Mingliang, WANG Linlin, YANG Deyun, ZHANG Qiang. Effective Distance Based Low-Rank Representation [J]. Computer Engineering and Applications, 2021, 57(4): 141-147.
[7]	XU Xianfeng, CAI Lulu, ZHANG Li. Photovoltaic Power Generation Prediction Algorithm Based on MLP and DBN [J]. Computer Engineering and Applications, 2021, 57(3): 266-272.
[8]	LI Junxia, ZHANG Qin, ZHENG Guimei. Overview of Human Posture Recognition by Ultra-wideband Radar [J]. Computer Engineering and Applications, 2021, 57(3): 14-23.
[9]	CHEN Fujian, XIE Weixin, XIA Ting. Adaptive Anti-occlusion Target Tracking Algorithm Based on LCT+ [J]. Computer Engineering and Applications, 2021, 57(22): 190-198.
[10]	YANG Quan. SVM Algorithm for N1+N2 Structure Syntax Relation Determination [J]. Computer Engineering and Applications, 2021, 57(20): 104-108.
[11]	GAO Jin, ZHAO Yunpeng, Godfred Kim Mensah, LI Xinyun, LIU Zhifen, CHEN Junjie, GUO Hao. Research on Spatial Dynamics Analysis and Classification of Resting-State Functional Brain Connections [J]. Computer Engineering and Applications, 2021, 57(2): 150-155.
[12]	DING Yuxiang, BIAN Weixin, JIE Biao, ZHAO Jun. Super-Resolution Image Reconstruction Based on Neighborhood Regression and Sparse Representation [J]. Computer Engineering and Applications, 2021, 57(2): 230-236.
[13]	QIN Boyu, HAO Xiaoyan, LIU Yongfang. Frame Disambiguation of FrameNet Based on SVM and CRF Two-Stage Model [J]. Computer Engineering and Applications, 2021, 57(18): 255-262.
[14]	XU Ranran, WU Xiaojun, YIN Hefeng. Face Recognition via Discriminative Non-negative Representation Based Classification [J]. Computer Engineering and Applications, 2021, 57(13): 147-153.
[15]	ZHENG Linwen, ZHOU Jinzhi, HUANG Jing. Application of Deep Sparse Auto-Encoders in ECG Feature Extraction [J]. Computer Engineering and Applications, 2021, 57(11): 156-161.

Sequential protein-GDP binding residues prediction

序列蛋白质-GDP绑定位点预测

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics