大肠杆菌启动子特征元件对启动子识别的影响
The Influence of E. Coli Promoter Feature Elements on Promoter Recognition
-
摘要: 为了进一步研究大肠杆菌启动子的识别算法,根据大肠杆菌基因分子生物学有关理论与统计事实,对大肠杆菌启动子特征元件进行了研究.从启动子样本中选取不同的保守序列,采用2种神经网络结构.分别分析了启动子特征元件对识别的影响.研究发现.包含-10和-35区2个显著保守序列元件及其他几个非显著序列元件时,正样本和负样本的识别率最高,达到77.67%和88.45%.实验结果为进一步研究启动子的特征提取和识别算法提供了参考.Abstract: In order to research the recognition method for E. coli promoter, E. coli promoter feature elements are researched combining with molecubiology theory and statistical facts of E. coli gene promoters. Two-structure neural network methods were applied to analysis the promoter sequence elements in E. coli gene promoters by selecting different promoter conservative sequences. As a result, we found that the recognition rate of the positives and negatives have the best performance when the canonical elements and the non-canonical sequence elements are all included, which is 77.67% and 88.45% respectively. This result can provide help to the feature selection and the recognition algorithm research of the promoters.