Главная страница Карта сайта Контактная информация




Главная » Наука » Публикации »
 

 

Mol Biol (Mosk). 2011 Jul-Aug;45(4):724-37. PubMed

[Machine learning study of DNA binding by transcription factors from the LacI family].

Fedonin, G. G.; Rakhmaninova, A. B.; Korostelev, I. u. D.; Laikova, O. N.; Gel'fand, M. S.

We studied 1372 LacI-family transcription factors and their 4484 DNA binding sites using machine learning algorithms and feature selection techniques. The Naive Bayes classifier and Logistic Regression were used to predict binding sites given transcription factor sequences and to classify factor-site pairs on binding and non-binding ones. Prediction accuracy was estimated using 10-fold cross-validation. Experiments showed that the best prediction of nucleotide densities at selected site positions is obtained using only a few key protein sequence positions. These positions are stably selected by the forward feature selection based on the mutual information of factor-site position pairs.



 


  Московский Государственный Университет имени М.В.Ломоносова



Почтовый адрес:
119991 г. Москва, ГСП-1, Ленинские горы МГУ 1, стр. 73,
Факультет биоинженерии и биоинформатики, комната 433.

Телефон / факс: +7 (495) 939-41-95
Справочная телефонов МГУ +7 (495) 939-10-00

E-mail: bioeng@genebee.msu.ru

© 2011 Факультет биоинженерии и биоинформатики
Московского Государственного Университета имени М.В.Ломоносова


 





- создание сайта, 2010