hrcak mascot   Srce   HID

Izvorni znanstveni članak
https://doi.org/10.2498/cit.2004.02.02

Binary Coding, mRNA Information and Protein Structure

Nikola Gotovac
Nikola Štambuk
Paško Konjevoda

Puni tekst: engleski, pdf (210 KB) str. 73-81 preuzimanja: 603* citiraj
APA 6th Edition
Gotovac, N., Štambuk, N. i Konjevoda, P. (2004). Binary Coding, mRNA Information and Protein Structure. Journal of computing and information technology, 12 (2), 73-81. https://doi.org/10.2498/cit.2004.02.02
MLA 8th Edition
Gotovac, Nikola, et al. "Binary Coding, mRNA Information and Protein Structure." Journal of computing and information technology, vol. 12, br. 2, 2004, str. 73-81. https://doi.org/10.2498/cit.2004.02.02. Citirano 24.09.2020.
Chicago 17th Edition
Gotovac, Nikola, Nikola Štambuk i Paško Konjevoda. "Binary Coding, mRNA Information and Protein Structure." Journal of computing and information technology 12, br. 2 (2004): 73-81. https://doi.org/10.2498/cit.2004.02.02
Harvard
Gotovac, N., Štambuk, N., i Konjevoda, P. (2004). 'Binary Coding, mRNA Information and Protein Structure', Journal of computing and information technology, 12(2), str. 73-81. https://doi.org/10.2498/cit.2004.02.02
Vancouver
Gotovac N, Štambuk N, Konjevoda P. Binary Coding, mRNA Information and Protein Structure. Journal of computing and information technology [Internet]. 2004 [pristupljeno 24.09.2020.];12(2):73-81. https://doi.org/10.2498/cit.2004.02.02
IEEE
N. Gotovac, N. Štambuk i P. Konjevoda, "Binary Coding, mRNA Information and Protein Structure", Journal of computing and information technology, vol.12, br. 2, str. 73-81, 2004. [Online]. https://doi.org/10.2498/cit.2004.02.02

Sažetak
We describe new binary algorithm for the prediction of α and β protein folding types from RNA, DNA and amino acid sequences. The method enables quick, simple and accurate prediction of α and β protein folds on a personal computer by means of a few binary patterns of coded amino acid and nucleotide physicochemical properties. The algorithm was tested with machine learning SMO (sequential minimal optimization) classifier for the support vector machines and classification trees, on a dataset of 140 dissimilar protein folds. Depending on the method of testing, the overall classification accuracy was 91.43% – 100% and the tenfold cross-validation result of the procedure was 83.57% – >90%.

Genetic code randomization analysis based on 100,000 different codes tested for the protein fold prediction quality indicated that: a) there is a very low chance of p = 2.7 x 10^(-4) that a better code than the natural one specified by the binary coding algorithm is randomly produced, b)dipeptides represent basic protein units with respect to the natural genetic code defining of the secondary protein structure.

Hrčak ID: 44722

URI
https://hrcak.srce.hr/44722

Posjeta: 739 *