Jump to Main Content
CDSfold: an algorithm for designing a protein-coding sequence with the most stable secondary structure
- Terai, Goro, Kamegai, Satoshi, Asai, Kiyoshi
- Bioinformatics 2016 v.32 no.6 pp. 828-834
- Internet, algorithms, amino acid sequences, bioinformatics, computer software, messenger RNA, nucleotide sequences, protein synthesis, synthetic biology
- Motivation: An important problem in synthetic biology is to design a nucleotide sequence of an mRNA that confers a desirable expression level of a target protein. The secondary structure of protein-coding sequences (CDSs) is one potential factor that could have both positive and negative effects on protein production. To elucidate the role of secondary structure in CDSs, algorithms for manipulating secondary structure should be developed. Results: We developed an algorithm for designing a CDS with the most stable secondary structure among all possible ones translated into the same protein, and implemented it as the program CDSfold. The algorithm runs the Zuker algorithm under the constraint of a given amino acid sequence. The time and space complexity is O(L³) and O(L²), respectively, where L is the length of the CDS to be designed. Although our algorithm is slower than the original Zuker algorithm, it could design a relatively long (2.7-kb) CDS in approximately 1 h. Availability and implementation: The CDSfold program is freely available for non-commercial users as stand-alone and web-based software from http://cdsfold.trahed.jp/cdsfold/. Contacts: firstname.lastname@example.org or email@example.com Supplementary information: Supplementary data are available at Bioinformatics online.