Secondary structure of prion mRNA
互联网
Secondary structure of prion mRNA
Luck R; Steger G; Riesner D
Institut fur Physikalische Biologie, Heinrich-Heine-Universitat, Dusseldorf, FRG.
J Mol Biol 258: 813-26 (1996)
An algorithm for prediction of conserved secondary structure of single-stranded RNA is presented. For each RNA of a set of homologous RNAs, optimal and suboptimal secondary structures are calculated and stored in a base-pair probability matrix. A multiple sequence alignment is performed for the set of RNAs. The resulting gaps are introduced into the individual probability matrices. These homologous probability matrices are summed to give a consensus probability matrix emphasizing the conserved secondary structure elements of the RNA set. Thus the algorithm combines the advantages of thermodynamic structure prediction by energy minimization with the information obtained from phylogenetic alignment of sequences.
The algorithm is applied to three examples. The REV-responsive element of HIV, the structure of which is well known from the literature, was chosen to test the algorithm. The second example is the 3' terminal segment of genomic single-stranded RNAs of cucumber mosaic viruses; a structure similar to that of the related brome mosaic virus was expected and was confirmed. The third example is the prion-protein mRNA from different organisms; the structure of this mRNA is not known. By application of the algorithm highly conserved hairpins were found in the prion-protein mRNA. Introduction
Abbreviations used: bp, base-pair; nt, nucleotides; ssRNA, single-stranded RNA; ORF, open reading frame; UTR, untranslated region
PrP is the major component of prions (for review see Prusiner, 1994), which cause several neurodegenerative diseases in humans and animals, including scrapie in sheep, bovine spongiform encephalopathy (BSE) in cattle, and Creutzfeldt-Jakob disease (CJD) in man. During prion infection, an abnormal isoform of PrP, in the case of scrapie designated PrP Sc , is produced from the cellular isoform of PrP C , which is encoded by a chromosomal gene, by an unknown process. In spite of intensive studies, differences between the chemical compositions of PrP C and PrP Sc could not be found (Stahl et al., 1993). Thus, a mere conformational change as the origin of the transformation of PrP C into PrP Sc is an attractive model (for review see Prusiner, 1994). It might be that putative structural elements in the PrP mRNA influence the kinetics of sequential folding of the protein during translation, and that the process of infection acts via those structural elements. A potential stemloop structure in one PrP mRNA, i.e. that of man, has been discussed (Wills & Hughes, 1990).
If, however, structural elements of the PrP mRNA are considered as functionally relevant for the development of all prion diseases, this feature must be found in every species that is susceptible to this class of disease. Therefore the new algorithm will be applied to check PrP mRNA for evolutionarily conserved secondary structures.
Prediction of structural elements in mRNA of prion protein
From the EMBL data bank, 23 PrP mRNA sequences of the following species were available: three rodents (mouse, hamster, rat), two ruminants (cattle, sheep), the human and 17 non-human primates consisting of four apes (gorilla, chim-panzee, gibbon, orangutan), seven old-world mon-keys (colobus, presbytis, baboon, mandrill, rhesus macaque, Macaca arctoides, African green monkey), and six new-world monkeys (spider, squirrel, capuchin, aotes, marmoset, titi). Calculations of individual secondary structures of these molecules with the programs RNAfold and LinAll revealed that thermodynamically stable elements of sec-ondary structure are located mainly in the 5' region including the ORF. In contrast, the 3' UTR with a lower G + C content (41% G + C versus 53% G + C in the ORF of the hamster sequence) is mainly single stranded. As examples, the calculated secondary structures of human and hamster PrP mRNAs are shown in Figure 5.
Alignment of the 23 sequences with CLUSTAL V (Higgins & Sharp, 1988; Higgins, 1994) showed considerable homology in an 1100 nt fragment (individual sequences starting between nt -42 and -10 and ending at nt 788 to 1039; numbering relative to the start of the ORF). These fragments that contain the ORF were chosen for further studies. They have a length of 735 to 795 nt and showed a high degree of homology (74 to 97%). The secondary structure distributions of the individual fragments were calculated with RNAfold at 50?C. With the alignment mentioned above, the consensus base-pair probability matrix was calculated and is presented as dot plot in Figure 6.
Surprisingly, the dot plot shows that PrP mRNAs contain conserved structural elements at only very few positions despite both the high degree of sequence conservation and the high degree of base-pairing as predicted for the individual PrP mRNAs (Figure 5). From this dot plot we identified as candidates of conserved structural elements five hairpins (HP A to E); their occurrence in the different organisms is listed in Table 1. Only HP C is present in all sequences and shows a high probability of base-pairing in most sequences. Owing to nucleotide exchanges the detailed structure of the HP C stem differs between the groups of species (Figure 7); a tendency exists, however, to preserve or to restore all base-pairs. In particular the loop of HP C is highly conserved and has the sequence 5'-ACCHCA-3' with H being a pyrimidine, except in the capuchin and the marmoset sequences. HP C is located at nt 140 to 165 (nt 149 to 174 for the ruminants) relative to the start of the ORF. On the amino acid level the triplet 5'-CCH-3' of the HP C loop represents the first amino acid of the octarepeat region, which consists of five nearly perfect tandem repeats of eight amino acids. Figure 5. Schematic drawings of the optimal secondary structure of human (left) and hamster (right) PrP mRNA as predicted by RNAfold at 50℃.
Figure 6. Consensus dot plot of 23 PrP mRNA fragments including the ORF. The individual base-pair dot plots were calculated with RNAfold at 50?C and overlaid according to the alignment produced by CLUSTAL. Size of dots is proportional to the cubic power of the base-pair probability (a = 3, b = 1); thus helices of low probability are suppressed. Hatched areas show the gaps in the alignment; with increasing darkness gaps are present in an increasing number of sequences. Hairpins marked HP A to E have a high probability of base-pairing and are conserved.
Table 1. Comparison of individual base-pairing probabilities of five hairpins (HP A to E) conserved in PrP mRNA Species HP Rodents Cattle, sheep Man, apes OW monkeys NW monkeys A +++ - - +(-)a +(-)b B ++ +++ +(++)a ++ - (+)a C +++(++)a + +++(++)a +++ +++(++)b D - ++ +++ +++ +++(++)a E ++(+)a - +++ +++ +++(++)a The hairpins are present with a base-pairing probability of at least 0.8 (+++), in between 0.03 and 0.8 (++), 0.001 and 0.03 (+) and less than 0.001 (-). a A second value in parenthesis represents one species that differs from the rest of the group. b A second value in parenthesis represents two species that differ from the rest of the group.
Figure 7. Schematic drawings of hairpin C of PrP mRNAs:
According to the widely accepted prion hypothesis (Prusiner, 1982), formation of the isoform PrP Sc from the cellular form PrP C is the basic event for prion infectivity and pathology. The molecular mechanism for the formation of PrP Sc is not known. There is no evidence for the existence of a specific nucleic acid essential for prion infectivity (for review see Riesner, 1991; Kellings et al., 1992), and no chemical difference between the two isoforms could be found (Stahl et al., 1993). Thus, it was suggested that the two PrP isoforms differ only in protein conformation (for review see Prusiner, 1994). Although the measurable differences in conformation between PrP C and PrP Sc are generated in a slow post-translational process, the possibility was never excluded that the nucleus for such a transition, which could comprise only a few PrP molecules, had already been formed during the translation process and that the PrP mRNA structure might have an influence on protein structure formation.
In particular, a conservation of structural motifs among the PrP mRNAs of susceptible species may point to a potential biological function for these motifs. Even if PrP mRNA structure were not involved in the mechanism of infection or pathogenesis of prions it could play a part in the regulation of PrP C expression. In either case, this problem seemed attractive to us for the application for the new algorithm. The minimal energy foldings of the individual PrP mRNAs revealed a high degree of structure formation inside the ORF and a much less structured 3' UTR as a consequence of the higher A + U content. In contrast to both the high degree of secondary structure in the individual mRNA sequences and the high degree of sequence homology, only a very few structural elements were found to be conserved in the ensemble of PrP mRNAs. This feature appears remarkable if the properties of PrP mRNA are compared with those of CMV RNA.
Although the sequence homology of all CMV RNAs was only 72 (213)%, a highly conserved over-all structure could be derived; nothing comparable was found in PrP mRNA in spite of 86(26)% homology in the ORF. Therefore, we have to conclude that a detailed PrP mRNA structure like those shown in Figure 5 is not of biological importance, and only the evolutionarily conserved hairpin structures (HP A to E) should be considered. Three of them (HP A to C) are located in the ORF in a cluster 5' of the octarepeat region, and two (HP D and E) are further downstream close to the stop codon.
In earlier studies of the human PrP mRNA sequence (Wills & Hughes, 1990; Wills, 1992), it was proposed that the octarepeat region contains structural elements with possible relevance for development of prion diseases, and binding of cellular proteins to this region was reported (SchroƠder et al., 1994). In the context of evolutionary conservation these structures could not be confirmed. Therefore it is unlikely that those elements act as structural targets at the RNA level. While investigating RNA secondary structure conservation inside the coding region of mRNA, one has to take into account the influence of amino acid conservation. Is a conserved mRNA structural motif a mere coincidence of codon usage for the amino acid sequence? This could be tested either by looking for compensatory base exchanges in stem regions or by comparing the general codon usage of an mRNA with the codons used in the structural element.
While conserving the amino acid sequence, compensatory base exchanges in a structural element are only possible if certain base-pairs are formed between wobble positions of codons. This is not the case with HP C (see symbols in Figure 7). An investigation of codon usage in HP C revealed that in the 5' part of the stem common codons are mostly used. In contrast, the 3' part of the stem, which contains three glycine codons, uses GGU in 70% of all codons although this codon is generally used below 30%. In addition, there is an obvious selection pressure for the wobble base in the first proline codon of the loop. It is coded by CCA in all 23 sequences despite approximately 20 % codon usage, whereas in the adjacent proline CCU, CCC and CCA are used. Therefore, we conclude that the conserved nucleic acid structures in the ORF are not a coincidence from the codon usage but are a product of selection for a nucleic acid structure.
Among the five conserved hairpins, HP C is present with a high base-pairing probability in most of the studied sequences. In contrast to the slight variability of the HP C stem, its loop region is formed by the highly conserved sequence 5'-AC-CHCA- 3'. Thus the stem of HP C might be responsible only for exposing the conserved loop region from the global molecular structure, as was suggested for stem 1 of RRE (see above). The loop may serve as the essential target element for binding to cellular proteins or even to the infectious isoform of PrP. Another potential function of HP C (and of the other conserved hairpins) during the develop-ment of prion diseases and the formation of infectious PrP Sc might be correlated to the general influence of mRNA secondary structure on protein conformation.
The in vivo folding of a polypeptide chain into its native three-dimensional structure is a sequential process (for review see Jaenicke, 1993). It was argued that the conformation of some proteins depends on the rate and kinetics of translation, which is influenced by the presence and stability of structural elements of the translated mRNA (Purvis et al., 1987). Translation initiation is most sensitive to structures close to the entry site of ribosomes (Kozak, 1989), but also structures located further 3' in the coding region still reduce translation efficiency (Liebhaber et al., 1992). A modulation of protein folding by translational pauses resulting from mRNA secondary structure and codon usage has been proposed for the MS2 coat protein of Escherichia coli (Guisez et al., 1993).The examples mentioned from the literature suggest that the conserved secondary structure elements in PrP mRNA might well affect translation and thereby the conformation of PrP.
References
Ahlquist, P., Dasgupta, R. & Kaesberg, P. (1981). Near identity of 3' RNA secondary structure in bro-moviruses and cucumber mosaic virus. Cell, 23, 183 189.
Antao, V. P. & Tinoco, I. Jr (1992). Thermodynamic parameters for loop formation in RNA and DNA hairpin tetraloops. Nucl. Acids Res. 20, 819 824.
Antao, V. P., Lai, S. Y. & Tinoco, I. Jr (1991). A thermodynamic study of unusually stable RNA and DNA hairpins. Nucl. Acids Res. 19, 5901 5905.
Bare, L. A. & Uhlenbeck, O. C. (1989). Specific substitution into the anticodon loop of yeast tyrosine transfer RNA. Biochemistry, 25, 5825 5830.
Bellman, R. & Kalaba, R. (1960). On kth best policies. SIAM J. Appl. Math. 8, 582588.
Bolton, D. C., McKinley, M. P. & Prusiner, S. B. (1982). Identification of a protein that purifies with the scrapie prion. Science, 218, 1309 1311.
Chan, L., Zuker, M. & Jacobson, A. B. (1991). A computer method for finding common base paired helices in aligned sequences: application to the analysis of random sequences Nucl. Acids Res. 19, 353 358.
Corpet, F. & Michot, B. (1994). RNAlign: alignment of RNA sequences using both primary and secondary structures. Comp. Appl. Biosci. 10, 389 399. Cullen, B. R. (1994). RNA-sequence-mediated gene regulation in HIV-1. Infect. Agents Dis. 3, 6876.
Dam, E.t., Pleij, K. & Draper, D. (1992). Structural and functional aspects of RNA pseudoknots. Biochemistry, 31, 1166511676.
Dayton, E. T., Powell, D. M. & Dayton, A. I. (1989). Functional analysis of CAR, the target sequence for the Rev protein of HIV-1. Science, 246, 16251629.
Dayton, E. T., Konings, D. A. M., Powell, D. M., Shapiro, B. A., Butini, L., Maizel, J. V. & Dayton, A. I. (1992). Extensive sequence-specific information throughout the CAR/RRE, the target sequence of the human immunodeficiency virus type 1 Rev protein. J. Virol. 66, 11391151.
Devereux, J., Haeberli, P. & Smithies, O. (1984). A comprehensive set of sequence analysis programs for the VAX. Nucl. Acids Res. 12, 387395.
Dreher, T. W. & Hall, T. C. (1988). Mutational analysis of the tRNA mimicry of brome mosaic virus RNA. Sequence and structural requirements for amino-acylation and 3'-adenylation. J. Mol. Biol. 201, 4155.
Eddy, S. R. & Durbin, R. (1994). RNA sequence analysis using covariance models. Nucl. Acids Res. 22, 20792088.
Felden, B., Florentz, C., GiegeƬ R. & Westhof, E. (1994). Solution structure of the 3'-end of brome mosaic virus genomic RNAs. Conformational mimicry with canonical tRNAs. J. Mol. Biol. 235, 508531.
Fontana, W., Konings, D. A. M., Stadler, P. F. & Schuster, P. (1993). Statistics of RNA secondary structures. Biopolymers, 33, 13891404.
Giege, R., Puglisi, J. D. & Florentz, C. (1993). tRNA structure and aminoacylation efficiency. Prog. Nucl. Acid Res. Mol. Biol. 45, 129206.
Groebe, D. R. & Uhlenbeck, O. C. (1988). Characterization of RNA hairpin loop stability. Nucl. Acids Res. 16, 1172511735.
Guisez, Y., Robbens, J., Remaut, E. & Fiers, W. (1993). Folding of the MS2 coat protein in Escherichia coli is modulated by translational pauses resulting from mRNA secondary structure and codon usage: a hypothesis. J. Theor. Biol. 162, 243252.
Gutell, R. R., Weiser, B., Woese, C. R. & Noller, H. F. (1985). Comparative anatomy of 16-S-like ribosomal RNA. Prog. Nucl. Acid Res. Mol. Biol. 32, 155216. Haenni, A.-L., Joshi, S. & Chapeville, F. (1982). tRNA-like structures in the genomes of RNA viruses. Prog. Nucl. Acid Res. Mol. Biol. 27, 85104.
Heaphy, S., Dingwall, C., Ernberg, I., Gait, M. J., Green, S. M., Karn, J., Lowe, A. D., Singh, M. & Skinner, M. A. (1990). HIV-1 regulator of virion expression (Rev) protein binds to an RNA stemloop structure located within the Rev response element region. Cell, 60, 685693.
Higgins, D. G. (1994). CLUSTAL V: multiple alignment of DNA and protein sequences. Methods Mol. Biol. 25, 307318.
Higgins, D. G. & Sharp, P. M. (1988). CLUSTAL: a package for performing multiple sequence alignment on a microcomputer. Gene, 73, 237244.
Himeno, H., Hasegawa, T., Ueda, T., Watanabe, K. & Shimizu, M. (1990). Conversion of aminoacylation specificity from tRNA Tyr to tRNA Ser in vitro. Nucl. Acids Res. 18, 68156819.
Jaeger, J. A., Turner, D. H. & Zuker, M. (1990). Predicting optimal and suboptimal secondary structure for RNA. Methods Enzymol. 183, 281306.
Jaenicke, R. (1993). What does protein refolding in vitro tell us about protein folding in the cell? Phil. Trans. R. Soc. Lond. Biol. 339, 287294.
Joshi, R. L., Joshi, S., Chapeville, F. & Haenni, A. L. (1983). tRNA-like structures of plant viral RNAs: confor-mational requirements for adenylation and aminoa-cylation. EMBO J. 2, 11231127.
Kellings, K., Meyer, N., Mirenda, C., Prusiner, S. B. & Riesner, D. (1992). Further analysis of nucleic acids in purified scrapie prion preparations by improved return refocussing gel electrophoresis (RRGE). J. Gen. Virol. 73, 10251029.
Kjems, J., Brown, M., Chang, D. D. & Sharp, P. A. (1991). Structural analysis of the interaction between the human immunodeficiency virus Rev protein and the Rev response element. Proc. Natl Acad. Sci. USA, 88, 683687.
Kjems, J., Calnan, B. J., Frankel, A. D. & Sharp, P. A. (1992). Specific binding of a basic peptide from HIV-1 Rev. EMBO J. 11, 11191129.
Kohl, R. J. & Hall, T. C. (1974). Aminoacylation of RNA from several viruses. Amino acid specificity and differential activity of plant, yeast, and bacterial synthetases. J. Gen. Virol. 25, 257261.
Kozak, M. (1989). Circumstances and mechanisms of inhibition of translation by secondary structure in eucaryotic mRNAs. Mol. Cell Biol. 9, 51345142.
Le, S. Y. & Zuker, M. (1990). Common structures of the 5' non-coding RNA in enteroviruses and rhi-noviruses. Thermodynamical stability and statistical significance. J. Mol. Biol. 216, 729741.
Le, S. Y. & Zuker, M. (1991). Predicting common foldings of homologous RNAs. J. Biomol. Struct. Dynam. 8, 10271044.
Le, S. Y., Owens, J., Nussinov, R., Chen, J. H., Shapiro, B. & Maizel, J. V. (1989). RNA secondary structures: comparison and determination of frequently recur-ring substructures by consensus. Comput. Appl. Biosci. 5, 205210.
Le, S. Y., Malim, M. H., Cullen, B. R. & Maizel, J. V. (1990). A highly conserved RNA folding region coincident with the Rev response element of primate im-munodeficiency viruses. Nucl. Acids Res. 18, 1613 1623.
Lee, C. P. & RajBhandary, U. L. (1991). Mutants of Escherichia coli initiator tRNA that suppress amber codons in Saccharomyces cerevisiae and are amino-acylated with tyrosine by yeast extracts. Proc. Natl Acad. Sci. USA, 88, 1137811382.
Liebhaber, S. A., Cash, F. & Eshleman, S. S. (1992). Translation inhibition by an mRNA coding region secondary structure is determined by its proximity to the AUG initiation codon. J. Mol. Biol. 226, 609621.
Malim, M. H., Hauber, J., Le, S. Y., Maizel, J. V. & Cullen, B. R. (1989). The HIV-1 rev trans-activator atcs through a structured target sequence to activate nuclear export of unspliced viral mRNA. Nature, 338, 254257.
Malim, M. H., Tiley, L. S., McCarn, D. F., Rusche, J. R., Hauber, J. & Cullen, B. R. (1990). HIV-1 structural gene expression requires binding of the Rev trans-activator to its RNA target sequence. Cell, 60, 675683.
McCaskill, J. S. M. (1990). The equilibrium partition function and base pair binding probabilities for RNA secondary structure. Biopolymers, 29, 1105 1119.
Mermer, B., Felber, B. K., Campbell, M. & Pavlakis, G. N. (1990). Identification of trans-dominant HIV-1 rev protein mutants by direct transfer of bacterially produced proteins into human cells. Nucl. Acids Res. 18, 20372044.
Myers, G., Korber, B., Berzofsky, J. A., Smith, R. F. & Pavlakis, G. N. (1993). Human Retroviruses and AIDS 1993. A Compilation and Analysis of Nucleic Acid and Amino Acid Sequences. Theoretical Biology and Biophysics Group, Los Alamos National Laboratory, Los Alamos/NMEX, USA. Noller, H. F. (1984). Structure of ribosomal RNA. Annu. Rev. Biochem. 53, 119162.
Noller, H. F., Kop, J., Wheaton, V., Brosius, J., Gutell, R. R., Kopylov, A. M., Dohme, F., Herr, W., Stahl, D. A., Gupta, R. & Woese, C. R. (1981). Secondary structure model for 23S ribosomal RNA. Nucl. Acids Res. 9, 61676189. Nussinov, R., Pieczenik, G., Griggs, J. R. & Kleitman, D. J. (1978). Algorithms for loop matchings. SIAM J. Appl. Math. 35, 6882.
Oesch, B., Westaway, D., WaƠlchli, M., McKinley, M. P., Kent, S. B. H., Aebersold, R., Barry, R. A., Tempst, P., Teplow, D. B., Hood, L. E., Prusiner, S. B. & Weissmann, C. (1985). A cellular gene encodes scrapie PrP 27-30 protein. Cell, 40, 735746.
Olsen, H. S., Cochrane, A. W., Dillon, P. J., Nalin, C. M. & Rosen, C. A. (1990a). Interaction of the human immunodeficiency virus type 1 Rev protein with a structured region in env mRNA is dependent on multimer formation mediated through a basic stretch of amino acids. Genes Dev. 4, 13571364.
Olsen, H. S., Nelbock, P., Cochrane, A. W. & Rosen, C. A. (1990b). Secondary structure is the major determi-nant for interaction of HIV Rev protein with RNA. Science, 247, 845848.
Pleij, C. W. A., Rietveld, K. & Bosch, L. (1985). A new principle of RNA folding based on pseudoknotting. Nucl. Acids. Res. 13, 17171731.
Prusiner, S. B. (1982). Novel proteinaceous infectious particles cause scrapie. Science, 216, 136144. Prusiner, S. B. (1994). Biology and genetics of prion diseases. Annu. Rev. Microbiol. 48, 655686.
Purvis, I. J., Bettany, A. J., Santiago, T. C., Coggins, J. R., Duncan, K., Eason, R. & Brown, A. J. (1987). The efficiency of folding of some proteins is increased by controlled rates of translation in vivo. A hypothesis. J. Mol. Biol. 193, 413417. Riesner, D. (1991). The search for a nucleic acid component to scrapie infectivity. Semin. Virol. 2, 22152262.
Rietveld, K., van Poelgeest, R., Pleij, C. W. A., van Boom, J. H. & Bosch, L. (1982). The tRNA-like structure at the 3' terminus of turnip yellow mosaic virus RNA. Differences and similarities with canonical tRNA. Nucl. Acids Res. 10, 19291946.
Schaƴzl, H. M., Da Costa, M., Taylor, L., Cohen, F. E. & Prusiner, S. B. (1995). Prion protein gene variation among primates. J. Mol. Biol. 245, 362374.
Schmitz, M. & Steger, G. (1992). Base-pair probability profiles of RNA secondary structures. Comp. Appl. Biosci. 8, 389399.
SchroƠder, H. C., Scheffer, U., Forrest, J. M. S., Seve, A. P., Rytik, P. G. & MuƠller, W. E. G. (1994). Association of scrapie prion protein and prion protein-RNA stemloop with nuclear carbohydrate-binding pro-tein 35 and other RNA-binding proteins. Neurodegen-eration, 3, 177189.
Stahl, N., Baldwin, M. A., Teplow, D. B., Hood, L., Gibson, B. W., Burlingame, A. L. & Prusiner, S. B. (1993). Structural analysis of the scrapie prion protein using mass spectrometry and amino acid sequencing. Biochemistry, 32, 19912002.
Steger, G., Hofmann, H., FoƲtsch, J., Gross, H. J., Randles, J. W., SaƠnger, H. L. & Riesner, D. (1984). Conformational transitions in viroids and virusoids: comparison of results from energy minimization algorithm and from experimental data. J. Biomol. Struct. Dynam. 2, 543571.
Tinoco, I. Jr, Uhlenbeck, O. C. & Levine, M. D. (1971). Estimation of secondary structure in ribonucleic acids. Nature, 230, 363367.
Trifonov, E. N. & Bolshoi, G. (1983). Open and closed 5 S ribosomal RNA, the only two universal structures encoded in the nucleotide sequences. J. Mol. Biol. 169, 113.
Walter, A. E., Turner, D. H., Kim, J., Lyttle, M. H., MuƠller, P., Mathews, D. H. & Zuker, M. (1994). Coaxial stacking of helices enhances binding of oligoribo-nucleotides and improves predictions of RNA folding. Proc. Natl Acad. Sci. USA, 91, 92189222.
Waterman, M. S. & Smith, T. F. (1978). RNA secondary structure: A complete mathematical analysis. Math. Biosci. 42, 257266.
Williams, A. L. & Tinoco, I. Jr (1986). A dynamic programming algorithm for finding alternative RNA secondary structures. Nucl. Acids Res. 14, 299315.
Wills, P. R. (1992). Potential pseudoknots in the PrP-encoding mRNA. J. Theor. Biol. 159, 523527.
Wills, P. R. & Hughes, A. L. (1990). Stem loops in HIV and prion protein mRNAs. J. Acquir. Immune Def. Syndr. 3, 9597.
Wyatt, J. R., Puglisi, J. D. & Tinoco, I. Jr (1989). RNA folding: pseudoknots, loops and bulges. Bioessays, 11, 100106.
Zuker, M. & Stiegler, P. (1981). Optimal computer folding of large RNA sequences using thermodynamics and auxiliary information. Nucl. Acids Res. 9, 133148.
Zuker, M., Jaeger, J. A. & Turner, D. H. (1991). A comparison of optimal and suboptimal RNA secondary structures predicted by free energy minimization with structures determined by phylo-genetic comparison. Nucl. Acids Res. 19, 27072714.
Edited by R. Huber (Received 5 September 1995; received in revised form 26 February 1996; accepted 4 March 1996)
Supplementary material for this paper, comprising one Figure, is available from JMB Online.