Identification of Motifs in Protein Sequences
Abstract
This brief appendix serves as a guide for the analysis of functional motifs in proteins. Several database search engines that can be accessed via the World Wide Web are described. Such computerized searches have become the preferred method to scan large sequence and motif databases, as the searches are efficient and the databases are updated frequently. A short list of sorting signals is also included, since these motifs often cannot be predicted reliably by a computer search.
Table of Contents
- Databases and Servers on the WWW
- Analysis Example
- Sorting Signals
- Figures
- Tables
Figures
- Figure A.1C.1 Sample output from a BLASTP search on the NCBI BLAST server, showing the highest‐scoring hits for the C. elegans protein R151.5. The graphical display draws a shaded bar for each database sequence; multiple matches to the same sequence are joined by a hatched bar. Matches to different sequences may be packed onto one line for compactness. The shading of each bar reflects the score of the match, according to the key at the top. Pointing the cursor at a bar displays the description of the matching sequence in the text box above (here, thrombospondin). The bottom of the figure lists the hits. Clicking a link in the left column leads to the record for that sequence, while following a link on the right gives the alignment of the query sequence with that hit.
- Figure A.1C.2 Sample WWW output of a Pfam search with R151.5. Links are provided to browse the documentation for each of the matching HMM families. The score threshold had to be lowered slightly from the default of 15 bits to detect the EGF domain. The E value in the top bar is the number of matches expected by chance at the indicated score(s).
- Figure A.1C.3 Schematic combined representation of the results of searching R151.5 for protein domains with Pfam, Prints, Blocks, and Prosite. Matches to zinc‐metalloproteases (Zn, including the astacin subfamily), EGF, CUB, and thrombospondin type 1 (tsp‐1) are indicated. Half‐height boxes mark matches that were deemed false after manual inspection, and quarter‐height boxes mark matches to Prosite patterns known to occur spuriously, such as phosphorylation and myristylation sites. For the sake of comparison, only matches to entries in the Blocks database are shown on line three; the Blocks WWW server can also report matches to Prints families. Likewise, only matches to Prosite patterns are shown here; the Prosite server also contains profile entries for the CUB and thrombospondin type 1 domains.
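Short sequence patterns of the kind discussed above, including sorting signals, can be scanned for with ordinary regular expressions. The following is only a minimal sketch, not the method used by any of the servers described here. The two patterns are taken from the titles of works in the Literature Cited (NPXY, Chen et al., 1990; C‐terminal HDEL, Gomord et al., 1997); the names `MOTIFS` and `scan_motifs` and the toy sequence are hypothetical and chosen purely for illustration.

```python
import re

# Illustrative patterns only (assumption: X means "any residue").
# Curated motif definitions belong to databases such as Prosite.
MOTIFS = {
    "NPXY (internalization signal)": re.compile(r"NP.Y"),
    "C-terminal HDEL (ER retention)": re.compile(r"HDEL$"),
}


def scan_motifs(sequence):
    """Return (motif name, 1-based start, matched text) for every hit."""
    seq = sequence.upper()
    hits = []
    for name, pattern in MOTIFS.items():
        for match in pattern.finditer(seq):
            hits.append((name, match.start() + 1, match.group()))
    return hits


if __name__ == "__main__":
    toy_sequence = "MKTNPVYAAGSHDEL"  # made-up sequence, not a real protein
    for hit in scan_motifs(toy_sequence):
        print(hit)
```

Because such patterns are only a few residues long, they match many unrelated sequences by chance, so a hit from a scan like this is a candidate for manual inspection rather than evidence of function.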
Literature Cited
Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., and Lipman, D.J. 1997. Gapped BLAST and PSI‐BLAST: A new generation of protein database search programs. Nucl. Acids Res. 25:3389‐3402.
Attwood, T.K., Beck, M.E., Flower, D.R., Scordis, P., and Selley, J.N. 1998. The PRINTS protein fingerprint database in its fifth year. Nucl. Acids Res. 26:304‐308.
Bairoch, A., Bucher, P., and Hofmann, K. 1997. The PROSITE database, its status in 1997. Nucl. Acids Res. 25:217‐221.
Blattner, J., Dorsam, H., and Clayton, C.E. 1995. Function of N‐terminal import signals in trypanosome microbodies. FEBS Lett. 360:310‐314.
Chen, W.J., Goldstein, J.L., and Brown, M.S. 1990. NPXY, a sequence often found in cytoplasmic tails, is required for coated pit‐mediated internalization of the low‐density lipoprotein receptor. J. Biol. Chem. 265:3116‐3123.
Claros, M.G. 1995. MitoProt, a Macintosh application for studying mitochondrial proteins. Comput. Appl. Biosci. 11:441‐447.
Claros, M.G., Brunak, S., and von Heijne, G. 1997. Prediction of N‐terminal protein sorting signals. Curr. Opin. Struct. Biol. 7:394‐398.
Cline, K. and Henry, R. 1996. Import and routing of nucleus‐encoded chloroplast proteins. Annu. Rev. Cell Dev. Biol. 12:1‐26.
Colley, K.J. 1997. Golgi localization of glycosyltransferases: More questions than answers. Glycobiology 7:1‐13.
Corbett, A.H. and Silver, P.A. 1997. Nucleocytoplasmic transport of macromolecules. Microbiol. Mol. Biol. Rev. 61:193‐211.
Dalbey, R.E., Lively, M.O., Bron, S., and van Dijl, J.M. 1997. The chemistry and enzymology of the type I signal peptidases. Protein Sci. 6:1129‐1138.
Gavel, Y. and von Heijne, G. 1990a. Cleavage‐site motifs in mitochondrial targeting peptides. Protein Eng. 4:33‐37.
Gavel, Y. and von Heijne, G. 1990b. A conserved cleavage‐site motif in chloroplast transit peptides. FEBS Lett. 261:455‐458.
Gomord, V., Denmat, L.A., Fitchette‐Laine, A.C., Satiat‐Jeunemaitre, B., Hawes, C., and Faye, L. 1997. The C‐terminal HDEL sequence is sufficient for retention of secretory proteins in the endoplasmic reticulum (ER) but promotes vacuolar targeting of proteins that escape the ER. Plant J. 11:313‐325.
Henikoff, S., Pietrokovski, S., and Henikoff, J.G. 1998. Superior performance in protein homology detection with the Blocks Database servers. Nucl. Acids Res. 26:309‐312.
Jackson, M.R., Nilsson, T., and Peterson, P.A. 1990. Identification of a consensus motif for retention of transmembrane proteins in the endoplasmic reticulum. EMBO J. 9:3153‐3162.
Keller, G.A., Krisans, S., Gould, S.J., Sommer, J.M., Wang, C.C., Schliebs, W., Kunau, W., Brody, S., and Subramani, S. 1991. Evolutionary conservation of a microbody targeting signal that targets proteins to peroxisomes, glyoxysomes, and glycosomes. J. Cell Biol. 114:893‐904.
Marks, M.S., Ohno, H., Kirchhausen, T., and Bonifacino, J.S. 1997. Protein sorting by tyrosine‐based signals: Adapting to the Ys and wherefores. Trends Cell Biol. 7:124‐128.
Neupert, W. 1997. Protein import into mitochondria. Annu. Rev. Biochem. 66:863‐917.
Nielsen, H., Engelbrecht, J., Brunak, S., and von Heijne, G. 1997. Identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites. Protein Eng. 10:1‐6.
Pearson, W.R. 1996. Effective protein sequence comparison. Methods Enzymol. 266:227‐258.
Pearson, W.R., Wood, T., Zhang, Z., and Miller, W. 1997. Comparison of DNA sequences with protein sequences. Genomics 46:24‐36.
Recipon, H.E., Schuler, G.D., and Boguski, M.S. 1995. Sequence Similarity Searching Using the BLAST Family of Programs. In Current Protocols in Molecular Biology (F.M. Ausubel, R. Brent, R.E. Kingston, D.D. Moore, J.G. Seidman, J.A. Smith, and K. Struhl, eds.) pp. 19.3.1‐19.3.38. John Wiley & Sons, New York.
Sandoval, I. and Bakke, O. 1994. Targeting of membrane proteins to endosomes and lysosomes. Trends Cell Biol. 4:292‐297.
Schwarz, E. and Neupert, W. 1994. Mitochondrial protein import: Mechanisms, components and energetics. Biochim. Biophys. Acta 1187:270‐274.
Sommer, J.M. and Wang, C.C. 1994. Targeting proteins to the glycosomes of African trypanosomes. Annu. Rev. Microbiol. 48:105‐138.
Sonnhammer, E.L., Eddy, S.R., Birney, E., Bateman, A., and Durbin, R. 1998. Pfam: Multiple sequence alignments and HMM‐profiles of protein domains. Nucl. Acids Res. 26:320‐322.
Tikkanen, R., Peltola, M., Oinonen, C., Rouvinen, J., and Peltonen, L. 1997. Several cooperating binding sites mediate the interaction of a lysosomal enzyme with phosphotransferase. EMBO J. 16:6684‐6693.
Trowbridge, I.S., Collawn, J.F., and Hopkins, C.R. 1993. Signal‐dependent membrane protein trafficking in the endocytic pathway. Annu. Rev. Cell Biol. 9:129‐161.
Udenfriend, S. and Kodukula, K. 1995. How glycosylphosphatidylinositol‐anchored membrane proteins are made. Annu. Rev. Biochem. 64:563‐591.
von Heijne, G. 1996. Computer‐assisted identification of protein sorting signals and prediction of membrane protein topology and structure. Adv. Computat. Biol. 2:1‐14.
Waterham, H.R. and Cregg, J.M. 1997. Peroxisome biogenesis. BioEssays 19:57‐66.