Common File Formats

互联网2013-12-31

627

Abstract
Table of Contents
Figures
Literature Cited

Abstract

This appendix discusses a few of the file formats frequently encountered in bioinformatics. Specifically, it reviews the rules for generating FASTA files and provides guidance for interpreting NCBI descriptor lines, commonly found in FASTA files. In addition, it reviews the construction of GenBank, Phylip, MSF and Nexus files.

Keywords: file format; FASTA; NCBI descriptor lines; GenBank; Phylip; MSF; Nexus

GO TO THE FULL PROTOCOL:

PDF or HTML at Wiley Online Library

FASTA Files
GenBank Flat Files
Phylip Files
MSF Files
Nexus Files
Converting between File Formats
Disclaimer
Figures
Tables

GO TO THE FULL PROTOCOL:

PDF or HTML at Wiley Online Library

Materials

GO TO THE FULL PROTOCOL:

PDF or HTML at Wiley Online Library

Figures

Figure a0.1B.1 A sample FASTA file that contains the sequences for two homologous proteins, actophorin and yeast cofilin. Note that a greater‐than sign (>) designates the beginning of each entry and that each of the lines of sequence contains less than 80 characters.

View Image

Figure a0.1B.2 A sample GenBank record. Circled numbers identify the fields listed in Table .

View Image

Figure a0.1B.3 A sample PHYLIP‐formatted file. The five sequences shown are HIV‐1 and HIV‐2 gag proteins from a variety of isolates. See text for details.

View Image

Figure a0.1B.4 A sample MSF‐formatted file. The five sequences shown are HIV‐1 and HIV‐2 gag proteins from a variety of isolates. See text for details.

View Image

Figure a0.1B.5 A sample Nexus‐formatted file. The five sequences shown are HIV‐1 and HIV‐2 gag proteins from a variety of isolates. See text for details.

View Image

Videos

Literature Cited

Internet Resources
	http://iubio.bio.indiana.edu/cgi‐bin/readseq.cgi
	ReadSeq biosequence interconversion tool.
	http://www.ebi.ac.uk/clustalw
	ClustalW multiple sequence alignment interface.

GO TO THE FULL PROTOCOL:

PDF or HTML at Wiley Online Library

相关产品推荐

PTPRC/PTPRC蛋白Recombinant Human Receptor-type tyrosine-protein phosphatase C (PTPRC)重组蛋白Leukocyte common antigen蛋白

￥1344

WHEATON Vial File 盒& 取样瓶

询价

CSF2RB/CSF2RB蛋白Recombinant Human Cytokine receptor common subunit beta (CSF2RB) (Active)重组蛋白(CDw131)(GM-CSF/IL-3/IL-5 receptor common beta subunit)(CD antigen CD131)蛋白

￥1368

CD46/CD46蛋白Recombinant Human Membrane cofactor protein (CD46) (Active)重组蛋白TLX (Trophoblast leukocyte common antigen) (CD46)蛋白

￥1368

IL2RG/IL2RG蛋白Recombinant Human Cytokine receptor common subunit gamma (IL2RG) (N75Q)重组蛋白(Interleukin-2 receptor subunit gamma)(IL-2 receptor subunit gamma)(IL-2R subunit gamma)(IL-2RG)(gammaC)(p64)(CD antigen CD132)蛋白

￥1536

Common File Formats

Abstract

Table of Contents

Materials

Figures

Videos

Literature Cited