multialignread

Read multiple sequence alignment file

Syntax

S = multialingread(File)
[Headers, Sequences] = multialignread(File)

Arguments

File

Multiple sequence alignment file (ASCII text file). Enter a filename, a path and filename, or a URL pointing to a file. File can also be a MATLAB character array that contains the text of a multiple sequence alignment file.

You can read common multiple alignment file types, such as ClustalW (.aln) and GCG (.msf)

Description

S = multialingread(File) reads a multiple sequence alignment file. The file contains multiple sequence lines that start with a sequence header followed by an optional number (not used by multialignread) and a section of the sequence. The multiple sequences are broken into blocks with the same number of blocks for every sequence. (for an example, type open aagag.aln). The output S is a structure array where S.Header contains the header information and S.Sequence contains the amino acid or nucleotide sequences.

[Headers, Sequences] = multialignread(File) reads the file into separate variables Headers and Sequences.

Examples

Read a multiple sequence alignment of the gag polyprotein for several HIV strains.

gagaa = multialignread('aagag.aln')

gagaa = 

1x16 struct array with fields:
    Header
    Sequence

Create a phylogenetic tree with multiply aligned sequences.

Sequences = multialignread('aagag.aln')
distances = seqpdist(Sequences)
tree = seqlinkage(distances)
phytreetool(tree)

See Also

Bioinformatics Toolbox function fastaread, gethmmalignment


© 1994-2005 The MathWorks, Inc.