Read Aligned Structure Data
Read aligned PDB structures and store their C-alpha atom data, including xyz coordinates, residue numbers, residue type and B-factors.
read.fasta.pdb(aln, prefix = "", pdbext = "", fix.ali = FALSE, pdblist=NULL, ncore = 1, nseg.scale = 1, progress = NULL, ...)
aln |
an alignment data structure obtained with
|
prefix |
prefix to aln$id to locate PDB files. |
pdbext |
the file name extention of the PDB files. |
fix.ali |
logical, if TRUE check consistence between |
pdblist |
an optional list of |
ncore |
number of CPU cores used to do the calculation.
|
nseg.scale |
split input data into specified number of segments
prior to running multiple core calculation. See |
progress |
progress bar for use with shiny web app. |
... |
other parameters for |
The input aln
, produced with read.fasta
, must
have identifers (i.e. sequence names) that match the PDB file
names. For example the sequence corresponding to the structure
“1bg2.pdb” should have the identifer ‘1bg2’. See
examples below.
Sequence miss-matches will generate errors. Thus, care should be taken to ensure that the sequences in the alignment match the sequences in their associated PDB files.
Returns a list of class "pdbs"
with the following five
components:
xyz |
numeric matrix of aligned C-alpha coordinates. |
resno |
character matrix of aligned residue numbers. |
b |
numeric matrix of aligned B-factor values. |
chain |
character matrix of aligned chain identifiers. |
id |
character vector of PDB sequence/structure names. |
ali |
character matrix of aligned sequences. |
resid |
character matrix of aligned 3-letter residue names. |
sse |
character matrix of aligned helix and strand secondary structure elements as defined in each PDB file. |
call |
the matched call. |
The sequence character ‘X’ is useful for masking unusual or unknown residues, as it can match any other residue type.
Barry Grant
Grant, B.J. et al. (2006) Bioinformatics 22, 2695–2696.
# Redundant testing excluded # Read sequence alignment file <- system.file("examples/kif1a.fa",package="bio3d") aln <- read.fasta(file) # Read aligned PDBs pdbs <- read.fasta.pdb(aln) # Structure/sequence names/ids basename( pdbs$id ) # Alignment positions 335 to 339 pdbs$ali[,335:339] pdbs$resid[,335:339] pdbs$resno[,335:339] pdbs$b[,335:339] # Alignment C-alpha coordinates for these positions pdbs$xyz[, atom2xyz(335:339)] # See 'fit.xyz()' function for actual coordinate superposition # e.g. fit to first structure # xyz <- fit.xyz(pdbs$xyz[1,], pdbs) # xyz[, atom2xyz(335:339)]
Please choose more modern alternatives, such as Google Chrome or Mozilla Firefox.