Home Linus Helper Programs Examples File Descriptions Index

Helper Programs

 


RIBOSOME

FASTA2RIB

PDB2FASTA

VIEWOUT

TORSION

CONTACT

CHASA


Main Menu

 

RIBOSOME

ribosome.py is a program to construct coordinates for a peptide/protein from the amino acid sequence.

Usage: python $LINUS/utils/ribosome.py -p $LINUS/utils/ribosome.dat < [filename].rib > [filename].pdb

where ribosome.dat, is a residue descriptor file. Each residue is described in a Z-matrix style.

All options to the program are passed through a command file [filename].rib. Each command must be given on a separate line. Keywords are case insensitive. All whitespace and tab characters are ignored. All blank lines and lines beginning with a "#" character are ignored. Sample residue descriptor files are given in the Examples section.

FASTA2RIB

fasta2rib.py creates an input file for ribosome.py from an amino acid sequence in FASTA format.

Usage: python $LINUS/utils/fasta2rib.py [filename].fasta

where [filename].fasta is the file containing the sequence in FASTA format. If the file has multiple sequences, only the first sequence is processed.

The output is in a file called [filename].rib.

PDBATOM2FASTA and PDBSEQRES2FASTA

pdbATOM2fasta.py creates a FASTA formatted amino acid sequence file from the ATOM records in a PDB structure file.

pdbSEQRES2fasta.py creates a FASTA formatted amino acid sequence file from the SEQRES records in a PDB structure file.

Usage:
python $LINUS/utils/pdbATOM2fasta.py [filename].pdb

python $LINUS/utils/pdbSEQRES2fasta.py [filename].pdb

where [filename].pdb is a PDB structure file.

The output is [filename].fasta containing the sequence in FASTA format.

VIEWOUT

viewout.py allows one to cycle through the saved structures in the LINUS output file, [filename].out using the molecular graphics program RASMOL.

Usage:
1. In a separate shell window launch RASMOL,
2. Then in the LINUS output directory, enter the command:
python $LINUS/utils/viewout.py [filename].out
3. A menu window will appear that provides choices for the display of the consecutive structures.

TORSION

torsion.py calculates the φ, ψ, ω and all chi angles for a protein structure from a PDB file.

Usage:
python $LINUS/utils/torsion.py pdb[code].ent

The resulting output file, angles.dat, looks like the following:


          SS MS    phi      psi     omega     chi1     chi2
  117 PRO  C P   -89.40    15.89   178.74    35.98   -33.69
  118 ASN  C Q   -87.08    68.06  -176.46   -80.42   -12.81
  119 ASN  T J  -142.64    18.19  -177.29    77.71   136.21
  120 THR  T O   -50.15   -48.72  -176.74   -62.68   999.99
  121 HIS  T J   -97.77    21.09  -178.96   -61.17   -51.21
  122 GLU  H O   -54.79   -49.56  -176.05   179.63   175.71
  123 GLN  H O   -63.61   -43.38   178.65   -70.84   -59.79

with chi angle values extending out as far as necessary to describe a complete side chain conformation.

The SS column indicates the secondary structure using the common H,E,T,C designations for HELIX, EXTENDED, TURN and COIL.

The MS column indicates the mesostate code which is described here.

CONTACT

concnt.py processes the structures written out during a LINUS simulation and lists for each pair of residues the fraction of the structures in which they were in contact. Residues which were never in contact are not listed. The output is written to stdout and consists of 3 columns:

Column 1 - Residue Number
Column 2 - Residue Number
Column 3 - Fraction of structures in which the pair of residues are in contact

Usage:
python $LINUS/utils/concnt.py [filename].pdb [filename].out winmin winmax probe

where:
[filename].pdb = name of pdb file that was input to LINUS simulation
[filename].out = name of file to which sampled structures were written during the course of the simulation
winmin = minimum sequence separation between pairs of residues that are allowed to be in contact
winmax = maximum sequence separation between pairs of residues that are allowed to be in contact
probe = maximum distance between atom surfaces that defines a contact

The resulting output looks like the following:


   4    9 0.030
   4   10 0.010
   9   12 0.060
   9   13 0.040
  10   12 0.090
  10   13 0.050
  10   18 0.020
  12   17 0.060
  12   18 0.010
  13   17 0.090
  13   18 0.010
  17   19 0.260

CHASA

chasa_linus.py estimates the solvation energy of the conformation using the CHASA algorithm. This script is an implementation of the algorithm used on the CHASA webserver (http://roselab.jhu.edu/chasa/) but works with PDB files used in LINUS (which contain unusual atom types). For standard PDB files use either the server or the script which may be downloaded from the server.

Usage:
python $LINUS/utils/chasa_linus.py [linus_type_PDB_file] > chasa.pdb

The resulting output file, chasa.pdb, is a PDB format file which has additional information.


COMPND numintHbd     numSolv    num_nonHbd      total_bb_polar
COMPND     84          147           3              111
.
.
.
ATOM      9  N   THR     2      13.719  19.413  27.573  5.00  0.00
ATOM     10  CA  THR     2      13.088  19.661  26.283  0.00  0.00
ATOM     11  C   THR     2      13.561  18.631  25.300  0.00  0.00
ATOM     12  O   THR     2      14.763  18.432  25.121  3.00  1.57
ATOM     13  CB  THR     2      13.527  20.980  25.667  0.00  7.80
ATOM     14  OG1 THR     2      13.307  22.020  26.627  0.00  5.84
ATOM     15  CG2 THR     2      12.704  21.284  24.409  0.00 16.10
.
.
.
ATOM    149  N   ALA    20       9.346  17.206  29.144  0.00  0.49
ATOM    150  CA  ALA    20       8.985  15.930  29.750  0.00  2.64
ATOM    151  C   ALA    20      10.067  15.607  30.760  0.00  3.52
ATOM    152  O   ALA    20      11.193  16.119  30.686 -1.00  1.77
ATOM    153  CB  ALA    20       8.856  14.815  28.714  0.00  2.39
.
.
.
ATOM    499  O   HOH   120       7.787   1.168  13.283  0.00  0.00
ATOM    500  O   HOH   121       9.368  -0.535   6.166  0.00  0.00
ATOM    501  O   HOH   122      22.918  12.605  15.164  0.00  0.00
ATOM    502  O   HOH   123      16.522  12.543  26.647  0.00  0.00
ATOM    503  O   HOH   124      22.421   8.292  28.255  0.00  0.00
ATOM    504  O   HOH   125      11.087  12.195  10.474  0.00  0.00
TER 1424.592  -52.662

where:

        numintHbd = number of internally hydrogen bonded backbone N and O

        numSolv = number of solvation "waters" (max = 5 per backbone polar group)

        num_nonHbd = number of backbone polar groups not satisfied by hydrogen bonding

        total_bb_polar = number of backbone polar groups in the molecule that should
                         be hydrogen bonded ([2 x Number of residues] -1)

        occupancy column (for N and O only):
            (if > 0.00) = number of solvation "waters" accessible to that atom (max = 5)
            (if = -1.00) = this atom is not hydrogen bond satisfied

        B factor column = CHASA in square angstroms

        TER record:
            (first number) = total CHASA for molecule
            (second number) = solvation free energy for molecule (See below for explanation)

        HOH atoms = solvation "waters" in hydrogen bonding proximity to backbone polar groups
                    (these are the conditional atoms added prior to ASA calculation)

Note 1: The N-terminal nitrogen is not solvated and thus the CHASA value for some atoms proximate to this atom in cartesian space will be standard hydrophobic ASA not conditional hydrophobic ASA.

Note 2: The solvation free energy is calculated as follows:
Total solvation free energy = non-polar + polar solvation free energy, where

Non-polar solvation free energy = total conditional hydrophobic accessible surface area (CHASA) x 0.03
Polar solvation free energy = backbone polar atom solvation free energy, where

Backbone N and O solvation is attempted by placing pseudo water at 5 different postions at appropriate distance and orientation for hydrogen bonding. The number of successfully placed pseudo waters with no steric clash x 0.6 = polar solvation free energy. Backbone oxygens may also be both internally hydrogen bonded AND hydrogen bonded to water. In this case a value of 2.0 is assigned to the solvation free energy of the oxygen.