AllosMod Help Pages

The AllosMod logo is comprised of two probability density functions that are used for the nonbonded distance energy. The green probability density function translates to a single energy minima for interactions in the allosteric site. The blue probability density function translates to a double energy minima for interactions in the rest of the protein, i.e. the regulated region. Varying the radius of the allosteric site changes how the distance energy is distributed across the structure, thereby driving the simulation to sample different regions of the conformational space relevant to the allosteric transition.

For help page to model glycosylated proteins, click here.

For instructions on how to input a sequence into the first page, refer to the AllosMod-FoXS help page

Running AllosMod locally

This web service is simply an interface to the base AllosMod protocol, which can be downloaded and run on any Linux machine or compute cluster.

The page below provides help to run batch jobs. Each input field in single landscape mode has a corresponding file/section below. Single landscape runs will output the files described below corresponding to the options set by the user, i.e. the files from a single landscape run can be uploaded as a batch run.

Server Input: Batch Mode

Email

A notification will be sent once the job is completed.

Name your model

Name your job so you can refer to it later.

Directories zip file

Upload a zip file containing a set of directories for each landscape that you want to model (no more than 100 such landscapes per job). To create such a file, put all the input files into one directory per landscape and type: zip -r input.zip dir1 dir2... For an example directory, click here. Please note that the directory name should not contain any spaces. Each directory should contain the following files:

PDB file(s): all structures used to create the energy landscape for the simulation.

Alignment file: (align.ali) contains one entry for each PDB file. Another entry (named pm.pdb) pertains to the sequence to be simulated. This alignment file should be generated after an alignment procedure, as this alignment will be used to generate restraints for the simulation. Multiple chains can be specified by using a "/" as a separator, the same specifications used in MODELLER. There are many ways to create an alignment file including: 1) MODELLER and 2) ClustalW. A suggested alignment file will be made if the uploaded alignment file is faulty.
***WARNING*** Small errors in the alignment can cause big errors during a simulation due to energy conservation problems. Make sure there are no misalignments in which adjacent residues are aligned far apart in sequence (alignment programs often do this at the beginning or end of chains).

Structure list: (list) contains a list of all PDB files used to create the energy landscape for the simulation. Refer to the LIGPDB and ASPDB options in input.dat to define interactions in the allosteric site.

Ligand File (optional): (lig.pdb) contains the structure of the ligand extracted from a ligand bound PDB file (defined by LIGPDB in input.dat). A radius (rAS) around the ligand is used to define the allosteric site. If lig.pdb is excluded, AllosMod will set up a landscape with as many energy minima as are described by structures in the list file.

Input file: (input.dat) contains one line per parameter as follows:
NRUNS = X

is the number of independent simulations to run. (No more than 100, please.)

is the radius (in Ångstroms) around the coordinates of the ligand that will specify the allosteric site. If the file lig.pdb is included, the allosteric site will be calculated using rAS and the coordinates in LIGPDB. Therefore, lig.pdb must be extracted from LIGPDB.

(Default) A simulation is set up for the user to run on his/her computer. Due to the large number of files generated, AllosMod cannot currently run full simulations without overwhelming our other severs. For help running the simulation, see section below called "To run the simulation."

Sampling is performed using a quick, unequilibrated simulation. This quick sampling will give a representation of the types of conformations that are consistent with the modeled energy landscape. Set "SAMPLING = simulation" to predict the relative populations of the conformations at equilibrium.

As above, sampling is performed using a quick, unequilibrated simulation. In this case, simulations should be performed at high temperature in order to access rarely populated structures. These rarely populated structures are rapidly cooled using only restraints from adjacent residues, which does not change the global structure but allows helices to form etc. Rarely populated structures are defined by their dissimilarity to the first structure in the list file (SCAN_CUTOFF percent of trajectory snapshots are outputted).

is the maximum energy for each pairwise atomic distance contact, typically between 0.09 and 0.12 kcal/mol. If excluded, the value will be assigned according to 3.6*(number of residues/number of distance interactions). See paper for more details.

is the PDB file used to define the allosteric site. AllosMod defines the allosteric site using the distance (rAS) from the effector (lig.pdb) with respect to the LIGPDB coordinates.

is the PDB file used to define the contacts in the allosteric site, i.e. the pairwise atomic distances in ASPDB are used to determine the nonbonded distance energy. As an example, to run an effector unbound simulation: 1) include the effector bound and unbound PDB files in align.ali and list, 2) set ASPDB to the effector unbound PDB file, and 3) set LIGPDB to the effector bound PDB file.

is the distance (in Ångstroms) that the atoms will be randomized when creating the initial structure (default is 1-10 Å depending on simulation type).

is the temperature (in degrees Kelvin) for the simulation (default is 300 K). Set MDTEMP to "scan" and the simulation temperature will alternate between 300 K, 350 K, 400 K, 450 K, and 500 K. Therefore, directory 0 will have a 300 K simulation, directory 1 will have a 350 K simulation, and so on until directory 5 that will restart the sequence with a 300 K simulation.

is the percent of trajectory snapshots outputted if sampling is moderate_am_scan. The default is 10 percent, i.e. 0.1*NRUNS structures will be outputted selected by differing the most from the inputted PDB structure.

is an option to include chemical frustration (Weinkam et al. 2009 Biochemistry, p2394-2402). Chemical frustration is modeled by breaking all interactions involving buried, charged residues. Regions with many buried, charged residues will have high conformational variability.

If BREAK=True, this number is used to scale the contacts with residues that cause chemical frustration.

If BREAK=True, this number is used to select which residues cause chemical frustration. ZCUTOFF is the z-score cut off of the distribution involving the number of charged contacts per residue; residues with a z-score above this threshold are predicted to cause chemical frustration.

If set to True, secondary structure, corresponding to the input PDB files, will have increased stability in the simulation. Increased stability is maintained by increasing the energy by a factor of 10 for all C alpha-C alpha contacts between 2 and 5 residues apart.

is an option to coarse grain the energy landscape by restricting the nonbonded distance energy to include C alpha and C beta atoms only. This allows very large proteins to be simulated without overwhelming the computer's memory. This option is automatically set to True for proteins over 1500 residues.

here

Alter residue contact energies (optional):

Server Output

The output is a zip file containing the same directories that were uploaded and subdirectories for each simulation. Each subdirectory will contain a MODELLER script (model_run.py), a unique starting structure (random.ini), a restraint file containing interactions that will define the energy landscape (converted.rsr), log files, and other output files. If you opted to run short, unequilibrated simulations via the server then each subdirectory will contain pdb files named pm.pdb.B1*. If you opted to set up a longer simulation to be run locally, then see the following section.

***To run the simulation

To run, navigate to a run directory. Each run directory contains a unique starting structure (random.ini) and a unique run script (model_run.py), then type: "$MODELLER_HOME/bin/mod${MODELLER_VERSION} model_run.py". The script will output 2000 snapshots by default (pm.pdb.B10010001.pdb to pm.pdb.B30000001.pdb) as well other data files. AllosMod implements the Automodel class used in MODELLER for comparative modeling, however the optimization and refinement steps have been modified to allow for constant temperature molecular dynamics simulations. For each run directory, there is a file allosmod.py. This file contains the method consttemp that controls the simulation schedule and temperature (MDtemp). For help with script details, refer to the MODELLER help pages.

To analyze the simulation

Download a zip file containing example directories and analysis scripts to run on any linux/unix based machine (click here). The allosmod_analysis.sh script will check the simulations for possible errors as well as output several files including: 1) energies 2) Boltzmann weighted probabilities 3) local structural similarity metrics Qi. See README file for a full description.

Allostery Model Help

Developers:

Acknowledgements:

AllosMod Help Pages

Running AllosMod locally

Server Input: Batch Mode

Email

Name your model

Directories zip file

Server Output

***To run the simulation

To analyze the simulation