X-PLOR - Methodology

X-PLOR

2 Methodology

This chapter provides a brief account of the basic ideas, practical consequences and implementation of the new features in this release of the X-PLOR software for X-ray crystallographic refinement and NMR structure determination.

X-ray crystallographic refinement

Torsion angle dynamics for structure refinement

Background

Conventional simulated annealing refinement treats each individual atom as an independent entity that moves subject to the forces acting upon it. However, the force constants that govern bond lengths and angles are relatively large and permit little variation from ideal values. Thus, it is natural to consider torsion angles as appropriate degrees of freedom with which to productively search the main conformational space available to the molecule and divide the structure into a set of completely rigid torsion groups.

By eliminating bond lengths and angles as degrees of freedom from the molecule it is also possible to use much larger time steps for integrating the equations of motion with torsion angle dynamics than is possible with conventional molecular dynamics. The most useful consequence of the ability to run stable refinements with large time steps is that difficult refinement problems, which require simulated annealing at high temperatures in order to escape from incorrect conformations, can be tackled. For conventional molecular dynamics the computational cost of high temperature refinements is prohibitively high because very small time steps are required to control vibrations along the bond lengths and angles.

The application of torsion angle dynamics to X-ray crystallographic refinement has been shown to give somewhat improved results over conventional simulated annealing methods at a given temperature (Rice and Brünger 1994). More significantly, torsion angle refinements starting at very high temperatures give converged refinements for poor starting models -- a result that can not be achieved using conventional simulated annealing protocols confined to lower temperatures.

The code for the torsion angle dynamics algorithm in X-PLOR (Rice and Brünger 1994), released in interim form in X-PLOR 3.851, has now been optimized to give better performance. For a 120-amino-acid protein, the integration of the equations of motion is approximately 40% faster than in the original implementation.

_A (Read 1986) taken from a set of cross-validation test data.

Practical tests involving minimizations of misfit molecular models with the maximum likelihood targets and the conventional residual target show that better convergence (lower Rfree) and reduced bias (smaller difference between R and R_free) is obtained with the maximum likelihood targets (Pannu and Read 1996). Tests in which a maximum likelihood target was used in conjunction with torsion angle dynamics showed that a greater radius of convergence could be achieved than with other refinement methods (Adams et al. 1997).

More recently, a new maximum likelihood target, the structure factor amplitude with Hendrickson-Lattman phase probability coefficients, has been developed and this target is also available in the X-PLOR program. This target makes optimal use of all the experimental information that is available since the phase probability coefficients correctly model the uncertainty in the experimentally determined values of the phase angle (that is., as determined from MIR or MAD data). Preliminary tests of the maximum likelihood target with phase probability information (Pannu and Read, unpublished results) give very promising results.

Implementation

The maximum likelihood targets are specified using the target keyword within the xrefin parameter block (example: target=MLF1). The three maximum likelihood targets currently supported are:

MLF1, the amplitude based target
MLF2, the intensity based target
MLHL, a target which makes use of Hendrickson-Lattman phase probability coefficients

The least-squares residual (target=resid) is no longer the default method for refinement with the X-PLOR program. The new default target is MLF2, the intensity based maximum likelihood target. This target may be used for all types of positional refinement and for the refinement of individual atomic thermal parameters. Note that the maximum likelihood target is not used for the special case of an overall temperature factor refinement or for refinements that make use of explicit phase values, which are flagged by a value of the phase weight parameter, wp> 0.0. For these two cases the default target is automatically reset to resid.

There are two other parameters within the xrefin target block, siga and mbins, which can alter the behavior of maximum likelihood refinement.

The siga parameter provides the ability to update the _A estimate that is used in the calculation of the maximum likelihood target. The siga parameter may be set to fix, next, or refi=<integer>.

The siga fix option (the default) fixes the estimates for _A at the values used at the beginning of the refinement. To update the estimates for _A after some cycles of refinement the siga next option may be used. The siga refi = <integer> option is used to update the estimates for the _A values after the given number of structure factor calculations. This latter option should be used with caution because frequent updating sometimes causes an increase in E_xref which can cause the line search to be abandoned in minimization.

The mbins parameter sets the number of resolution bins used for estimation of the _A values. The default value of mbins for other calculations with X-PLOR that use this parameter (for example, resolution-dependent tabulation of R-values -- see page 164 of the X-PLOR 3.1 manual) is 8. If maximum likelihood refinement is performed the default value for this parameter will automatically change to the number of reflections divided by 1000 or the number of cross-validation reflections divided by 50, whichever is the smaller. This new default will usually be a good estimate for the optimal value of mbins for the refinement. Any value of mbins that is explicitly set in the X-PLOR script will override these default modes of operation.

As in refinements against the other targets, a weight parameter, wa, is needed to scale the gradients in the X-ray energy to the gradients in the chemical energy. The value of the wa can be estimated in the same way as for the other targets, by using a script (/tutorial/xtalrefine/check.inp) which carries out a short free dynamics run and then calculates the ratios of the chemical and X-ray energy gradients. Since the maximum likelihood refinements make use of (internally) normalized data, the correct value of wa is usually orders of magnitude smaller than the value that would be used for a refinement with the least-squares residual. Note that none of the maximum likelihood targets uses the phase weight parameter, wp, (the MLHL refinement target deals directly with phase probabilities, not any explicit value for a phase angle) so wp should either be set to zero or omitted from the refinement script.

It should be noted that a set of cross-validation data is an absolute requirement for maximum likelihood refinement. If the cross-validation reflections are not flagged in the input data file and are not explicitly set up in the refinement script, 10% of the data will be automatically used for cross-validation purposes.

Example scripts for refinement using the maximum likelihood targets are /tutorial/xtaltorsion/torsion_slow_ml.inp for co-ordinate refinements and /tutorial/xtalrefine/brefinement_ml.inp for temperature factor refinements.

Andersen thermal coupling

Background

Simulated annealing calculations with the X-PLOR program have previously used the Berendsen thermal coupling method to control the temperature of the system (see pages 130-131 of the X-PLOR 3.1 manual). Temperature control with the Berendsen method is obtained by adding a force to each atom that is proportional to the individual atomic velocity. The overall scale constant for this force depends on the difference between the current temperature of the system and the desired temperature.

X-PLOR now also offers the option of temperature control by the Andersen thermal coupling method (Andersen 1980). In the Andersen method a set of atoms is randomly selected from the molecule and their velocities are replaced by randomly generated velocities selected from a Boltzmann velocity distribution at the desired temperature. The physical process that is modeled by the Andersen thermal coupling algorithm is a collision of a particle in the system with a particle in a heat bath at the desired temperature.

Torsion angle dynamics for structure refinement

Background

Molecular dynamics control

Maximum likelihood targets for structure refinement

Background

Tutorial

How to obtain phases for the non-anomalous structure factor

Data conversion

Background

Torsion angle dynamics for NMR structure determination

Background

Iterative assignment scheme

Direct coupling constant refinement

Direct secondary carbon chemical shift refinement

Direct 1H chemical shift refinement

Fast refinement using direct NOEs

Background