Using novel variable transformations to enhance conformational sampling in molecular dynamics next up previous
Next: About this document

Zhongwei Zhu and Mark E. Tuckerman

Dept. of Chemistry and Courant Institute of Mathematical Sciences
New York University, New York NY 10003

Glenn J. Martyna

Dept. of Chemistry,
Indiana University, Bloomington, IN 47405

Using novel variable transformations to enhance conformational sampling in molecular dynamics

One of the ``grand challenges'' is to develop methodology capable of providing an accurate description of conformational equilibria in systems with rough energy landscapes via computer simulation. If met, many important problems, most notably protein folding, could be impacted. Here, a significant step forward is made by combining with molecular dynamics a novel variable transformation designed to warp configuration space such that barriers are reduced and basins are enlarged while equilibrium properties are preserved. Thus, barriers are readily crossed and sampling is very efficient.

One of the premier challenges in molecular simulation is to determine accurately and efficiently conformational equilibria of systems described by complex potential surfaces that possess high energy barriers separating important minima or basins of attraction. Indeed, rough energy landscapes are ubiquitous in physical, chemical and biological problems, including protein folding. Despite recent advances [, ] this important class of problems cannot, yet, be adequately addressed by molecular simulation.

Current approaches include umbrella sampling, guiding potentials, multicanonical algorithms, and rate-enhancing schemes. The first two techniques reweight the phase space distribution in order to accelerate sampling at the price of requiring an a posteriori correction. Thus, the efficiency scales exponentially poorly with the number of degrees of freedom involved in the reweighting factor. Parallel tempering methods introduce a series of M simulations at different temperatures which are permitted to interchange. Thus, the cost of the calculation is M-fold larger, and the number of temperature interchanges accepted scales exponentially poorly with the system size. Other methods that rely on harmonic transition state approximations do not, in general, preserve equilibrium properties.

In this letter, a novel approach to the conformational sampling problem that is free of the above difficulties is presented. The method is based on an exact reformulation of the classical statistical mechanical partition function via a nonlinear transformation of the integration variables. The transformation is constructed so as to effect a warping of the configuration space wherein barrier regions are shrunk, barrier heights are lowered, and attractive basins stretched. The result is a smoother surface, which allows barriers to be easily traversed without i) altering equilibrium properties of the system, ii) requiring an a posteriori reweighting, iii) relying on harmonic approximations, or iv) scaling exponentially poorly with the number of transformed degrees of freedom. The technique is combined, here, with molecular dynamics (MD) to form a novel conformational sampling scheme, although it could also be be used with Monte Carlo. Moreover, the new approach could be combined with complementary transformations [] that help overcome entropic barriers as well as employed to enhance the efficiency of other techniques such as parallel tempering. Applications of the new method to model and complex systems as large as 400-mer alkane chains are considered in order to demonstrate its favorable scaling with system size and its enhancement in sampling efficiency (a factor of tex2html_wrap_inline422 over standard MD for the long chains, as will be seen shortly).

Consider, first, a classical particle with momentum, p, mass, m, and coordinate, x, in a one-dimensional potential, V(x), which possesses two stable minima, separated by a large energy barrier. The canonical partition function is

  equation25

where h is Planck's constant, and tex2html_wrap_inline434 . The spatial probability distribution function, tex2html_wrap_inline436 , appearing in Eq. (1), could be sampled using a classical Hamiltonian, tex2html_wrap_inline438 , in conjunction with thermostatted MD []. However, if the potential barrier is too high, barrier crossing events will be rare, and extremely long trajectories will be needed to generate an adequate sampling of the configuration space.

Since the variable, x, is just an integration variable, a transformation to a new variable, u=f(x), can be performed arbitrarily without altering the partition function in Eq. (1), assuming that the inverse tex2html_wrap_inline444 exists and is single-valued. Upon changing variables and exponentiating the Jacobian factor g'(u), the partition function becomes

  equation35

The key feature of Eq. (2) is the appearance of an effective potential

equation42

that can be adjusted via the transformation to be smoother than V(x). It is important to note that this approach completely eliminates the need for a posteriori reweighting factors required by the umbrella and guiding potential methods and does not introduce harmonic approximations. It is, therefore, not equivalent to any other existing schemes.

A spatial warping transformation is achieved via the following change of variables:

  equation46

Here, tex2html_wrap_inline450 is an arbitrary reference potential, tex2html_wrap_inline452 is an arbitrary point, and c is a constant. Since the integrand in Eq. (4) is nonnegative and u is a monotonically increasing function of x, a unique inverse x=g(u) exists. The Jacobian of the transformation is

equation51

Substituting Eq. (4) and the Jacobian into Eq.\ (2) gives an effective potential tex2html_wrap_inline462 . If tex2html_wrap_inline450 is taken to be equal to the bare potential, V(x), in the barrier region and zero outside and tex2html_wrap_inline452 is taken to be the left minimum ( tex2html_wrap_inline470 ), then tex2html_wrap_inline472 is barrier free (see Fig. 1(a)). Thus, the Jacobian weights the ``u'' space so as to reduce the barrier region to a negligibly small volume without altering the partition function. We refer to the new approach as REPSWA (REference Potential Spatial Warping Algorithm).

   figure58
Figure 1: (a) Schematic of REPSWA for a double-well potential. (b,d) x(t) in the double well potential ( tex2html_wrap_inline392 ) without(b)/with(d) REPSWA. (c,e) Probability distribution without(c)/with(e) REPSWA (dashed lines) and analytical result (solid line).

The reformulated partition function, Eq. (2), can be sampled by a canonical MD approach based on a Hamiltonian,

equation73

forming the REPSWA-MD method. Since the transformation is not canonical, tex2html_wrap_inline480 generates trajectories which sample phase space more effectively than those of the original Hamiltonian,

equation78

That is, the true dynamics, in which barrier crossing is, by definition, a rare event, is not, nor is intended to be, preserved.

Since the integral in Eq. (4) cannot generally be evaluated analytically, the function, tex2html_wrap_inline482 , is expressed a finite basis of integrable functions, thereby defining a transformation tex2html_wrap_inline484 , so that the integrals can be performed. The function, tex2html_wrap_inline486 , will be very close to f(x) in Eq. (4) for a large enough basis set. This is a key step that allows the REPSWA-MD to be applied beyond the harmonic approximation. Clearly, the transformation, tex2html_wrap_inline484 also exactly preserves the partition function.

The REPSWA-MD technique is first applied to a one-dimensional analytically solvable system in order to demonstrate its validity and performance. A double well potential of the form

equation85

is selected, and canonical MD simulations are performed for a high barrier, tex2html_wrap_inline392 . The equations of motion for tex2html_wrap_inline494 are coupled to a Nosé-Hoover chain thermostat [] to effect the canonical sampling. Runs of length of 10 tex2html_wrap_inline496 steps were performed using a time step of tex2html_wrap_inline498 ( tex2html_wrap_inline500 ,m=1 and a=1). Figure 1 shows trajectories x(t)=g(u(t)) with and without the REPSWA transformation as well as the probability distribution function, P(x). The ability of the REPSWA technique to reproduce the correct distribution, P(x), demonstrates that the partition function is perfectly preserved. In contrast, ordinary canonical MD based on H is unable to generate the correct distribution within tex2html_wrap_inline422 steps indicating, quantitatively, an increase in efficiency of over 6 orders of magnitude using REPSWA.

Next, the nontrivial extension of the REPSWA method to enhance sampling of multidimensional potential surfaces is discussed. In particular, chain molecules possess many dihedral barriers in the range of 5-10 tex2html_wrap_inline516 , which, as the previous example has demonstrated, are traversed infrequently, hindering conformational sampling. It will first be shown how to treat the commonly used ``united atom'' (UA) molecular models (e.g. CH tex2html_wrap_inline518 -like moieties are merged into pseudoatoms) [], and then the additional steps needed to treat all-atom models [] will be discussed.

Dihedral barriers can be removed in united atom models by applying the REPSWA transformation (cf. Eq. (4)) directly to the dihedral angle. However, since dihedral angles are not explicit coordinates in MD, additional steps are needed in order to apply the technique. Consider a UA representation of butane, and assume the connectivity is C tex2html_wrap_inline520 -C tex2html_wrap_inline522 -C tex2html_wrap_inline524 -C tex2html_wrap_inline404 , where C tex2html_wrap_inline528 , the ith pseudoatom, has Cartesian position tex2html_wrap_inline532 . The transformation scheme is: (i) the vector tex2html_wrap_inline536 is rotated/translated into a frame in which tex2html_wrap_inline538 is at the origin and tex2html_wrap_inline540 lies along the z axis, (ii) the new tex2html_wrap_inline546 is then resolved into spherical polar coordinates, tex2html_wrap_inline548 , (iii) REPSWA is applied to the azimuthal angle to generate tex2html_wrap_inline552 , (iv) a Cartesian tex2html_wrap_inline556 is then created using tex2html_wrap_inline558 , (v) tex2html_wrap_inline556 is placed back into the original frame by inverting the transformations (i) and (ii) to obtain a transformed set of Cartesian coordinates. The constant c is adjusted to ensure that tex2html_wrap_inline570 . The reference potential in step (iii) is chosen to be equal to the dihedral potential between the two gauche conformations and zero otherwise. The forces on the tex2html_wrap_inline574 are obtained by the chain rule. This procedure allows for facile transformation of all the dihedral angles in an N-pseudoatom chain by moving sequentially down the chain. This procedure forms an upper triangular Jacobian matrix whose determinant is the product of the exponentials of the reference potentials, which cancel the desired barriers, while preserving the partition function. Both the coordinate and force transformations can be carried out in tex2html_wrap_inline578 operations.

The behavior of the dihedral REPSWA scheme is studied using the Ryckaert-Bellemans model [] for a 400-mer chain at T=300K (ignoring intermolecular Lennard-Jones interactions to avoid freezing the chain). It should be noted that guiding potentials/umbrella sampling could not be used to enhance the sampling of all 397 dihedral angles. The molecule was simulated using canonical molecular dynamics in the gas phase with a time step of 0.5 fs for 10 tex2html_wrap_inline496 steps with and without the REPSWA dihedral angle transformation. Figure 2 demonstrates the large improvement in sampling efficiency given by REPSWA. The evolution of the end-to-end distance, tex2html_wrap_inline400 ( tex2html_wrap_inline416 Å, N=400) (Fig. 2(c), (f)) indicates that the fluctuations in this quantity are virtually nonexistent without REPSWA. Quantitatively, the number of dihedral angle flips per torsion increases by a factor of 30, and the correlation time for the relaxation of tex2html_wrap_inline590 decreases by 6 orders of magnitude! Note, the REPSWA calculation was found to be in good agreement with the prediction ( tex2html_wrap_inline592 ) of a three state rotational-isomeric state (RIS) model [], parameterized using our potential.

   figure106
Figure 2: (a,d) Dihedral angle, tex2html_wrap_inline394 , in the UA 400-mer without(a)/with(d) REPSWA. (b,e) Number of dihedrals, tex2html_wrap_inline396 , that have crossed barriers tex2html_wrap_inline398 times without(b)/with(e) REPSWA. (c,f) End-to-end distance ( tex2html_wrap_inline400 , tex2html_wrap_inline402 Å) without(c)/with(f) REPSWA.

Last, it is shown how REPSWA can be used to simulate ``all-atom'' models of chain molecules with high efficiency (see Fig. 3(a)). In all-atom models, the backbone C tex2html_wrap_inline520 -C tex2html_wrap_inline522 -C tex2html_wrap_inline524 -C tex2html_wrap_inline404 dihedral angle is strongly coupled to the H tex2html_wrap_inline520 -C tex2html_wrap_inline520 -C tex2html_wrap_inline522 -C tex2html_wrap_inline524 , H tex2html_wrap_inline522 -C tex2html_wrap_inline520 -C tex2html_wrap_inline522 -C tex2html_wrap_inline524 , and H tex2html_wrap_inline524 -C tex2html_wrap_inline520 -C tex2html_wrap_inline522 -C tex2html_wrap_inline524 dihedrals due to the tex2html_wrap_inline636 hybridization of the carbon atoms (represented by harmonic bond and bend angle potentials). The rigidity caused by the hybridization requires groups of atoms to be transformed as a rigid units. Groups of three atoms (e.g. C tex2html_wrap_inline524 , H tex2html_wrap_inline404 and H tex2html_wrap_inline642 form such a group in Fig. 3(a)) are selected, and a ``primary'' dihedral angle (e.g. H tex2html_wrap_inline520 -C tex2html_wrap_inline520 -C tex2html_wrap_inline522 -C tex2html_wrap_inline524 ), defined. A set of ``secondary'' dihedrals is formed by collecting dihedral angles involving atoms both in and to the left of the group (e.g. eight). The reference potential is then taken to be

equation130

where tex2html_wrap_inline652 and tex2html_wrap_inline654 are the primary and secondary dihedral potentials, respectively, tex2html_wrap_inline656 is a function that switches off the potential outside the region between the two gauche conformations, and tex2html_wrap_inline658 is a set of constants chosen based on the ideal differences between the secondary and primary dihedral angles for tex2html_wrap_inline636 geometry. In a simulation, the actual secondary dihedrals will generally not differ much from tex2html_wrap_inline662 . The full REPSWA transformation procedure is then the same as that for the united atom case, except that three vectors tex2html_wrap_inline664 , tex2html_wrap_inline666 and tex2html_wrap_inline668 are rotated about the C tex2html_wrap_inline520 -C tex2html_wrap_inline522 axis.

The performance of the group REPSWA transformation scheme is tested on an all-atom 400-mer using the CHARMM22 force field []. The simulation details are the same as in the united atom case (see above). Figures 3(b) and 3(c) show the histogram of backbone dihedral angle ``flips'', again, without and with REPSWA, respectively; the insets show the evolution of a typical backbone dihedral for the two cases. It can be seen that the use of REPSWA leads to a much greater dihedral flipping rate by a factor of 30, and, hence, an improved sampling of conformational space by a factor of tex2html_wrap_inline422 based on tex2html_wrap_inline400 (Figs. 3(d) and 3(e)). Again, the REPSWA calculation was found to be in good agreement with the RIS model [] prediction ( tex2html_wrap_inline678 ).

   figure158
Figure 3: (a) Schematic of group REPSWA for C tex2html_wrap_inline404 H tex2html_wrap_inline406 . (b,d) Number of dihedrals, tex2html_wrap_inline396 , that have crossed barriers tex2html_wrap_inline398 times without(b)/with(d) REPSWA; inset: tex2html_wrap_inline394 . (c,e) End-to-end distance ( tex2html_wrap_inline400 , tex2html_wrap_inline416 Å) without(c)/with(e) REPSWA.

In conclusion, a novel method for enhancing conformational sampling simulations of complex molecular systems, REPSWA-MD, has been introduced. REPSWA-MD smooths a rough potential energy surface via a nonlinear variable transformation and, thus, does not alter the equilibrium properties of a system. Unlike guiding potentials or umbrella sampling, the method does not require a posteriori reweighting and can be used to enhance sampling in a very large numbers of degrees of freedom without exponentially poor scaling. A dramatic improvement in the rate of barrier crossing events and the number of statistically independent conformations sampled has been demonstrated (e.g. 6 orders of magnitude for a 400-mer chain). The new technique can easily be combined with complementary transformations [] that help overcome entropic barriers as well as with parallel tempering schemes. Our current efforts are focused on applying REPSWA-MD to proteins and other large biomolecules.

This research was supported by PRF 32139-AC and NSF CHE-96-5015 (G.J.M.) and by Research Corp. RI0218, NSF CHE-98-75824, and an NYU Whitehead Fellowship in Biomedical and Biological Sciences (M.E.T.) and NSF-EIA-0081307 (G.J.M and M.E.T.).

references185