DC: Server-Based Tools for Manipulation of Dipolar Couplings

From the Bax Group at the National Institutes of Health ...
About DC: Server-Based Tools for Dipolar Coupling Calculations

See Also:

Bax Group DC Calculation Servers

NMRPipe Table Format

Chemical Shifts, Dipolar Couplings, and Molecular Fragment Replacement

Contacts:

Yang Shen shenyang@nih.gov

Ad Bax bax@nih.gov

About DC: Server-Based Tools for Dipolar Coupling Calculations

The DC computation servers provide facilities for analyzing dipolar couplings and their relation to molecular structure.

The mathematical form for dipolar couplings can be expressed in terms of an alignment tensor, as described in more detail below.

The most basic DC analysis involves fitting a collection of measured dipolar couplings to a known structure. This will result in a set of best-fit alignment tensor parameters. It will also provide a collection of calculated dipolar couplings which can be compared to the measured values.

It is also possible to use previously-derived alignment parameters to calculate dipolar couplings for different parts of the same molecular system. For example, given a set of dipolar couplings and a proposed structure for a two-domain molecule, we can fit the couplings for the first domain to determine alignment tensor parameters, and then use these parameters to calculate dipolar couplings for the second domain.

An alignment tensor is also associated with one or more molecular orientations. So, given alignment tensor parameters for a proposed structure, we can also rotate that structure into the possible orientations consistent with the alignment tensor.

Accordingly, the input for a typical DC Server calculation will include:

A molecular structure in PDB format
A table of dipolar couplings in NMRPipe table format, all measured in the same alignment medium.
Optional specification of one or more alignment tensor parameters or limits.
Selection of which parts of the molecular system to use in the calculation, for example as a range of residue numbers.

The typical output will include:

A report of the tensor parameter values.
A table of measured couplings with their corresponding calculated couplings.
Statistics which describe the agreement between the measured and calculated couplings.
Graphs of the measured and observed couplings in PDF format.
Versions of the input PDB structure, rotated into orientations described by the alignment tensor.

The DC Server modules include two methods for fitting dipolar couplings to a given structure. While both of these methods compute couplings, they perform the computation and fitting differently:

In the Order Matrix Fitting method, no prior assumption is made about the alignment tensor parameters, and the fit is performed by linear least squares analysis using equations 3, 4, and 8 below. Since Singular Value Decomposition is used, this is also called the SVD method. In the DC modules, the SVD fitting is weighted by uncertainty values DD from the input.
In the Non-Linear Fitting method, specific values or limits on one or more tensor parameters can be specified, and the fit is performed according to equation 2 and 7 below using repeated trials of non-linear least squares optimization. The optimization attempts to minimize the weighted RMS between the observed and calculated couplings. In the DC modules, the non-linear fitting is weighted by W values from the input.

In addition to fitting dipolar couplings to a given structure, the DC modules also include two related options for calculating dipolar couplings for a given structure when the alignment tensor is specified completely:

In the Known Order Matrix option, dipolar couplings for a given PDB structure are calculated according to five order matrix values. These five values are usually determined by previous Order Matrix Fitting using part of the same PDB structure.
In the Known Tensor Parameters option, dipolar couplings are calculated according to given values for the five parameters which specify the alignment tensor.

Calculating Dipolar Couplings from Alignment Tensor Parameters and Molecular Coordinates

The alignment tensor can be described according to five parameters:

The probability of alignment, also called the alignment magnitude, Da, which varies from 0.0 for perfectly isotropic systems to 1.0 for complete static alignment.
The axial symmetry in the alignment, also called the rhombicity, Rh, which varies from 0.0 for perfect axial symmetry to 2/3.
Three rotational angles, sometimes also called euler angles, which specify the orientation of alignment with respect to the magnetic field. DC modules always specify rotations according to an XYZ rotation convention, discussed in more detail below. The three XYZ-convention rotational angles are sometimes called RX, RY, and RZ in DC reports.

For a structure rotated into a given orientation, the dipolar coupling between atoms I and J at coordinates (xI,yI,zI) and (xJ,yJ,zJ) separated by distance r^IJ is expressed as follows; the direction cosines are given as:

X^IJ = (xI-xJ) / r^IJ Y^IJ = (yI-yJ) / r^IJ Z^IJ = (zI-zJ) / r^IJ [Eqn 1a-c]

Then, given tensor values Da and Dr such that rhombicity Rh = Dr / Da, the dipolar coupling D is calculated as:

D = |DI|[Da(3Z^IJZ^IJ - 1) + 1.5 Dr(X^IJX^IJ - Y^IJY^IJ)] [Eqn 2]

This is the functional form used to calculate dipolar couplings in the DC server restrained fit modes. The dipolar interaction value, DI, is the dipolar coupling that would be observed for the static (completely aligned) molecule. It is about -21,585 Hz for an HN-N amide coupling, as described in more detail below.

Calculating Dipolar Couplings from Order Matrix Values and Molecular Coordinates

The alignment tensor can also be described in terms of a symmetric 3x3 matrix called the order matrix, or sometimes called the saupe matrix. The relationship between the order matrix and the tensor parameters is described in more detail elsewhere. Since it is symmetric, the 3x3 order matrix can be described in terms of five unique values. In the order matrix fitting scheme, the dipolar coupling D is expressed as the linear combination of five terms:

DD^-1 D = c₁*q₁ + c₂*q₂ + c₃*q₃ + c₄*q₄ + c₅*q₅ [Eqn 3]

where DD is the estimated uncertainty in the measured coupling, and the five terms in the basis set are:

q₁ = ¹/₂ DD^-1 |DI| (3Z^IJZ^IJ - 1) q₂ = ¹/₂ DD^-1 |DI| (X^IJX^IJ - Y^IJY^IJ) q₃ = 2 DD^-1 |DI| X^IJY^IJ q₄ = 2 DD^-1 |DI| X^IJZ^IJ q₅ = 2 DD^-1 |DI| Y^IJZ^IJ

[Eqn 4a-e]

Given the five coefficients c₁ ... c₅, the elements of the order matrix S are:

       S_xx = -¹/₂(c₁ - c₂)        S_xy = S_yx = c₃

       S_yy = -¹/₂(c₁ + c₂)        S_xz = S_zx = c₄

       S_zz = c₁        S_yz = S_zy = c₅

             [Eqn 5a-f]

Given five or more measured couplings, it is possible to use linear least squares fitting to estimate values for the coefficients c1 ... c5 using a series of equations built from the q1 ... q5 terms above. In practice, the linear least squares analysis is performed via Singular Value Decomposition (SVD), so this method of tensor estimation is often called the SVD method.

Once an order matrix is determined, it can be used to compute the five tensor parameters of magnitude, rhombicity, and rotational angles. Likewise, given these five tensor parameters, it is possible to calculate the corresponding order matrix. So, as a convenience, all DC fitting modes report both order matrix information and tensor parameters. The five coefficients c1 ... c5 are reported in the DATA SAUPE line of a DC output table.

DC Calculations for More than Two Atoms: Methyl and Methylene Couplings and Ensembles of Structures

In some cases, a measured dipolar coupling can correspond to more than one pair of atoms, in particular H-C couplings for methyl groups, such as ALA HB-CB, and methylene groups, such as GLY HA-CA. In these cases, the DC modules will compute couplings based on the sum of individual couplings in the group. For example, a GLY HA-CA coupling corresponds to the sum of two terms, one for the HA2-CA atom pair, and another for the HA3-CA atom pair.

Likewise, methyl H-C dipolar couplings can be expressed as the sum of three terms, such as for ALA HB1-CB, HB2-CB, and HB3-CB. However, methyl H-C dipolar couplings are a special case, because of the rapid rotation of the methyl protons about the C-C axis. In this case, the H-C couplings are scaled by a contant which depends on the tetrahedral geometry of the methyl group. Details are given in Ottiger and Bax, J. Magn. Reson. 134, 365-369 (1998).

Dipolar coupling calculations using a single alignment tensor for an ensemble of structures can be treated in a similar way. In this case, one measured coupling is expressed as a sum of terms, one for each structure in the ensemble.

Along these lines, we can modify the equations above to accommodate the cases of multiple atom pairs and ensembles of structures. In the general case, we can consider each dipolar coupling according to a group of one or more atom pairs I and J. For example, an amide HN coupling would be a group with a single atom pair I = HN and J = N, while a GLY HA-CA coupling would be a group consisting of two atom pairs, one with I = HA2 and J = CA, and the other with I = HA3 and J = CA.

Then, given an ensemble of structures K = 1 to N_PDB rotated into a given orienation, the overall dipolar coupling for the group of atom pairs I and J over the ensemble is expressed as follows:

If the coordinates of atom I and atom J in the K-th structure of the ensemble are (x_KI,y_KI,z_KI) and (x_KJ,y_KJ,z_KJ) separated by distance r_K^IJ the direction cosines are given as:

X_K^IJ = (x_KI-x_KJ) / r_K^IJ Y_K^IJ = (y_KI-y_KJ) / r_K^IJ Z_K^IJ = (z_KI-z_KJ) / r_K^IJ [Eqn 6a-c]

Then, we can generalize Equation 2 such that the overall dipolar coupling D for the group of atom pairs I and J over the ensemble is calculated as:

D = Σ Σ Σ S_rot|DI|[Da(3Z_K^IJZ_K^IJ - 1) + 1.5 Dr(X_K^IJX_K^IJ - Y_K^IJY_K^IJ)] / N_PDB [Eqn 7]

K I J

where S_rot is an empirical scaling constant which is 3*0.309107 for H-C methyl groups, and 1.0 otherwise.

In the case of order matrix calculation, the terms in the basis set of Equation 4a-e are generalized accordingly for a group of atom pairs I and J over an ensemble of structures:

     q₁ = Σ Σ Σ ¹/₂ S_rot DD^-1 |DI| (3Z_K^IJZ_K^IJ - 1) / N_PDB

K I J

     q₂ = Σ Σ Σ ¹/₂ S_rot DD^-1 |DI| (X_K^IJX_K^IJ - Y_K^IJY_K^IJ) / N_PDB

K I J

     q₃ = Σ Σ Σ 2 S_rot DD^-1 |DI| X_K^IJY_K^IJ / N_PDB
K I J

     q₄ = Σ Σ Σ 2 S_rot DD^-1 |DI| X_K^IJZ_K^IJ / N_PDB
K I J

     q₅ = Σ Σ Σ 2 S_rot DD^-1 |DI| Y_K^IJZ_K^IJ / N_PDB
K I J

       [Eqn 8a-e]

Dipolar Interaction (DI) Values and Standardized Bond Distances

As noted above, the dipolar interaction value, DI, is the dipolar coupling that would be observed for the static (completely aligned) molecule, and is given as:

DI = (u0*hbar)*gammaI*gammaJ/(4*PI*PI*rIJ^3) [Eqn 9]

where u0 is the magnetic permittivity of vacuum, hbar is Planck's constant over 2*PI, gammaI and gammaJ are the magnetogyric ratios for spins I and J, and rIJ is the distance between nuclei I and J. Some common dipolar interaction values for protein dipolar couplings are given in the following table. Since dipolar couplings depend on the cube of the distance between atoms, they are very sensitive to structural variations. In practice, the minor uncertainties in atomic coordinates may give rise to substantial errors in calculated dipolar couplings. For this reason, the DC modules will attempt to use standardized distances when possible. The DC software uses the following standardized values for couplings, according to atom names (HN N C HA CA etc.) in the input:

Atom Pair I - J rIJ Angtroms Dipolar Interaction
Value DI

HN - N 1.041 -21,585 Hz

HN - C' 2.085 6,666 Hz

C' - N 1.329 -2,609 Hz

HA - CA 1.107 44,539 Hz

H - CH3 1.117 43,353 Hz

C' - CA 1.525 4,285 Hz

N - CA 1.458 -1,976 Hz

The DC server modules include the option to use existing Dipolar Interaction (DI) values from the input table. This is intended to allow use of standardized DI values for molecules other than proteins if desired. If DI values are not specified in the input, they will be calculated according to information in the PDB file. In the case of common fixed-geometry peptide couplings (HN-N, HN-C ', N-C ', N-CA, CA - C ', H - CH3 and HA-CA) standardized interatomic distances in the table above will be used to calculate DI. Otherwise, atomic distances from the PDB file will be used. If a DI value of 0.0 is specified in the input file, it will always be replaced by a calculated DI value.

Computing Dipolar Couplings from Measured Isotropic and Aligned Values: Sign Issues

A coupling observed in an aligned sample is the sum of a dipolar coupling D and a scalar coupling J. So, experimental dipolar coupling values are generally computed as a coupling from an aligned sample minus the corresponding J-coupling from an isotropic sample, in other words D = aligned - isotropic. Proper handling of the signs of couplings can be a confusing issue, since the intrinsic signs of couplings are often ignored, or at least not measured directly. For example, when an HN-N protein amide coupling is measured from peak separation in an HN-N doublet, the sign of the coupling isn't determined directly. Since a 1H atom has a positive gamma value and a 15N atom has a negative gamma value, we expect that the HN-N isotropic J-coupling should also be negative, usually around -95 Hz. In practice however, HN-N J-coupling values are commonly recorded as positive numbers, since the sign is not directly apparent from typical coupling measurements.

Correspondingly, the DC modules compute dipolar couplings independently of the sign of gamma values, using the absolute value of the dipolar interaction value, |DI|. So, when preparing dipolar couplings for use with DC modules, the signs of isotropic and aligned couplings should not be adjusted according to the sign of gamma values. For example, if an HN-N doublet has a 94 Hz separation in an isotropic sample, and a 82 Hz separation in an aligned sample, then the dipolar couping D is simply 82 Hz minus 94 Hz = -12 Hz.

The physical implication of this is that DC-module dipolar couplings for vectors which point in the same direction will have the same sign, regardless of the atom types. So for example, if an HA-CA bond vector points in the same direction as an HN-N bond vector, the HA-CA and HN-N dipolar couplings will both have the same sign.

Additional examples of dipolar couplings computed from isotropic and aligned coupling values are shown in the following table:

RESID	RESNAME	HA-CA Isotropic	HA-CA Aligned	HA-CA DC	HN-N Isotropic	HN-N Aligned	HN-N DC	C-CA Isotropic	C-CA Aligned	C-CA DC	C-N Isotropic	C-N Aligned	C-N DC
1	MET	.	.	.	.	.	.	.	.	.	.	.	.
2	GLN	143.16	130.66	-12.50	.	.	.	.	.	.	.	.	.
3	TYR	145.31	132.64	-12.67	93.98	84.72	-9.27	53.07	53.24	0.17	4.71	10.18	5.47
4	LYS	140.12	124.60	-15.52	93.58	87.57	-6.01	51.85	53.22	1.37	4.82	-3.76	-8.58
5	LEU	142.47	130.10	-12.37	92.69	86.67	-6.01	53.32	56.34	3.02	4.74	4.76	0.02
6	VAL	140.17	129.42	-10.75	93.37	87.90	-5.47	52.18	50.71	-1.47	4.54	-1.02	-5.56
7	ILE	139.92	126.07	-13.84	93.17	89.30	-3.87	53.16	56.77	3.61	4.74	1.78	-2.96
8	ASN	140.14	125.14	-15.00	94.60	86.04	-8.56	53.55	52.20	-1.34	5.18	1.47	-3.71
9	GLY	.	.	.	94.58	94.18	-0.40	53.86	57.46	3.60	5.37	-0.30	-5.67
10	LYS	147.12	123.31	-23.81	93.28	82.07	-11.21	50.88	51.36	0.49	4.63	8.41	3.78

[ Show Complete Table ]

DC Uncertainty Values (DD) and Weighting Factors (W)

A given DC input table might contain more than one type of dipolar coupling. For example, DC input for a protein might include both HN-N amide and HA-CA dipolar couplings. As listed in the previous section, each type of coupling has a characteristic Dipolar Interaction (DI) value. So, for example since the DI value for an HA-CA coupling is about two times larger than the DI value for an HN-N coupling, the HA-CA dipolar couplings will be about two times larger than HN-N dipolar couplings in the same sample. In order to compensate for the overall size differences between different types of couplings, each dipolar coupling value D in the input table can have two associated values:

The DD value is the uncertainty in the coupling, given in Hz. As shown in equations 3 and 4 above, the Order Matrix Fitting method weights each dipolar coupling D by the inverse of its corresponding uncertainty DD, so that couplings with large DD uncertainty values have low importance in the Order Matrix SVD Fitting calculation. Since the DD value determines the importance that a coupling has in calculating order matrix coefficients, it can affect the tensor parameters determined by order order matrix fitting.
The W value is a statistical weighting factor. The W value does not affect the Order Matrix SVD calculation directly. Instead, it determines how a coupling contributes to the RMS value and other reported statistics. Couplings with small W values have low importance in the statistics calculations. For example, in a DC output, the reported RMS between N observed couplings D_OBS and N coresponding calculated couplings D is computed as:
RMS = [ Σ ( W*D_OBS - W*D )² / N ]^1/2
Since the DC Non-Linear Fitting method works by minimizing this RMS statistic, W values can affect the tensor parameters found by non-linear fitting. By contrast, while the W values will change reported statistics like RMS, they will not affect the tensor parameters determined by order matrix fitting.

The DD and W values are optional for input tables used with the DC server modules. If these columns are missing from the input table, default values will be used. These default values will be proportional to the DI value for the given coupling, and relative to the standard DI value for a protein HN-N coupling with a 1.0 Hz uncertainty, so that the default DD = |DI|/21,585 and the default W = 21,585/|DI|

Atom Pair I - J Default DD Default W

HN - N 1.000 1.000

HN - C' 0.309 3.238

C' - N 0.121 8.273

HA - CA 2.063 0.485

H - CH3 2.008 0.498

C' - CA 0.199 5.037

N - CA 0.092 10.923

Dipolar Coupling Input Table Format

The DC modules use dipolar coupling input tables in the NMRPipe table format. A DC table groups together all couplings associated with a given aligned sample. So, a DC input table might contain more than one type of coupling measurement, for example both HN-N couplings and HA-CA couplings.

In its simplest form, entries in a DC input table identify a pair of atoms I and J, and give the coupling in Hz. The atoms are each identified by a residue ID, a residue name, and an atom name, as shown in this example of the simplest DC input format. As shown in the example every dipolar coupling input table for the DC servers must contain columns for RESID_I RESNAME_I ATOMNAME_I, RESID_J RESNAME_J ATOMNAME_J, and the dipolar coupling value in Hz, column D.

Atom names in the DC input table must correspond to atom names used in the PDB file. For example, if an atom name for a given residue is given as HN in the DC input, then the PDB file should also have an amide proton called HN for that residue, rather than one called H. Methylene and Methyl protons in the DC input can be listed using hashmark-style naming, for example GLY HA# CA, ALA HB# CB, VAL HG1# CG1 etc. In these cases of couplings for more than one atom pair, the coupling should be given as the sum of its components. For example, a GLY HA#-CA coupling is given as the sum of HA2-CA and HA3-CA couplings, while an ALA HB#-CB coupling is given as the sum of of the three (identical) one-bond H-C couplings.

In the case of dipolar coupling tables for proteins, the input table can also include amino acid sequence information in the form of a DATA FIRST_RESID line and one or more DATA SEQUENCE LINES as in this example. In current versions of the DC modules, this sequence data is used to label graphs of dipolar couplings with respect to residue.

The DC input table can also contain columns for the estimated uncertainties in the couplings DD, and the statistical weighting factors W, as shown in this example.

If the PDB file has multiple chain IDs, the DC input table will require specification of CHAINNAME_I and CHAINNAME_J. Likewise if the PDB file has multiple segment IDs, the DC input table will require specification of SEGNAME_I and SEGNAME_J. Note: in a PDB ATOM line, which spans characters 1 to 80, the chain ID is given in character 22, and the segment ID is given in characters 73 to 76.

Column Name Usage Description

RESID_I required Residue ID of Atom I.

RESNAME_I required Residue name of Atom I.

ATOMNAME_I required Atom name of Atom I, which must match the corresponding PDB atom name.

SEGNAME_I optional PDB Segment ID name of Atom I (four characters max)

CHAINNAME_I optional PDB Chain ID name of Atom I (single character)

RESID_J required Residue ID of Atom J.

RESNAME_J required Residue name of Atom J.

ATOMNAME_J required Atom name of Atom J, which must match the corresponding PDB atom name.

CHAINNAME_J optional PDB Chain ID name of Atom J (single character)

SEGNAME_J optional PDB Segment ID name of Atom J (four characters max)

D required Observed dipolar coupling, Hz.

DI optional Dipolar Interaction value, Hz.

DD optional Estimated uncertainty in the observed coupling, Hz.

W optional Weighting factor for computing RMS etc.

Selection of Residues for a DC Calculation

The many DC calculation servers provide an option to select which residues of the molecule will be included in the DC calculation. The selection is specified as a list of residue IDs or ranges separated by spaces. The keywords First and Last are used for the lowest and highest residue IDs in the molecule, respectively. The keyword to is used to specify a range of residues. So, for example, the following specification includes all couplings, i.e., all couplings from the lowest residue ID to the highest residue ID in the molecule (this is usually the default selection in the DC modules):

first to last

Likewise, a specific range of residues can be specified, for example:

22 to 34.

Multiple ranges can be specified, as well as individual residues. For example, the following specification selects residues 10 and 13 as well as the range of residues 22 to 34 and the range of residues 40 to 50:

10 13 22 to 34 40 to 50

The specifications can include commas for clarity, but the commas are ignored. So, the following two specification lines select the same residues:

10, 13, 22 to 34, 40 to 50
10 13 22 to 34 40 to 50

The keywords chain and seg are used to specify residues in systems with multiple chains or segments, for example:

chain A 30 to 40.

After a given chain or segment is specified, that selection applies to all residues that follow, until a new chain or segment is specified. For example, the following specification selects residues 30 to 40 from chain A, ranges 50 to 60 and 80 to 90 from chain B, and residues 10 to 20 from chain C:

chain A 30 to 40 chain B 50 to 60 80 to 90 chain C 10 to 20.

Note that if the PDB file has multiple chain IDs, the DC input table will require specification of CHAINNAME_I and CHAINNAME_J. Likewise if the PDB file has multiple segment IDs, the DC input table will require specification of SEGNAME_I and SEGNAME_J.

Rotational Angles and Tensor Orientation

There are several conventions for specifying rotational angles. In the case of the DC server modules, orientations are always specified as "XYZ" rotations: a rotation about the X-Axis, followed by a rotation about the Y-Axis, followed by a rotation about the Z-Axis. An orientation can also be described by its 3x3 rotation matrix, which does not depend on any particular convention for specifying rotational angles.

As a general point, the relationship between rotational angles and orientation is not unique. In practice, there will usually be two different sets of rotational angles which both result in the same orientation, and give the same rotation matrix. The results of a DC computation will report orientations as one or more sets of XYZ-convention rotational angles, given as EULER_ANGLES in a DC output table. The DC results will also report the values from the corresponding 3x3 rotation matrix, given as three ROTATION_MATRIX entries in the output.

In the specific case of dipolar couplings, the tensor parameters describe the equivalent of a 3D elipsoid. As a result, the tensor parameters are four-fold degenerate, because as with an elipsoid, there is no way to distinguish "top" from "bottom", "left" from "right", or "front" from "back". So, for a given choice of Da and Rh, and rotation angles RX RY RZ, there are at least four different alignment orientations which will give rise to the same couplings:

The orientation resulting from rotation by RX RY RZ.
The orientation resulting from rotation by RX RY RZ, followed by an additional 180-degree rotation about the X-Axis.
The orientation resulting from rotation by RX RY RZ, followed by an additional 180-degree rotation about the Y-Axis.
The orientation resulting from rotation by RX RY RZ, followed by an additional 180-degree rotation about the Z-Axis.

Dipolar Coupling Output Table

An example dipolar coupling calculation output is given here. Some details follow:

In the DC output table table, N is the number of couplings, and N_PDB is the number of molecular structures.
In the DC output table columns, D_OBS is the observed dipolar coupling value as taken from the input, and D is the calculated dipolar coupling.
Q-factor, RMS, and Correlation factor are each computed from couplings multiplied by W values from the table. For example, for a collection of N couplings, the RMS and Q-Factor are computed as:

RMS = [ Σ( W*D_observed - W*D_calculated )²/ N ]^1/2 Q_FACTOR = ( Σ( W*D_observed - W*D_calculated )²/[N*(Da²(4 + 3Rh²)/5)] )^1/2
In fitting via SVD (saupe order matrix method) coupling values are divided by uncertainty DD to determine the order matrix values. So, in this case, the DD values will influence the tensor parameters but the W values will only influence the reported statistics.
In non-linear fitting mode, the fit attempts to minimize the RMS, which is weighted by the W values. So in this case, the DD values will not influence the tensor parameters, but the W values will.
Methylene CH2 couplings, such as GLY HA#/CA, are given as the sum of both H-C couplings. Likewise, methyl H-C couplings such as ALA HB#/CB are given as the sum of three one-bond H-C values, as in the full separation between the two outer peaks of a methyl CH3 quartet.
All rotational angles are given as an XYZ Rotation which rotates the PDB file coordinates onto the alignment tensor frame.
For non-linear fitting, SAUPE (order matrix) values are back-calculated from the fitted Da, Dr, and rotations. The five values reported in the SAUPE line of the table correspond to the five coefficients c1 ... c5 which define the 3x3 order matrix S, as given in Equation 5a-f above.
The three values reported in S_XYZ are the diagonal elements of the diagonalized order matrix S, sorted so that |S_zz| > |S_yy| > |S_xx|.
In the case of nonlinear fit modes, values reported for RX=PSI RY=THETA RZ=PHI are the actual XYZ rotations applied by the fit, also given together as ROT_XYZ. Values for EULER_ANGLES are the XYZ rotations back-calculated from the order matrix values.

DC Error Analysis Modes

The DC modules include several error analysis options. In each case, a given dipolar coupling calculation will be repeated for many trials, with the input data adjusted in some way at each trial. At each trial, the fitted alignment tensor parameters are recorded, along with statistics about the quality of fit.

Error Analysis by Randomized Couplings: this analysis is intended to show how random uncertainty in the measured couplings influences the fitted alignment tensor parameters. Each trial uses couplings D_trial formed by adding noise to the original couplings D, as D_trial = D + c*R(DD), where DD is the estimated uncertainty in the coupling as specified in the input table, R(DD) is a Gaussian-random noise function with standard devition DD, and c is an adjustable scaling parameter for the error analysis, typically around 1.0.
Error Analysis by Randomized Atomic Coordinates: this analysis is intended to show how uncertainties in the structure influence the the fitted alignment tensor parameters and the quality of the dipolar coupling fit. In some cases, this might also indicate the influence of molecular motions on the fitted parameters. Each trial of this error analysis uses a PDB structure computed by adding Gaussian-random noise of a given standard deviation to the atomic coordinates of the original PDB input.
Cross-Validation by Systematic Deletion: this analysis is intended to show how well a given collection of couplings defines the alignment tensor parameters, particularly when a only a small number of couplings is available. In each trial of this analysis, one of the couplings is deleted, and the fit is performed on the remainder. The alignment tensor parameters from the fit of the remainder are then used to calculate a value for the deleted coupling, which can be compared to the value originally calculated by fitting the complete data set. The trials are repeated so that each coupling is systematically deleted in turn.
Cross-Validation by Random Deletion: this analysis is similar to the systematic cross-validation above, but in this analysis, each trial deletes a randomly selected fraction of the original data. In each trial, the fit is performed on the remainder, and the alignment tensor parameters from the fit of the remainder are then used to calculate values for the deleted couplings.

* All documents in PDF format require the free Adobe Acrobat Reader application for viewing

[ Home ] [ NIH ] [ NIDDK ] [ Terms of Use ]
last update: Dec 6 2011