Protein nuclear magnetic resonance spectroscopy

Protein nuclear magnetic resonance spectroscopy (usually abbreviated protein NMR) is a field of structural biology in which NMR spectroscopy is used to obtain information about the structure and dynamics of proteins. Work in the field was recognised with a share of the Nobel Prize in Chemistry in 2002. Protein NMR techniques are continually being used and improved in both academia and the biotech industry. Structure determination by NMR spectroscopy usually consists of the following phases, each using a separate set of highly specialized techniques: the sample is prepared, resonances are assigned, restraints are generated, and a structure is calculated and validated.

Sample preparation

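Protein nuclear magnetic resonance is performed on aqueous samples of highly purified protein. The protein can come from a natural source or be produced in an expression system; recombinant expression usually makes it easier to obtain sufficient quantities and makes isotopic labelling possible.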

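The most abundant isotopes of carbon and oxygen have no net nuclear spin, which is the physical property exploited in NMR. Proteins are therefore commonly labelled with carbon-13 and nitrogen-15 by growing the expression host, for example bacteria or yeast, on isotopically enriched media.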

The purified protein is usually dissolved in a buffer solution and adjusted to the desired solvent conditions.

Data collection

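Protein NMR utilizes multidimensional nuclear magnetic resonance experiments to obtain information about the protein. Ideally, each distinct nucleus in the molecule experiences a distinct chemical environment and thus has a distinct chemical shift. The experiments used on proteins fall into two main categories: those in which magnetization is transferred through the chemical bonds, and those in which it is transferred through space, irrespective of the bonding structure. The first category is used to assign the different chemical shifts to a specific nucleus, and the second is primarily used to generate the distance restraints used in the structure calculation, and in the assignment with unlabelled protein.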

Depending on the concentration of the sample, on the magnetic field of the spectrometer, and on the type of experiment, a single multidimensional nuclear magnetic resonance experiment on a protein sample may take hours or even several days to obtain suitable signal-to-noise ratio through signal averaging, and to allow for sufficient evolution of magnetization transfer through the various dimensions of the experiment. Other things being equal, higher-dimensional experiments will take longer than lower-dimensional experiments.
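The cost of signal averaging can be estimated from the standard result that the signal adds linearly with the number of scans while random noise adds as its square root, so the signal-to-noise ratio improves as the square root of the number of scans. The sketch below is illustrative only; the function name and the example numbers are assumptions, not values from a real experiment.

```python
import math

def scans_needed(snr_single_scan: float, snr_target: float) -> int:
    """Number of scans to average to reach snr_target,
    assuming SNR grows as the square root of the number of scans."""
    if snr_single_scan <= 0 or snr_target <= 0:
        raise ValueError("SNR values must be positive")
    ratio = snr_target / snr_single_scan
    # At least one scan is always recorded.
    return max(1, math.ceil(ratio ** 2))

# Hypothetical numbers: doubling the SNR costs four times as many scans.
print(scans_needed(5.0, 10.0))
print(scans_needed(5.0, 20.0))
```

The quadratic cost is one reason why dilute samples can push a single experiment from hours to days.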

Typically the first experiment to be measured with an isotope-labelled protein is a 2D heteronuclear single quantum correlation (HSQC) spectrum. In the 15N-HSQC of a nitrogen-15 labelled protein, one signal is expected for each backbone amide, with the exception of proline, which has no amide hydrogen. Tryptophan and certain other residues with N-containing sidechains also give rise to additional signals. The 15N-HSQC is often referred to as the fingerprint of a protein because each protein has a unique pattern of signal positions. Analysis of the 15N-HSQC allows researchers to evaluate whether the expected number of peaks is present and thus to identify possible problems due to multiple conformations or sample heterogeneity. The relatively quick HSQC experiment helps determine the feasibility of doing subsequent longer, more expensive, and more elaborate experiments. It is not possible to assign peaks to specific atoms from the HSQC alone.
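The expected number of backbone peaks in a 15N-HSQC can be estimated from the amino acid sequence. The following is a rough sketch, assuming one peak per residue except for proline (which has no amide proton) and the N-terminal residue (whose free amino group is usually not observed). Sidechain signals are ignored, and the sequence is made up for illustration.

```python
def expected_backbone_hsqc_peaks(sequence: str) -> int:
    """Rough count of backbone amide peaks expected in a 15N-HSQC.

    Assumptions: one-letter amino acid codes, no peak for the
    N-terminal residue, no peak for proline, sidechains ignored.
    """
    seq = sequence.upper()
    # Skip the first residue, then count every non-proline residue.
    return sum(1 for residue in seq[1:] if residue != "P")

# Hypothetical 10-residue peptide containing two prolines.
print(expected_backbone_hsqc_peaks("MKTAYPAPLW"))
```

Observing clearly more or fewer peaks than this estimate is the kind of discrepancy that points to multiple conformations or a heterogeneous sample.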

Resonance assignment

In order to analyze the nuclear magnetic resonance data, it is important to obtain a resonance assignment for the protein, that is, to find out which chemical shift in each dimension corresponds to which atom. Several different types of experiments have been invented to achieve this. The procedure depends on whether the protein is isotopically labelled or not, since many of the assignment experiments depend on carbon-13 and nitrogen-15.

Homonuclear nuclear magnetic resonance

With unlabelled protein the usual procedure is to record a set of two-dimensional homonuclear NMR experiments: correlation spectroscopy (COSY), total correlation spectroscopy (TOCSY) and nuclear Overhauser effect spectroscopy (NOESY). COSY and TOCSY transfer magnetization through chemical bonds, so their crosspeaks connect protons within the continuous chain of protons of an individual amino acid. These two experiments are therefore used to build so-called spin systems, that is, lists of the chemical shifts of the peptide proton, the alpha protons and all the sidechain protons of each residue. Which chemical shift corresponds to which nucleus in the spin system is determined from the COSY connectivities and from the fact that different types of protons have characteristic chemical shifts. To connect the different spin systems in sequential order, the NOESY experiment has to be used. Because this experiment transfers magnetization through space, it shows crosspeaks between all protons that are close in space, regardless of whether they are in the same spin system. Neighbouring residues are inherently close in space, so the sequential assignment can be made from the NOESY peaks that connect one spin system to another.

One important problem using homonuclear nuclear magnetic resonance is overlap between peaks. This occurs when different protons have the same or very similar chemical shifts. This problem becomes greater as the protein becomes larger, so homonuclear nuclear magnetic resonance is usually restricted to small proteins or peptides.

Nitrogen-15 nuclear magnetic resonance

The process of resonance assignment for a nitrogen-15 labelled sample is similar to the homonuclear case: here, too, no experiment can be performed that transfers magnetisation between two spin systems through bonds. The main difference is the ability to record nitrogen-15 edited three-dimensional experiments: TOCSY-N-HSQC and NOESY-N-HSQC. These experiments build on the HSQC experiment but have an additional proton dimension. They can be visualised as each peak in the HSQC having the TOCSY or NOESY peaks stacked onto it. Thus if an amide proton, H_N, has a cross peak to its alpha proton, H_alpha, at the coordinates (H_N, H_alpha) in the TOCSY spectrum, the corresponding peak is at (H_N, H_alpha, N) in the TOCSY-N-HSQC. It is therefore possible to resolve overlaps in the proton dimension if the corresponding nitrogens have chemical shifts distinct from one another.
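The stacking picture can be sketched by grouping the peaks of a three-dimensional spectrum into strips, one strip per (H_N, N) position. The peak list, tolerances and function name below are hypothetical; the example shows two amide protons that overlap in the proton dimension but are separated by their nitrogen shifts.

```python
def group_into_strips(peaks_3d, tol_h=0.02, tol_n=0.2):
    """Group 3D peaks (h_n, h_other, n), in ppm, into strips.

    Returns a dict mapping each (h_n, n) root to the list of shifts
    found in the additional proton dimension. Tolerances are assumed.
    """
    strips = {}
    for h_n, h_other, n in peaks_3d:
        for root in strips:
            if abs(root[0] - h_n) <= tol_h and abs(root[1] - n) <= tol_n:
                strips[root].append(h_other)
                break
        else:
            # No existing strip matches, so this peak starts a new one.
            strips[(h_n, n)] = [h_other]
    return strips

# Hypothetical peaks: same amide proton shift, different nitrogen shifts.
peaks = [
    (8.25, 4.31, 118.4),
    (8.25, 1.92, 118.4),
    (8.25, 4.62, 123.9),
    (8.25, 3.05, 123.9),
]
strips = group_into_strips(peaks)
for root, shifts in strips.items():
    print(root, shifts)
```

In a two-dimensional spectrum all four peaks would fall on the same H_N line; with the nitrogen dimension they separate into two strips.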

Carbon-13 and nitrogen-15 nuclear magnetic resonance

When the protein is labelled with carbon-13 and nitrogen-15 it is possible to record experiments that transfer magnetisation over the peptide bond, and thus connect different spin systems through bonds. This is usually done using some of the following experiments: HNCO, HNCACO, HNCA, HNCOCA, HNCACB and CBCACONH. All six experiments consist of an HSQC plane expanded with a carbon dimension. In the HNCACO the spectrum contains peaks at the chemical shifts of the carbonyl carbons of the residue of the HSQC peak and of the previous residue in the sequence. The HNCO contains only the peak from the previous residue, so it is possible to tell which carbonyl shift belongs to the residue of each HSQC peak and which to the residue before it. The sequential assignment can then be made by matching each spin system's own carbon shifts with the previous-residue shifts of another spin system. The HNCA and HNCOCA work similarly, with the alpha carbons rather than the carbonyls, and the HNCACB and CBCACONH contain both the alpha and the beta carbons. Usually several of these experiments are required to resolve overlap in the carbon dimension. This procedure is usually less ambiguous than the NOESY-based method, since it is based on through-bond transfer. In the NOESY-based methods, additional peaks appear from protons that are close in space but do not belong to sequential residues, which confuses the assignment process. When the sequential assignment has been made it is usually possible to assign the sidechains using HCCH-TOCSY, which is basically a TOCSY experiment resolved in an additional carbon dimension.
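Matching each spin system's own carbon shifts against the previous-residue shifts of other spin systems can be sketched as follows. The data layout, tolerance and shift values are hypothetical, and only alpha carbons are used; real assignment software also has to handle ambiguity, missing peaks and several carbon types at once.

```python
def find_predecessors(spin_systems, tol=0.15):
    """For each spin system, list the spin systems whose own CA shift
    matches its previous-residue CA shift within tol (ppm, assumed)."""
    links = {}
    for name, system in spin_systems.items():
        links[name] = [
            other
            for other, candidate in spin_systems.items()
            if other != name
            and abs(candidate["ca"] - system["ca_prev"]) <= tol
        ]
    return links

# Hypothetical spin systems: "ca" is the residue's own alpha carbon,
# "ca_prev" the alpha carbon of the preceding residue.
spin_systems = {
    "A": {"ca": 58.2, "ca_prev": 54.9},
    "B": {"ca": 54.9, "ca_prev": 62.1},
    "C": {"ca": 62.1, "ca_prev": 45.3},
}
links = find_predecessors(spin_systems)
print(links)
```

Here the matches place the spin systems in the order C, B, A along the sequence. When two candidates match within the tolerance, a second carbon type is needed to break the tie, which is why several experiments are usually combined.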

Restraint generation

In order to make structure calculations, a number of experimentally determined restraints have to be generated. These fall into different categories; the most widely used are distance restraints and angle restraints.

Distance restraints

A crosspeak in a NOESY experiment signifies spatial proximity between the two nuclei in question. Thus each peak can be converted into a maximum distance between the nuclei, usually between 1.8 and 6 angstroms. The intensity of a NOESY peak is proportional to the distance to the minus 6th power, so the distance is estimated from the intensity of the peak. The intensity-distance relationship is not exact, so usually a distance range is used.
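The inverse sixth-power relationship can be turned into a distance estimate by calibrating against a peak between two nuclei whose separation is known. The sketch below assumes such a reference pair is available; the reference distance and the intensities are made-up numbers.

```python
def noe_distance(intensity, ref_intensity, ref_distance):
    """Distance estimate from I proportional to r**-6, i.e.
    r = r_ref * (I_ref / I) ** (1/6)."""
    if intensity <= 0 or ref_intensity <= 0:
        raise ValueError("intensities must be positive")
    return ref_distance * (ref_intensity / intensity) ** (1.0 / 6.0)

# Hypothetical calibration: a reference peak of intensity 6400
# between two protons assumed to be 2.5 angstroms apart.
# A peak 64 times weaker corresponds to twice the distance.
d = noe_distance(intensity=100.0, ref_intensity=6400.0, ref_distance=2.5)
print(round(d, 3))
```

Because of the sixth root, a large error in intensity produces only a modest error in distance, but the estimate is still approximate, which is why it is normally used as a range or an upper bound rather than an exact value.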

It is of great importance to assign the NOESY peaks to the correct nuclei based on the chemical shifts. If this task is performed manually it is usually very labor intensive, since proteins usually have thousands of NOESY peaks. Some computer programs such as CYANA^[3] and ARIA^[4]/CNS perform this task automatically, coupled to a structure calculation.

To obtain assignments that are as accurate as possible, it is a great advantage to have access to carbon-13 and nitrogen-15 NOESY experiments, since they help to resolve overlap in the proton dimension. This leads to faster and more reliable assignments, and in turn to better structures.

Angle restraints

In addition to distance restraints, restraints on the torsion angles of the chemical bonds, typically the psi and phi angles, can be generated. One approach is to use the coupling constants. Another approach uses the chemical shifts to generate angle restraints. Both methods use the fact that the geometry around the alpha carbon affects the coupling constants and chemical shifts, so given the coupling constants or the chemical shifts, an informed estimate can be made of the torsion angles.
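The coupling-constant approach is usually expressed through a Karplus-type relation, J(phi) = A cos^2(phi - 60°) + B cos(phi - 60°) + C, for the three-bond coupling between the amide proton and the alpha proton. The sketch below uses one commonly quoted set of coefficients (A = 6.51, B = -1.76, C = 1.60 Hz); treat these values, the function names and the tolerance as assumptions to be checked against the literature before any real use.

```python
import math

# Assumed Karplus coefficients in Hz (one published-style parametrization).
A, B, C = 6.51, -1.76, 1.60

def karplus_j(phi_deg: float) -> float:
    """Predicted 3J(HN, H-alpha) coupling in Hz for a given phi angle."""
    theta = math.radians(phi_deg - 60.0)
    return A * math.cos(theta) ** 2 + B * math.cos(theta) + C

def consistent_phi(j_measured: float, tol: float = 0.5):
    """All integer phi angles (degrees) whose predicted coupling lies
    within tol Hz of the measured value. Several ranges may match."""
    return [phi for phi in range(-180, 180)
            if abs(karplus_j(phi) - j_measured) <= tol]

print(round(karplus_j(-60.0), 2))   # helix-like phi, small coupling
print(round(karplus_j(-120.0), 2))  # strand-like phi, large coupling
print(consistent_phi(9.5))
```

The relation is not one-to-one, so a measured coupling usually restricts phi to one or more ranges rather than a single value, which is why the result is used as a restraint and not as a measured angle.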

Orientation restraints

Main article: Residual dipolar coupling

The analyte molecules in a sample can be partially ordered with respect to the external magnetic field of the spectrometer by manipulating the sample conditions. Common techniques include addition of bacteriophages or bicelles to the sample, or preparation of the sample in a stretched polyacrylamide gel. This creates a local environment that favours certain orientations of nonspherical molecules. Normally in solution NMR the dipolar couplings between nuclei are averaged out because of the fast tumbling of the molecule. The slight overpopulation of one orientation means that a residual dipolar coupling remains observable. The dipolar coupling is commonly used in solid state NMR and provides information about the orientation of the bond vectors relative to a single global reference frame. Typically the orientation of the N-H vector is probed in an HSQC-like experiment. Initially residual dipolar couplings were used for refinement of previously determined structures, but attempts at de novo structure determination have also been made.^[5]
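The averaging argument can be checked numerically. The dipolar coupling between two nuclei scales with (3 cos^2(theta) - 1)/2, where theta is the angle between the internuclear vector and the magnetic field. The sketch below averages this factor over all orientations, first with every orientation equally likely and then with a small bias; the form and size of the bias are arbitrary assumptions chosen only to illustrate the effect.

```python
import math

def average_orientation_factor(weight, n=20000):
    """Average of (3cos^2(theta) - 1)/2 over the sphere, with each
    orientation weighted by weight(theta). Midpoint-rule integration."""
    numerator = 0.0
    denominator = 0.0
    for k in range(n):
        theta = (k + 0.5) * math.pi / n
        w = weight(theta) * math.sin(theta)  # sin(theta): solid-angle element
        numerator += w * (3.0 * math.cos(theta) ** 2 - 1.0) / 2.0
        denominator += w
    return numerator / denominator

# Isotropic tumbling: every orientation equally likely.
isotropic = average_orientation_factor(lambda theta: 1.0)
# Weak alignment: orientations along the field are slightly favoured.
biased = average_orientation_factor(
    lambda theta: 1.0 + 0.001 * math.cos(theta) ** 2
)
print(isotropic, biased)
```

The isotropic average vanishes to within numerical error, while the weakly biased distribution leaves a small non-zero factor: only a tiny fraction of the full dipolar coupling survives, but it still carries orientational information.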

Hydrogen-deuterium exchange

Main article: Hydrogen-deuterium exchange

NMR spectroscopy is nucleus-specific, so it can distinguish between hydrogen and deuterium. The amide protons in the protein exchange readily with the solvent, and if the solvent contains a different isotope, typically deuterium, the reaction can be monitored by NMR spectroscopy. How rapidly a given amide exchanges reflects its solvent accessibility. Thus amide exchange rates can give information on which parts of the protein are buried, hydrogen bonded, and so on. A common application is to compare the exchange of a free form with that of a complex. The amides that become protected in the complex are assumed to be in the interaction interface.
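An exchange rate for a single amide can be estimated from the decay of its peak intensity after the sample is transferred into deuterated solvent. The sketch below assumes a single-exponential decay, I(t) = I0 exp(-k t), and fits ln(I) against time by ordinary least squares; the time points and intensities are synthetic, generated from an assumed rate.

```python
import math

def fit_exchange_rate(times, intensities):
    """Least-squares fit of ln(I) = ln(I0) - k * t; returns k."""
    logs = [math.log(i) for i in intensities]
    n = len(times)
    mean_t = sum(times) / n
    mean_l = sum(logs) / n
    sxy = sum((t - mean_t) * (l - mean_l) for t, l in zip(times, logs))
    sxx = sum((t - mean_t) ** 2 for t in times)
    return -sxy / sxx

# Synthetic data: assumed rate of 0.05 per minute, no noise.
times = [0.0, 10.0, 20.0, 40.0, 80.0]
intensities = [100.0 * math.exp(-0.05 * t) for t in times]
k = fit_exchange_rate(times, intensities)
print(round(k, 4))
```

Comparing the fitted rate for the same amide in the free form and in the complex gives a simple measure of how strongly that amide is protected.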

Structure calculation

The experimentally determined restraints can be used as input for the structure calculation process. Researchers, using computer programs such as CYANA or XPLOR-NIH,^[6] attempt to satisfy as many of the restraints as possible, in addition to general properties of proteins such as bond lengths and angles. The algorithms convert the restraints and the general protein properties into energy terms, and then try to minimize the energy. The process results in an ensemble of structures that, if the data were sufficient to dictate a certain fold, will converge.
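The conversion of restraints into energy terms can be illustrated with a simple penalty function for upper-bound distance restraints: a restraint contributes nothing while it is satisfied and a quadratic penalty once the distance exceeds its bound. This is a generic sketch, not the functional form used by CYANA or XPLOR-NIH; the force constant, coordinates and restraints are made up.

```python
import math

def restraint_energy(coords, restraints, k=1.0):
    """Sum of k * (d - upper)**2 over violated upper-bound restraints.

    coords: dict of atom id -> (x, y, z) in angstroms.
    restraints: list of (atom_i, atom_j, upper_bound) tuples.
    """
    energy = 0.0
    for i, j, upper in restraints:
        d = math.dist(coords[i], coords[j])
        if d > upper:
            energy += k * (d - upper) ** 2
    return energy

# Hypothetical three-atom example.
coords = {0: (0.0, 0.0, 0.0), 1: (3.0, 4.0, 0.0), 2: (0.0, 0.0, 2.0)}
restraints = [
    (0, 1, 4.0),  # actual distance 5.0, violated by 1.0
    (0, 2, 5.0),  # actual distance 2.0, satisfied
]
energy = restraint_energy(coords, restraints)
print(energy)
```

A structure calculation adds terms of this kind to terms for bond lengths, angles and other general properties, and then searches for coordinates that lower the total.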

Dynamics

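In addition to structures, nuclear magnetic resonance can yield information on the dynamics of various parts of the protein. Techniques based on relaxation measurements of deuterium have recently been developed, which enable systematic studies of the motions of the amino acid side chains in proteins.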

NMR spectroscopy on large proteins

Traditionally nuclear magnetic resonance spectroscopy has been limited to relatively small proteins or protein domains. This is in part caused by problems resolving overlapping peaks in larger proteins, but this has been alleviated by the introduction of isotope labelling and multidimensional experiments. Another, more serious problem is that in large proteins the magnetization relaxes faster, which means there is less time to detect the signal. This in turn causes the peaks to become broader and weaker, and eventually to disappear. Two techniques have been introduced to attenuate the relaxation: transverse relaxation optimized spectroscopy (TROSY)^[7] and deuteration^[8] of proteins. Using these techniques it has been possible to study proteins in complex with the 900 kDa chaperone GroES-GroEL.^[9]
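The link between faster relaxation and broader peaks can be made quantitative for an idealised Lorentzian line, whose full width at half maximum is 1/(pi T2), where T2 is the transverse relaxation time. The T2 values below are illustrative assumptions, not measurements for any particular protein.

```python
import math

def lorentzian_linewidth_hz(t2_seconds: float) -> float:
    """Full width at half maximum (Hz) of a Lorentzian line with
    transverse relaxation time T2 (seconds)."""
    if t2_seconds <= 0:
        raise ValueError("T2 must be positive")
    return 1.0 / (math.pi * t2_seconds)

# Assumed T2 values: a tenfold shorter T2 gives a tenfold broader line.
for t2 in (0.050, 0.005):
    print(t2, round(lorentzian_linewidth_hz(t2), 1))
```

Because the peak area is conserved, a broader line is also a lower one, so fast-relaxing signals sink into the noise as well as into each other.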

Automation of the process

Structure determination by NMR has traditionally been a time-consuming process, requiring interactive analysis of the data by a trained scientist. There has been considerable interest in automating the process to increase the throughput of structure determination (see structural genomics). The two most time-consuming steps are the resonance assignment and the NOE assignment. Several different computer programs have been published that perform these steps automatically.^[10]^[11] Efforts have also been made to standardize the structure calculation protocol to make it quicker and more amenable to automation.^[12]

References

Citations

Protein structure determination in solution by NMR spectroscopy. Wuthrich K. J Biol Chem. 1990 December 25;265(36):22059-62.
Automated NMR structure calculation with CYANA. Guntert P. Methods Mol Biol. 2004;278:353-78.
ARIA2: automated NOE assignment and data integration in NMR structure calculation. Rieping W, Habeck M, Bardiaux B, Bernard A, Malliavin TE, Nilges M. Bioinformatics. 2007;23:381-382.
An efficient 3D NMR technique for correlating the proton and 15N backbone amide resonances with the alpha-carbon of the preceding residue in uniformly 15N/13C enriched proteins. Bax A, Ikura M. J Biomol NMR. 1991 May;1(1):99-104.
Residual dipolar couplings in protein structure determination. de Alba E, Tjandra N. Methods Mol Biol. 2004;278:89-106.
The Xplor-NIH NMR molecular structure determination package. Schwieters CD, Kuszewski JJ, Tjandra N, Clore GM. J Magn Reson. 2003 Jan;160(1):65-73.
Attenuated T2 relaxation by mutual cancellation of dipole-dipole coupling and chemical shift anisotropy indicates an avenue to NMR structures of very large biological macromolecules in solution. Pervushin K, Riek R, Wider G, Wuthrich K. Proc Natl Acad Sci U S A. 1997 November 11;94(23):12366-71.
Effect of deuteration on the amide proton relaxation rates in proteins. Heteronuclear NMR experiments on villin 14T. Markus MA, Dayie KT, Matsudaira P, Wagner G. J Magn Reson B. 1994 Oct;105(2):192-5.
NMR analysis of a 900K GroEL GroES complex. Fiaux J, Bertelsen EB, Horwich AL, Wuthrich K. Nature. 2002 July 11;418(6894):207-11.
Guntert P. See above.
Rieping W. See above.
NMR data collection and analysis protocol for high-throughput protein structure determination. Liu G, Shen Y, Atreya HS, Parish D, Shao Y, Sukumaran DK, Xiao R, Yee A, Lemak A, Bhattacharya A, Acton TA, Arrowsmith CH, Montelione GT, Szyperski T. Proc Natl Acad Sci U S A. 2005 July 26;102(30):10487-92.

General references

Gordon S. Rule, T. Kevin Hitchens (2006). "Fundamentals of Protein NMR Spectroscopy". Springer. ISBN 1-4020-3499-7. http://www.springer.com/1-4020-3499-7
Quincy Teng (2005). "Structural Biology, Practical NMR Applications". Springer. ISBN 0-387-24367-4
John Cavanagh, Wayne J. Fairbrother, Arthur G. Palmer III, Nicholas J. Skelton, (1995). Protein NMR Spectroscopy: Principles and Practice. Academic Press. ISBN 0-12-164490-1.
Kurt Wuthrich (1986) NMR of Proteins and Nucleic Acids . Wiley-Interscience. ISBN 0-471-82893-9