|
FAST-NMR - Functional
Annotation Screening
Technology by NMR |
|
|
The Function of a Protein is Intrinsically Linked to the Identity of its Active-Site and Ligand FAST-NMR combines structural biology and NMR ligand affinity screens with bioinformatics to assign a function to a hypothetical protein or a protein of unknown function. This is based on basic tenets of biochemistry where proteins with similar functions will have similar active sites and exhibit similar ligand binding interactions, despite global differences in sequence and structure. Protein-ligand interactions are determined through a tiered NMR screen using a library composed of compounds with known biological activity. A rapid co-structure is determined by combining the experimental identification of the ligand-binding site from NMR chemical shift perturbations with the protein-ligand docking program AutoDock. Our CPASS (Comparison of Protein Active Site Structures) software and database is then used to compare this active site with proteins of known function. Our functional chemical library is first screened as mixtures using a 1D NMR line-broadening screen to rapidly identify binders. The relative change in peak intensity (line-width) is used to measure a semi-quantitative dissociation constant (KD) that allows the ligands to be ranked. Only the ligands that exhibit a positive binding interaction in the 1D NMR line-broadening screen are further screened as singletons by measuring chemical shift changes in a 2D 1H-15N HSQC spectrum. The observation of a consensus clustering of residues that experienced a chemical shift perturbation (CSP) on the molecular surface of the protein confirms a specific, stoichiometric binding interaction for the ligand and the ligand-defined binding site or the proteins active-site/functional-site. The CSPs are used in combination with AutoDock to rapidly determine (~30-45 min.) a protein-ligand co-structure. THe co-structure is then used as input for CPASS to identify potential functional homologs based on the sequence and structure similarity of ligand-defined binding sites (higher information content then global sequence or structure similarity searches). Approximately 50% of the genome of multiple organisms contain proteins of unknown function, where 30-50% of these proteins are amenable to NMR analysis.
In cases where a structure and/or NMR assignments of a hypothetical protein is unavailable, FAST-NMR can provide functional information by the similarity in the ligand-binding profile. Comparable to sequence homology, the relative KD measured for each ligand against a protein of unknown function can be compared against this same list of binding affinities for proteins of known function. A similarity score can be obtained based on the relative differences in the individual KD values. A function can then be implied based on a high similarity in the ligand binding profiles. Also, the output of FAST-NMR is a protein-ligand co-structure, which may provide the starting point for a structure-based drug design effort. This is especially true since the compounds in the functional chemical library have "drug-like" characteristics.
K. A. Mercier, M. Baran,
V. Ramanathan, P. Revesz, R. Xiao, G. T. Montelione and R. Powers
(2006) “FAST-NMR -Functional
Annotation Screening Technology Using NMR.”
Journal of the American
Chemical Society, 128(47):15292-15299. Flow-Diagram of FAST-NMR process Functional Assignments for hypothetical protein SAV1430 from S. aureus Sequence alignment of SAV1430 with NifU Hypothetical Proteins from Structural Genomics Table comparing FAST-NMR to other methods Percentage of bacterial proteomes that correspond to hypothetical proteins |