- Scoring functions for docking
In the fields of
computational chemistry andmolecular modelling , scoring functions are fast approximate mathematical methods used to predict the strength of thenon-covalent interaction (also referred to as binding affinity) between two molecules after they have been docked. Most commonly one of the molecules is a small organic compound such as adrug and the second is the drug's biological target such as aprotein receptor.cite journal | author = Jain AN | title = Scoring functions for protein-ligand docking | journal = Curr. Protein Pept. Sci. | volume = 7 | issue = 5 | pages = 407–20 | year = 2006 | pmid = 17073693 | doi = 10.2174/138920306778559395 ] Scoring functions have also been developed to predict the strength of other types ofintermolecular interactions, for example between two proteinscite journal | author = Lensink MF, Méndez R, Wodak SJ | title = Docking and scoring protein complexes: CAPRI 3rd Edition | journal = Proteins Structure Function and Bioinformatics| volume = 69| issue = | pages = 704| year = 2007 | pmid = 17918726 | doi = 10.1002/prot.21804 ] or between protein andDNA .cite journal | author = Robertson TA, Varani G | title = An all-atom, distance-dependent scoring function for the prediction of protein-DNA interactions from structure | journal = Proteins | volume = 66 | issue = 2 | pages = 359–74 | year = 2007 | pmid = 17078093 | doi = 10.1002/prot.21162 ]Utility
Scoring functions are particularly useful for drug and other types of molecular design where speed is more important than high accuracy. These application include:cite journal | author = Rajamani R, Good AC | title = Ranking poses in structure-based lead discovery and optimization: current trends in scoring function development | journal = Current opinion in drug discovery & development | volume = 10 | issue = 3 | pages = 308–15 | year = 2007 | pmid = 17554857 | issn = ]
*Virtual screening of small molecule databases of candidate ligands to identify novel small molecules that bind to a protein target of interest and therefore are useful starting points fordrug discovery cite journal | author = Seifert MH, Kraus J, Kramer B | title = Virtual high-throughput screening of molecular databases | journal = Current opinion in drug discovery & development | volume = 10 | issue = 3 | pages = 298–307 | year = 2007 | pmid = 17554856 | issn = ]
* De novo design (design "from scratch") of novel small molecules that bind to a protein targetcite journal | author = Böhm HJ | title = Prediction of binding constants of protein ligands: a fast method for the prioritization of hits obtained from de novo design or 3D database search programs | journal = J. Comput. Aided Mol. Des. | volume = 12 | issue = 4 | pages = 309–23 | year = 1998 | month = July | pmid = 9777490 | doi = 10.1023/A:1007999920146 | url = ]
*Lead optimization of screening hits to optimize their affinity and selectivitycite journal | author = Joseph-McCarthy D, Baber JC, Feyfant E, Thompson DC, Humblet C | title = Lead optimization via high-throughput molecular docking | journal = Current opinion in drug discovery & development | volume = 10 | issue = 3 | pages = 264–74 | year = 2007 | pmid = 17554852 | issn = ]A potentially more reliable but much more computationally demanding alternative to scoring functions are
free energy perturbation calculations.cite journal | author = Foloppe N, Hubbard R | title = Towards predictive ligand design with free-energy based computational methods? | journal = Curr. Med. Chem. | volume = 13 | issue = 29 | pages = 3583–608 | year = 2006 | pmid = 17168725 | url = | doi = 10.2174/092986706779026165 ]Prerequisites
Scoring functions are normally parameterized (or trained) against a data set consisting of experimentally determined binding affinities between molecular species similar to the species that one wishes to predict.
For predictions of affinities of ligands for proteins the following must first be known or predicted:
* Proteintertiary structure – arrangement of the protein atoms in three dimensional space. Protein structures may be determined by experimental techniques such asX-ray crystallography or solution phaseNMR methods or predicted byhomology modelling .
* Ligand active conformation – three dimensional shape of the ligand when bound to the protein
* Binding-mode – orientation of the two binding partners relative to each other in the complexThe above information yields the three dimensional structure of the complex. Based on this structure, the scoring function can then estimate the strength of the association between the two molecules in the complex using one of the methods outlined below. Finally the scoring function itself may be used to help predict both the binding mode and the active conformation of the small molecule in the complex.
Classes
There are three general classes of scoring functions:
* Force field – affinities are estimated by summing the strength of intermolecular van der Waals and
electrostatic interactions between all atoms of the two molecules in the complex. The intramolecular energies (also referred to asstrain energy ) of the two binding partners are also frequently included. Finally since the binding normally takes place in the presence of water, the desolvation energies of the ligand and of the protein are sometimes taken into account usingimplicit solvation methods such as GBSA or PBSA.
* Empirical – based on counting the number off various types of interactions between the two binding partners.cite journal | author = Böhm HJ | title = Prediction of binding constants of protein ligands: a fast method for the prioritization of hits obtained from de novo design or 3D database search programs | journal = J. Comput. Aided Mol. Des. | volume = 12 | issue = 4 | pages = 309–23 | year = 1998 | pmid = 9777490 | doi = 10.1023/A:1007999920146 ] Counting may be based on the number of ligand and receptor atoms in contact with each other or by calculating the change in solvent accessible surface area (ΔSASA) in the complex compared to the uncomplexed ligand and protein. The coefficients of the scoring function are usually fit using multiple linear regression methods. These interactions terms of the function may include for example:
** number ofhydrogen bond s (favorable electrostatic contribution to affinity),
** hydrophobic — hydrophobic contacts (favorable),
**hydrophilic — hydrophobic contacts (unfavorable),
** number of rotatable bonds immobilized in complex formation (unfavorableentropic contribution).
* Knowledge – based on statistical observations of intermolecular close contacts in large 3D databases (such as theCambridge Structural Database orProtein Data Bank ) which are used to derive "potentials of mean force". This method is founded on the assumption that close intermolecular interactions between certain types of atoms or functional groups that occur more frequently than one would expect by a random distribution are likely to be energetically favorable and therefore contribute favorably to binding affinity.cite journal | author = Muegge I | title = PMF scoring revisited | journal = J. Med. Chem. | volume = 49 | issue = 20 | pages = 5895–902 | year = 2006 | pmid = 17004705 | doi = 10.1021/jm050038s ]Finally hybrid scoring functions have also been developed in which the components from two or more of the above scoring functions are combined into one function.
References
Wikimedia Foundation. 2010.