ASSIGNMENT NO. I
TABLE OF CONTENTS
1 INTRODUCTION
2 REVIEW OF LITERATURE
3 STATISTICAL CONCEPTS
6 APPLICATION OF QSAR
7 SUMMARY
8 REFERENCES
INTRODUCTION
between the two. The mathematical expression can then be used to predict the biological
response.
• The different physical properties that influence biological activity and use of those
properties in the development of mathematical models that relate the physical properties
activity of a molecular system and its geometric and chemical characteristics and thus it
properties.
REVIEW OF LITERATURE
The description of quantitative structure-activity relationship (QSAR) models has been a topic
for scientific research for more than 40 years and a topic within the regulatory framework for
more than 20 years. At present, efforts on QSAR development are increasing because of their
promise for supporting reduction, refinement, and/or replacement of animal toxicity experiments.
However, their acceptance in risk assessment seems to require a more standardized and scientific
underpinning of QSAR technology to avoid possible pitfalls. For this reason, guidelines for
QSAR model development recently proposed by the Organization for Economic Cooperation and
Development (OECD) [OECD (2007) Guidance document on the validation of (quantitative)
structure-activity relationships [(Q)SAR] models. OECD Environment Health and Safety
Publications: Series on Testing and Assessment No. 69, Paris] are expected to help increase the
acceptability of QSAR models for regulatory purposes. The guidelines recommend that QSAR
models should be associated with (i) a defined end point, (ii) an unambiguous algorithm, (iii) a
defined domain of applicability, (iv) appropriate measures of goodness-of-fit, robustness, and
predictivity, and (v) a mechanistic interpretation, if possible [OECD (2007)]. The present
perspective provides an overview of these guidelines for QSAR
model development and their rationale, as well as the promises and pitfalls of using QSAR
approaches and these guidelines for predicting metabolism and toxicity of new and existing
chemicals.
MbtA (a salicyl AMP ligase) is a key target for the design of new antitubercular agents. On the
basis of structure-activity relationship (SAR) data generated in our laboratory, a structure-based
model is developed to predict the binding affinities of aryl acid-AMP bisubstrate inhibitors of
MbtA. The approach described takes advantage of the linear interaction energy (LIE) technique
to derive linear equations relating ligand structure to function. With only two parameters derived
from molecular dynamics simulations, good correlation (R² = 0.70) was achieved for a set of
31 inhibitors with binding affinities spanning 6 orders of magnitude. The results were applied to
understand the effect of steric and heteroatom substitutions on bisubstrate ligand binding and to
predict second generation inhibitors of MbtA. The resulting model was further validated by
chemical synthesis of a novel inhibitor with a predicted LIE binding affinity of 1.6 nM and a
subsequently determined experimental Ki(app) of 0.7 nM.
STATISTICAL CONCEPTS
• In the simplest situation, a range of compounds is synthesized in order to vary one
physicochemical property (e.g. log P) and to test how this affects the biological activity (log
1/C). A graph is then drawn with the biological activity on the y axis and the
physicochemical property on the x axis. It is then necessary to draw the best possible line
through the data points on the graph. This is done by a procedure known as “linear
regression analysis by the least squares method”.
• Because the data points are scattered on either side of the line, the closeness of the fit is
measured by drawing a vertical line from each point to the regression line. These vertical
distances are measured and then squared, and the squares are added up to give a total. The
best line through the points is the line for which this total is a minimum.
• The equation of the straight line is y = kx + k’, where k and k’ are constants. By varying
k and k’, different lines are obtained until the best line is found.
• The quality of the resulting equation is then checked with the regression (correlation)
coefficient, r. For a perfect fit r² = 1. Good fits have values of 0.95 or above.
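The least squares procedure described above can be sketched in a few lines of Python. This is only a minimal illustration: the function name `least_squares_line` and the log P and log (1/C) values are hypothetical, chosen to show how k, k’ and r² are obtained.

```python
def least_squares_line(xs, ys):
    """Return slope k, intercept k' and r^2 for the best line y = k*x + k'."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of squares and cross-products about the means
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    k = sxy / sxx
    k_prime = mean_y - k * mean_x
    # Total of the squared vertical distances from the fitted line
    ss_res = sum((y - (k * x + k_prime)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    r_squared = 1.0 - ss_res / ss_tot
    return k, k_prime, r_squared


# Hypothetical data: one physicochemical property against biological activity
log_p = [0.5, 1.0, 1.5, 2.0, 2.5]
log_inv_c = [1.1, 1.4, 2.1, 2.4, 3.0]

k, k_prime, r_squared = least_squares_line(log_p, log_inv_c)
print(f"log(1/C) = {k:.2f} * log P + {k_prime:.2f}, r^2 = {r_squared:.3f}")
```

The closed-form expressions for k and k’ give the same line that would be found by varying the two constants until the total of squared distances is a minimum.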
Many physical, chemical and structural properties have been studied by the
QSAR approach, but the most commonly studied are hydrophobic, electronic, and steric
properties. This is because it is possible to quantify these effects relatively easily.
In particular, hydrophobic properties are easily quantified for complete molecules or
for individual substituents, whereas electronic and steric properties are more difficult to
quantify, and quantification is only really feasible for individual substituents.
The three most studied physicochemical properties will now be considered in more detail:
a) Hydrophobicity
• The hydrophobic character of a drug is crucial in determining how easily it crosses cell
membranes and may also be important in receptor interactions. Changing substituents on a
drug may have significant effects on its hydrophobic character and hence its biological
activity.
• The affinity of a compound for a lipid (hydrophobic) environment can be quantified using
octanol-water partition coefficients, which yield hydrophobic substituent constants (π).
Equations for the determination of the partition coefficient, P, and the hydrophobicity
parameter, πx:
$$P = \frac{[\text{compound}]_{\text{octanol}}}{[\text{compound}]_{\text{water}}}$$
$$\pi_x = \log P_x - \log P_H = \log\left(\frac{P_x}{P_H}\right)$$
where Px is the partition coefficient of the substituted compound x and PH is the partition
coefficient of the parent (unsubstituted) compound.
• The larger P, the more hydrophobic (nonpolar, lipophilic) the compound; likewise, the larger
and more positive the value of π, the more hydrophobic the substituent. For a hydrophilic (polar,
lipophobic) substituent, Px is less than PH, so the ratio Px/PH is a fraction and π is
negative.
• The log P value of a substituted compound can then be estimated by adding the πx values of
the individual substituents to the log P of the parent compound.
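As a hedged illustration of the definition of πx and of the additive estimate, the sketch below uses commonly quoted textbook log P values for benzene, chlorobenzene and benzamide. These numbers are not taken from this assignment and should be treated as illustrative; the function name `pi_constant` is likewise hypothetical.

```python
def pi_constant(log_p_substituted, log_p_parent):
    """Hydrophobic substituent constant: pi_x = log P_x - log P_H."""
    return log_p_substituted - log_p_parent


# Approximate textbook values (assumed for illustration only)
LOG_P_BENZENE = 2.13
LOG_P_CHLOROBENZENE = 2.84
LOG_P_BENZAMIDE = 0.64

pi_cl = pi_constant(LOG_P_CHLOROBENZENE, LOG_P_BENZENE)   # positive: hydrophobic
pi_conh2 = pi_constant(LOG_P_BENZAMIDE, LOG_P_BENZENE)    # negative: hydrophilic

# Additive estimate for a benzene ring carrying both substituents
estimated_log_p = LOG_P_BENZENE + pi_cl + pi_conh2

print(f"pi(Cl) = {pi_cl:.2f}, pi(CONH2) = {pi_conh2:.2f}")
print(f"estimated log P = {estimated_log_p:.2f}")
```

The estimate ignores any interaction between the two substituents, which is why measured values can differ from the simple sum.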
Calculation of log P: substituent contributions to log P are not strictly additive, because the
substituents interact with one another when they are added to the parent compound. Equation for
the calculation of log P using fragmental constants and interaction factors:
$$\log P = \sum_i f_i + \sum_i F_i$$
In the equation, fi are “fragmental constants” associated with chemical substituents (i.e. these are
the π values) and Fi are “interaction factors” that account for intramolecular electronic, steric or
hydrogen-bonding interactions between the substituents, which make the contributions of those
substituents non-additive. This or similar approaches are now widely used, and the values of
fi and Fi are known for a huge number of substituents and pairs of substituents, respectively. This
approach avoids having to perform experiments to determine P values for all compounds in a
QSAR study.
Example (o-fluorotoluene):
fi of methyl = 0.6
fi of aromatic fluorine = -0.4
Fi for a fluorine atom ortho to a methyl group = -0.3
Therefore, log P(o-fluorotoluene) = 2.5 + 0.6 + (-0.4) + (-0.3) = 2.4
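The o-fluorotoluene sum can be reproduced with a short script. This is a sketch only: the function name `estimate_log_p` is hypothetical, and the starting value 2.5 is taken as given in the sum above, without assuming where it comes from.

```python
def estimate_log_p(base_value, fragment_constants, interaction_factors):
    """Add fragmental constants (f_i) and interaction factors (F_i) to a base value."""
    return base_value + sum(fragment_constants) + sum(interaction_factors)


# Values as quoted in the worked example above
base_value = 2.5                     # first term of the sum in the example
fragment_constants = [0.6, -0.4]     # methyl, aromatic fluorine
interaction_factors = [-0.3]         # fluorine ortho to methyl

log_p = estimate_log_p(base_value, fragment_constants, interaction_factors)
print(f"estimated log P (o-fluorotoluene) = {log_p:.1f}")
```

Keeping the interaction factors in their own list makes it easy to see how much of the estimate comes from the non-additive correction.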
Multiple linear regression analysis allows for solving equations with multiple variables
(parameters) as required to include π with σ terms as in the following relationship-
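The relationship itself is not reproduced here. As an assumption, the sketch below uses the simplest linear form consistent with the description, log (1/C) = k1·π + k2·σ + k3, and fits it by ordinary least squares in pure Python. The descriptor values, the coefficients 0.8, 1.2 and 3.0 used to generate the activities, and all function names are hypothetical.

```python
def solve_linear_system(a, b):
    """Solve a*x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - tail) / m[r][r]
    return x


def fit_multiple_regression(descriptors, activities):
    """Least-squares coefficients [k1, ..., kn, constant] via the normal equations."""
    rows = [list(d) + [1.0] for d in descriptors]   # append the constant term
    p = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(p)] for i in range(p)]
    xty = [sum(r[i] * y for r, y in zip(rows, activities)) for i in range(p)]
    return solve_linear_system(xtx, xty)


# Hypothetical (pi, sigma) pairs for six substituents
descriptors = [(0.0, 0.0), (0.56, -0.17), (0.71, 0.23),
               (-0.28, 0.78), (-0.02, -0.27), (0.88, 0.54)]
# Synthetic activities generated from assumed coefficients, for illustration
activities = [0.8 * p + 1.2 * s + 3.0 for p, s in descriptors]

k1, k2, k3 = fit_multiple_regression(descriptors, activities)
print(f"log(1/C) = {k1:.2f} pi + {k2:.2f} sigma + {k3:.2f}")
```

Because the activities here are generated from the assumed equation, the fit simply recovers those coefficients; with real data the residual scatter and r² would need to be checked as for the single-variable case.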
b) Electronic effects
The electronic effects of substituents clearly influence a drug's ionization and polarity.
This in turn may affect how easily a drug can pass through cell membranes or how
strongly it can bind to a receptor. For substituents on an aromatic ring, the
measure used is the Hammett substituent constant, which is given the symbol σ:
$$\sigma_x = \log\left(\frac{K_x}{K_H}\right)$$
In this equation, KH is the equilibrium or rate constant for the parent (unsubstituted) compound
and Kx is the equilibrium or rate constant for the derivative; these values are measured
experimentally.
Positive σx: chemical groups that withdraw electrons from the ring (e.g. -NO2) favor the
anion, thereby increasing K.
Figure 2 :- Equilibrium between the unionized and ionized forms of benzoic acid and the
definition of Ka
In the given example, electron-withdrawing groups shift the equilibrium to the right: the
substituent pulls electron density out of the ring, thereby decreasing the electron density on
the carboxylate and making formation of the negatively charged species less unfavorable.
Shifting the equilibrium to the right increases Kx, such that Kx/KH is greater than 1 and
log (Kx/KH) is positive, yielding a positive σ. In the case of para substituents that have
resonance structures, the resonance structures distribute the negative charge over
the entire molecule, so that the negative species is much less unfavorable. Accordingly, this
leads to an increased Kx and, ultimately, a positive σ.
Negative σx: chemical groups that donate electrons into the ring (e.g. -OCH3) favor the neutral
species, thereby decreasing K.
Electron-donating groups increase the electron density of the ring and the carboxylate group,
thereby shifting the equilibrium in the above example to the left. This decreases Kx, such
that Kx/KH is less than 1 and log (Kx/KH) is negative, yielding a negative σ.
Substituent location on the ring affects the value of σ for the substituent. Meta and para σ values
for substituents are commonly used; however, ortho σ values are often unreliable due to direct
interactions of the ortho substituent with the functional group (e.g. the acid in benzoic acid).
For a process other than the ionization of benzoic acid, the Hammett relationship takes the form
log (Kx/KH) = ρσx, where ρ = 1 for the ionization of benzoic acid by definition.
The value of ρ is an indication of the influence of the electronic effect on the binding constant. If
ρ > 1 then the electronic contribution of substituents is greater than it is for the ionization of
benzoic acid. If ρ < 1 then the electronic contribution of substituents is less than it is for the
ionization of benzoic acid.
Note that ρ can be less than 0 (i.e. negative), indicating that the effect is opposite to that
occurring in the ionization of benzoic acid.
It should be noted that when the data from a training set yield a positive slope, then increasing
σx (adding electron-withdrawing substituents) will increase activity. Alternatively, when the data
from a training set yield a negative slope, then decreasing σx (adding electron-donating
substituents) will increase activity.
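The sign convention for σ can be checked numerically. The sketch below computes σx = log (Kx/KH) either from acid dissociation constants or, equivalently, from pKa values (σx = pKa(H) − pKa(X)). The pKa values are approximate literature values for benzoic acid and two para-substituted benzoic acids and are included only for illustration; the function names are hypothetical.

```python
import math


def sigma_from_constants(k_x, k_h):
    """Hammett constant from equilibrium constants: sigma = log10(Kx / KH)."""
    return math.log10(k_x / k_h)


def sigma_from_pka(pka_x, pka_h):
    """Same quantity expressed with pKa values: sigma = pKa(H) - pKa(X)."""
    return pka_h - pka_x


# Approximate pKa values in water (assumed, for illustration only)
PKA_BENZOIC_ACID = 4.20
approx_pka = {"4-NO2": 3.44, "4-OCH3": 4.47}

for substituent, pka in approx_pka.items():
    sigma = sigma_from_pka(pka, PKA_BENZOIC_ACID)
    effect = "electron-withdrawing" if sigma > 0 else "electron-donating"
    print(f"{substituent}: sigma = {sigma:+.2f} ({effect})")
```

A stronger acid (lower pKa) than benzoic acid gives a positive σ, and a weaker acid gives a negative σ, in line with the discussion above.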
c) Steric factors
• In order for a drug to interact with an enzyme or receptor, it has to approach, then bind to a
binding site. The bulk, size and shape of the drug may have an influence on this process.
• Taft hypothesized that the size of a chemical substituent would affect reaction rates and
equilibria (i.e. steric effects, Es). He investigated this via QSAR analysis of the rates of
base-catalyzed hydrolysis of aliphatic esters. This idea was an important step towards the
effective application of QSAR to biological systems.
Fig 4:- Reaction (ester hydrolysis) and equation used to define the Taft steric parameter, Es
Since the reaction involves nucleophilic attack on the carbonyl carbon by a hydroxide ion, the
presence of a substituent on the methyl group adjacent to the carbonyl carbon will hinder the
nucleophilic attack. Thus, bulky groups block access to the carbonyl carbon, thereby
slowing the reaction, making kx < kH in all cases, such that Es = log (kx/kH) is negative.
Therefore, all Es values are negative (or zero for hydrogen). Using this approach Taft, and others,
generated an extensive list of Es values that may be applied to other reactions and equilibria in
the same way σx values are used.
Molar refractivity, MR
Molar refractivity was originally proposed by Pauling and Pressman as a parameter for the
correlation of dispersion forces involved in the binding of haptens to antibodies. It is determined
from the refractive index, n, the molecular weight, MW, and the density of a crystal, d.
Equation for the molar refractivity can be given by:-
$$MR = \frac{n^2 - 1}{n^2 + 2} \cdot \frac{MW}{d}$$
Since the refractive index does not change significantly among organic molecules, the term is
dominated by MW and density. The larger the MW, the larger the steric effect, while the greater
the density, the smaller the steric effect (the molecules tend to pack better). A smaller MR for the
same MW indicates stronger interactions in the crystal (a larger density indicates better packing
due to stronger interactions), suggesting that the molecule may have stronger dispersion
interactions with its environment (e.g. a receptor).
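A minimal sketch of the molar refractivity calculation follows, assuming the Lorentz-Lorenz form MR = ((n² − 1)/(n² + 2)) · (MW/d). The function name is hypothetical, and the benzene values are approximate handbook figures for the liquid, used here only to show the order of magnitude.

```python
def molar_refractivity(n, mw, d):
    """MR = ((n^2 - 1) / (n^2 + 2)) * (MW / d), in cm^3/mol for MW in g/mol, d in g/cm^3."""
    return (n ** 2 - 1) / (n ** 2 + 2) * mw / d


# Approximate values for liquid benzene (assumed, for illustration only)
mr_benzene = molar_refractivity(n=1.501, mw=78.11, d=0.8765)
print(f"MR(benzene) is roughly {mr_benzene:.1f} cm^3/mol")

# Same MW and refractive index, higher density: smaller MR (better packing)
mr_denser = molar_refractivity(n=1.501, mw=78.11, d=1.0)
print(f"MR at d = 1.0 g/cm^3 is roughly {mr_denser:.1f} cm^3/mol")
```

The second call illustrates the point made above: with n and MW held fixed, a larger density lowers MR.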
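A Craig plot (a plot of one substituent parameter against another, e.g. σ against π) facilitates
the selection of substituents during a QSAR study, in particular substituents that vary widely in
one parameter but not the other.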
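Substituents are often grouped by the quadrant of the σ/π plane in which they fall. The sketch below does this for a few substituents using approximate literature values of π and para σ. These values are not taken from this assignment and should be treated as illustrative; the function name `craig_quadrants` is hypothetical.

```python
# Approximate (pi, sigma_para) values, assumed for illustration only
SUBSTITUENTS = {
    "CH3": (0.56, -0.17),
    "Cl": (0.71, 0.23),
    "CF3": (0.88, 0.54),
    "NO2": (-0.28, 0.78),
    "OCH3": (-0.02, -0.27),
    "OH": (-0.67, -0.37),
    "NH2": (-1.23, -0.66),
}


def craig_quadrants(substituents):
    """Group substituents by the signs of sigma and pi (the four Craig plot quadrants)."""
    quadrants = {"+sigma/+pi": [], "+sigma/-pi": [], "-sigma/+pi": [], "-sigma/-pi": []}
    for name, (pi, sigma) in substituents.items():
        key = ("+sigma" if sigma > 0 else "-sigma") + "/" + ("+pi" if pi > 0 else "-pi")
        quadrants[key].append(name)
    return quadrants


for quadrant, names in craig_quadrants(SUBSTITUENTS).items():
    print(f"{quadrant}: {', '.join(sorted(names))}")
```

Choosing test substituents from different quadrants helps to keep σ and π from being correlated with each other in the training set.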
Fig. 7 :- In CoMFA 3-D QSAR each grid voxel corresponds to two variables in QSAR
equation: steric and electrostatic. The PLS technique is applied to compute the coefficients.
PROBLEMS
o Superposition: the molecules must be optimally aligned.
o Flexibility of the molecules.
Application of QSAR
1. Classification
Researchers have attempted for many years to develop drugs based on QSAR. Easy access to
computational resources was not available when these efforts began, so attempts consisted
The examples below discuss the application of QSAR to drug design; some relied primarily on
statistical correlation and others on computer-based visualization and modeling. An early example
of QSAR in drug design involves a series of 1-(X-phenyl)-3,3-dialkyl triazenes.
These compounds were of interest for their anti-tumor activity, but they also were mutagenic.
QSAR was applied to understand how the structure might be modified to reduce the
mutagenicity without significantly decreasing the anti-tumor activity. Mutagenic activity was
evaluated in the Ames test, and from those data, the following QSAR was developed:
where C is the molar concentration required to give 30 revertants per 10^8 bacteria and σ+ is a
"through resonance" electronic parameter. From the equation, it is seen that the factors that favor
mutagenicity are increased lipophilicity and electron-donating substituents.
Studies of the anti-tumor activity were done against L1210 leukemia in mice. From the data, the
following QSAR was developed:
where C is the molar concentration of compound producing a 40% increase in life span of mice,
MR is molar refractivity, which is a measure of molecular volume, and EsR is a steric parameter
for the R group. Based on these equations, mutagenicity is more sensitive than anti-tumor
activity to the electronic effects of the substituents. Thus, electron-withdrawing substituents were
examined, as illustrated in the example below:
By substituting a sulfonamide group at the para position, the anti-tumor activity was reduced 1.2-
fold, whereas the mutagenicity was reduced by about 400-fold.
QSAR is used to predict the combination of steric, electronic and hydrophobic properties required
to achieve the desired activity (such predictions are generally interpolative in nature). Different
properties must be optimized for different types of drugs.
The aim is to maximize desirable activities while minimizing side and toxic effects (i.e. to improve
the therapeutic index). It may also be desirable to maximize the spectrum of a potential antibiotic
(i.e. the range of bacteria that the antibiotic is effective against). To do so, develop individual
QSARs for the different types of activity, using the same set of test compounds for each activity.
Then analyze the multiple QSARs simultaneously to find the ranges of physical properties that
maximize desirable activities and minimize undesirable activities.
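Analyzing two QSARs simultaneously can be sketched as a simple grid search. Everything in this block is hypothetical: the two linear equations, their coefficients and the descriptor ranges are invented for illustration and are not the published equations for any compound series.

```python
def desired_activity(log_p, sigma):
    """Hypothetical QSAR for the desired activity, log(1/C)."""
    return 4.0 + 0.2 * log_p - 0.3 * sigma


def undesired_activity(log_p, sigma):
    """Hypothetical QSAR for the undesired activity, log(1/C)."""
    return 5.5 + 1.0 * log_p - 1.5 * sigma


def best_property_combination(log_p_values, sigma_values):
    """Return the (log P, sigma) pair with the largest desired-minus-undesired margin."""
    candidates = [(lp, s) for lp in log_p_values for s in sigma_values]
    return max(candidates, key=lambda c: desired_activity(*c) - undesired_activity(*c))


# Search only inside an assumed range covered by the training compounds
log_p_grid = [i * 0.5 for i in range(7)]            # 0.0 to 3.0
sigma_grid = [-0.5 + j * 0.25 for j in range(7)]    # -0.5 to 1.0

best_log_p, best_sigma = best_property_combination(log_p_grid, sigma_grid)
print(f"best log P = {best_log_p}, best sigma = {best_sigma}")
```

Restricting the grid to the range spanned by the training set reflects the interpolative nature of QSAR predictions noted above.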
Example: To get a neutral compound into the CNS, log P must be close to 2.0; to keep it out of
the CNS, log P should stay well away from 2.
Sulmazole: the original compound in clinical trials, a cardiotonic agent, carried a 4-OMe group
and had a log P of 2.59 (close to the CNS "magic number"). During clinical trials some patients
reported seeing "bright visions." Therefore the 4-OMe was changed to a more polar 4-S(O)Me
group, giving sulmazole with a log P of 1.17; CNS absorption was decreased and the unwanted
side effect was eliminated. This is an example of bioisosteric replacement, where the interaction
of the functional group with the environment is maintained (i.e. hydrogen bond acceptor) but the
physical property (partition coefficient) changes.
An important aspect of the use of QSAR is the reduction of the use of animals in drug discovery:
QSAR allows compounds to be eliminated from further consideration prior to biological
testing, thereby minimizing the number of animals required.
4. Prediction of Activity
Predict the activity of an unknown molecule via QSAR: develop a QSAR for a "training set" of
compounds, then use the resulting mathematical relationship to predict the biological activity of
new compounds prior to their synthesis. This is most accurate for a congeneric series of compounds.
Congeneric series: a collection of structurally related compounds that differ primarily only in
their substituents. For example, benzene, amino-benzene and chloro-benzene would represent a
congeneric series; indole, however, would not be part of the series because it contains a
different ring system.
Fig. 10 :- Example of Congeneric series
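The train-then-predict workflow can be sketched as follows. This is a minimal illustration with hypothetical training data and hypothetical function names; the range check is a simple stand-in for a domain of applicability, flagging predictions that would require extrapolation beyond the training set.

```python
def fit_line(xs, ys):
    """Least-squares slope and intercept for y = k*x + k'."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    k = sxy / sxx
    return k, mean_y - k * mean_x


def predict_activity(k, k_prime, x, training_xs):
    """Return (predicted activity, True if x lies within the training range)."""
    within_range = min(training_xs) <= x <= max(training_xs)
    return k * x + k_prime, within_range


# Hypothetical training set for a congeneric series: log P against log(1/C)
train_log_p = [0.5, 1.0, 1.5, 2.0, 2.5]
train_activity = [1.1, 1.4, 2.1, 2.4, 3.0]
k, k_prime = fit_line(train_log_p, train_activity)

for new_log_p in (1.8, 4.0):
    predicted, within = predict_activity(k, k_prime, new_log_p, train_log_p)
    note = "interpolated" if within else "extrapolated, treat with caution"
    print(f"log P = {new_log_p}: predicted log(1/C) = {predicted:.2f} ({note})")
```

A compound from a different ring system, such as indole in the example above, would fall outside the series regardless of its log P, so a descriptor range check alone is not sufficient in practice.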
Summary:
The physical properties of drugs, in part, dictate their biological activity. In addition, the use of
descriptors of physical properties allows mathematical models to be applied to analyze
and predict drug activity. Thus QSAR is the study of the different physical properties that influence
biological activity, the use of those properties in the development of mathematical models that
relate the physical properties to biological activity, and the ways in which those models may be
used to understand and predict drug action. In determining quantitative structure-activity
relationships, three essential physicochemical properties are studied: hydrophobicity, electronic
effects and steric effects. QSAR is helpful in classification, diagnosis of the mechanism of drug
action, prediction of drug activity and lead compound optimization.
References:
Hansch, C., Leo, A., and Taft, R.W. (1991) A Survey of Hammett Substituent Constants
and Resonance and Field Parameters. Chem. Rev., 91: 165-195.
Hansch, C., Leo, A., and Hoekman, D. (1995) Exploring QSAR - Hydrophobic, Electronic,
and Steric Constants. American Chemical Society, Washington, D.C.
Venger, B.H., Hansch, C., Hatheway, G.J., and Amrein, Y.U. (1979) Ames Test of 1-(X-
Phenyl)-3,3-dialkyltriazenes. A Quantitative Structure-Activity Study. J. Med. Chem., 22:
473-476.
Kumar, K., King, R.W., and Carey, P.R. (1974) Carbonic Anhydrase - Aromatic
Sulfonamide Complexes, A Resonance Raman Study. FEBS Lett. 48: 283-287.
DesJarlais, R.L., Sheridan, R.P., Seibel, G.L., Dixon, J.S., Kuntz, I.D., and
Venkataraghavan, R. (1988) Using Shape Complementarity as an Initial Screen in
Designing Ligands for a Receptor Binding Site of Known Three-Dimensional Structure. J.
Med. Chem., 31: 722-729.
Web Resources:
http://www.netsci.org/Science/Compchem/feature11.html
http://www.statsoft.com/textbook/stmulreg.html
http://www.netsci.org/Science/Compchem/feature12.html
http://www.tdx.cesca.es/TESIS_UdG/AVAILABLE/TDX-1210104-133736//tags2de4.pdf
http://media.wiley.com/product_data/excerpt/03/04712709/0471270903.pdf