Development of Efficient Drug Analogs for Dutasteride through Insilico Modeling

Nitin  Sharma; Sandeep K.  Kushwaha; Pallavi  Chauhan; Madhvi  Shakya

Development of Efficient Drug Analogs for Dutasteride through Insilico Modeling

N Sharma, S Kushwaha, P Chauhan, M Shakya

Keywords

binding energy, drug analogs, dutasteride, molecular dynamics, side-effect

Citation

N Sharma, S Kushwaha, P Chauhan, M Shakya. Development of Efficient Drug Analogs for Dutasteride through Insilico Modeling. The Internet Journal of Medical Informatics. 2008 Volume 5 Number 1.

Abstract

Action of drug depends on the quantity of drug that reaches to the receptor and the degree of affinity between drug and receptor. Once drug bound to its receptor, it shows its desired effect (intrinsic activity) along with major and minor side-effects. More than 500 drugs with significant side-effects are available in market. In the present study the efforts have been made to identify new candidate compounds for dutasteride through modeled structure of two existing drug targets to improve efficacy by optimizing various parameters. Three dimensional structures of both drug targets of dutasteride were generated by MODELLER, validated by PROCHECK and ERRAT programs (90.04% of model-I and 84.647% of model-II). Energy minimization and molecular dynamics calculations were done through GROMACS using OPLS force field. RMSD calculated for both simulated model attain a constant deviation within 10,000 cycles. For analog generation, mono-substitution is preferred instead of multi-substitution. Objective and subjective both approaches were used for identification and calculation of chemical descriptors. A comparison of the calculated binding affinities for structurally similar inhibitors of dutasteride gave suitable analogs. Total 169 analogs were generated and eight are selected for comparative study. Our finding reveals that three dutasteride drug analogs (-CH₂NH₂, -CH₂OH, -CF₂OH) were most suitable analogs, showing theoretically more superior results. These findings need to be further evaluations in laboratory.

Introduction

Soft-computing techniques are concerned with approximate calculation, imprecision, analysis where human capability of making decisions is in dilemma. The recent developments in soft-computing techniques are expanding its scope in pharmaceutical industry. At present thousands of drugs are in market but many of them needed to improve efficacy (minimization of biological toxicity effects and side effects) [1, 2]. In the present study dutasteride (AVODART) drug for hair loss treatment is studied [3]. Dutasteride inhibits both isoforms of 5-alpha reductase, which are considered as drug targets. The most common adverse reactions of Dutasteride are impotence, decreased libido, breast disorders, and ejaculation disorders. Dutasteride belongs to: Drug type - small molecule, Drug category - Approved, Investigational, Anti-baldness agent and Antihyperplasia agents (http://www.drugbank.ca/). The PDB structures of both the drug targets are not available and additionally the molecular modeling of targets was a tougher task due to very low sequence similarity with known PDB templates.

In this paper an attempt has been made to model the tertiary structure of drug targets through multiple templates with incorporation of fold assignment and secondary structure information [4, 5]. After molecular dynamics calculations and model assessment, active site characterization, analogs generation, analog selection and descriptors calculations of analogs has been done through various offline and online softwares and tools. Furthermore to explore the efficacy of dutasteride, a comparative analysis of generated analogs has been performed with dutasteride.

Material & Methods

Dutasteride and its drug target information were obtained from drug bank. Dutasteride drug targets are existing in two isoform i.e. 3-oxo-5-alpha-steroid4-dehydrogenase 1(Drug target-I) and 3-oxo -5-alpha-steroid 4-dehydrogenase 2(Drug target-II) as reported in drug bank database. For these targets 290 and 292 SNPs are reported [6] at location chromosome-5 locus 5p15 and chromosome-2, locus-2p23 respectively.

Modeling of drug targets

The PDB structures of both drug targets are not available. So, modeling of both the target proteins were performed using MODELLER. A template search has been performed through BLAST and PSI-BLAST programs [7]. Global alignment method was used for comparison between the target-template sequences [4]. Gaps with variable gap penalty function are included for structural loops and core regions, in order to get maximum correspondence between the sequences. Alignment file for MODELLER was prepared by CLUSTALW [8]. Fold recognization was done through mGenThreader and LOMETS server for fold assignment [9]. Energy minimization of generated 3D-model was done through GROMACS (OPLS force field) by using Steepest Descent and Conjugate Gradient Algorithms [10]. Parameters like covalent bond distances and angles, stereochemical validation, atom nomenclature were validated using PROCHECK and overall quality factor of non-bonded interactions between different atoms types were measured by ERRAT program [11]. RMSD (root-mean-square deviation) and RMSF (Root Mean Square Fluctuation) was calculated for modeled structures. Functionally important residues (Active-site) were identified through comparative result of POCKETFINDER and SURFACE RACER 4.0.

Analog generation Characterization and Docking

Drug analogs were created through mono- substitution in the hydrophilic region on the target molecule with other functional groups (-I Group to +I Group) through CHEMSKETCH 10.0. Generated analogs were further analyzed by comparative study of molecular descriptors, various energies and QSAR studies [12, 13, 14, 15, and 16]. Drug-binding specificity of target to analogs was also performed with docking through AUTODOCK 4.0 [17, 18]. Mono-substitution studies have more relevant results and valuable in the effective analog development. Initially 169 analogs were generated from the reference compound (Dutasteride, Drug bank accession DB01126). Prediction and identification of potential drugs and non-drugs from generated analogs is most important task [19]. Objectives (Independent variables) as well as subjective (Dependent variables) approaches were used in identification and selections of chemical descriptors [20].

Toxicity Analysis of Selected analogs

Prediction and identification of potential drugs and non-drugs from generated analogs is most important task. Toxicity behavior of selected analogs was studied through PALLAS and different types of toxicity (i.e. Oncogen, Mutagen, Teratogen, Irritation, Immunotox and Neurotox) were reported in selected analogs on the basis of toxic fragments.

Drug likeness (Lipinski Rules)

The Lipinski “Rule of Five” states that compounds are likely to have good absorption and permeation in biological systems. Pharmaceutical industry is following this index for evaluation of drug health. We performed the lipinski evaluation for all the selected analogs.

Results & Discussions

Target protein modeling, model verification, active site characterization, analogs generation, docking and QSAR studies resulted through various offline and online tools are as follows.

Modeling Results

For modelling of both drug targets, template search were performed with the BLAST and PSI-BLAST and did not find single suitable PDB template for target protein sequence. Thermus Aquaticus Core Rna Polymerase-Rifampicin Complex (1I6V) and methanococcus Jannaschii phosphosulfolactate synthase (1QWG) are identified as templates for drug target-I. 1UTH and 2UYE were identified as templates for drug target-II along with other PDBs. Modelling of dutasteride drug targets was a tedious task due to very low sequence similarity and coverage. Three dimensional models of drug targets were generated through identified templates along with fold fitting. Fold recognization was done through mGenThreader and LOMETS server for fold assignment. Helices have dominance over other secondary structure i.e. sheet, coil in both generated model. Both modeled structure of drug target-I (Model-I) and drug target II (Model-II) is shown in figure-1(a&b).

Figure 1

Figure-1(a & b) Three dimensional visualization of the modeled structure of drug targets (Model-I & Model-II) through VMD software.

Energy minimization of generated 3D-model was done through GROMACS (OPLS force field by using steepest descent and conjugate gradient algorithms).The generated 3D model (Figure 1a and 1b) of target proteins was checked by Ramachandran plot (Figure-2a & 2b) through PROCHECK program.

Figure 2

Figure -2(a &b) shows torsion angles of and in the generated models through Ramachandran plot of generated model I & II

For model-I, 91.03% residues was lies in most favoured region whereas 88.09% residues were lies in most favoured region in model-II in Ramachandran plot .Besides very low sequence similarity and sequence coverage to PDB templates, the overall quality factor for drug target model-I (90.04%) and model-II (84.647) were reported through structure validation server ERRAT (Figure-3a & 3b).

Figure 3

Figure-3(a) shows the overall quality factor of generated model-I of drug target of dutasteride through structure validation server ERRAT.

Dark black regions were showing high error due to high structural variability (high transition of E, C and H) in generated models. Like –in model-I two dark places (100-120 and 245-259).

Figure 4

Figure 5

Figure-3(b) shows the overall quality factor of generated model-II of drug target of dutasteride through structure validation server ERRAT.

RMSD and RMSF - Reliable Indicators to Check Variability.

The molecular dynamics simulations have provided significant new information on the nature of proteins. RMSD measures the accuracy whereas dynamic fluctuations (RMSF) of proteins around their average conformations play an important indicator of many biological processes such as enzyme activity, macromolecular recognition, and complex formations [21].

Figure 6

Figure-4(a, b, c) Analyzing structural variability of model-I protein structures using RMSD

The changes in structural conformation were monitored in terms of RMSD and RMSF [5]. Figure -4(a, b, c) shows, for model-I RMSD after 10 ps time when calculated for Backbone-Backbone was 0.19, for Protein-Protein was 0.23. Further when the simulation cycles were increased to 50ps time RMSD for Backbone-Backbone was 0.22 and Protein-Protein was 0.26. Results clearly indicated that energy minimization should be done for few steps only. More energy minimization steps can cause more structural hindrance [22].

Figure 7

Figure-5(a, b and c) Analyzing structural variability of model-II protein structures using RMSD.

Figure -5(a, b, c) shows, model-II RMSD after 10 ps time (Backbone-Backbone (0.225), Protein-Protein (0.265) whereas RMSD after 50ps (Backbone-Backbone (0.3), Protein-Protein (0.265).

Figure 8

Figure-6 Residues wise (C) RMSF of model-I and model-II

Model-I has sequence length of 259 residues whereas model-II contains 254 residues. During the course of MD trajectory, coils are more prone to fluctuations than the alpha helices (figure-6). After modeling of target proteins, characterization of structure is another very important task for rational drug designing. Prediction of active site of target protein was done by comparative result of POCKET FINDER and SURFACE RACER-4.0 which is shown in table-1.

Figure 9

Table-1 Comparative results of softwares and analysis for Active-site prediction

Analog generation and optimization results

Drug analogs were created through mono- substitution in the hydrophilic region on the drug molecule with other functional groups (-I Group to +I Group) through CHEMSKETCH 10.0 and compatibility of analogs was checked through AUTODOCK 4.0 against modeled structures of target proteins(Target-I + Target-II) with binding energies.

Figure 10

Table-2 shows binding energy list of selected analogs and their best binding conformations

Figure 11

Table-3 Comparative results of the molecular descriptors of selected analogs.

Comparative results of the molecular descriptors of selected analogs are given table-3. It was observed that changes in molecular descriptors of selected analogs are under considerable range.

Toxicity prediction is most important criteria for new analog selection. Toxicity of a molecule is highly dependent upon its structural elements. Certain molecule segments are characteristic of hazardeous compounds and therefore called toxic segments. Toxicity prediction is therefore based on Segment body of chemical, Positive condition, Negative condition that can be used to predict certain health hazards of organic chemicals combining toxicological knowledge and judgment with the use of leading-edge QSAR artificial intelligence techniques. Chemical segment body of analog generated from dutasteride is alpha-beta unsaturated (thio) amide, Aliphatic Polyholide-1-2, Benzyl halide/sulphate, T_Har_Hat_hat08. Segment body of analogs shown in Table [4].

Figure 12

Table-4 Showing toxicity of analogs with reference compound by using PALLAS-3.112.

Figure 13

Graph-1 showing biological toxicity effects (%) of analogs with reference compound

These analogs are showing different type of biological toxicity effects (Oncogenicity, Mutagenicity, Teratogenicity, Membrane Irritation, Sensitization, Immunotoxicity, and Neurotoxicity). Level of toxic effects is high in three analogs (-CCL3, -CH3, -CCL2OH) which is shown in Graph-1

The Lipinski “Rule of Five” states that compounds are likely to have good absorption and permeation in biological systems. Pharmaceutical industry is following this index for evaluation of drug health.

Figure 14

Table-5 Showing Lipinski Status of designed analogs with the reference compound.

Conclusion

Results clearly indicate that before synthesis and biochemical testing of new analogs, one can use molecular mechanics based methods for qualitative assessment of relative binding affinities, toxicity and Lipinski evaluation to speed up drug discovery process by eliminating less potent compounds from synthesis. In present work efforts have been made to identify new candidate compounds for existing drug through modeled structure of drug targets. Both drug target of dutasteride were generated using MODELLER (Figure-1a&b) and validated by PROCHECK and ERRAT programs. For model-I, 91.03% residues were lying in most favoured region whereas 88.09% residues were lying in most favoured region in model-II in Ramachandran plot (Figure-2 a & b) which were good indicator of fitness of stereochemical quality of protein structures. Besides very low sequence similarity and sequence coverage to PDB templates, the overall quality factor for model-I (90.04%) and model-II (84.647) were reported through structure validation server ERRAT (Figure-3 a&b). Energy minimization and molecular dynamics calculations were done through GROMACS using OPLS force field. RMSD calculated for both simulated model has become almost constant within 10,000 cycles. A comparative analysis of RMSD and RMSF of model-I &II has been performed at 10ps and 50ps (Figure-4 &5). Results clearly indicated that energy minimization should be done for few steps only. More energy minimization steps can causes, more structural hindrance. Similar conclusion can be drawn for RMSF and it was observed that during the course of MD trajectory, coils are more prone to fluctuations than the alpha helices (figure-6). A comparative study of tools has been performed for active site of both models. ARG (187) and THR (225) were identified as active site residues for model-I whereas ALA (69), LEU (86) were identified for model-II. Binding affinities compatibility of analogs was checked through AUTODOCK 4.0 with model-I&II (Table-2). Total 169 analogs were generated and eight are selected for comparative study. The comparative results of the molecular descriptors of selected analogs are given in table-3. Toxicity of analogs with reference compound (Dutasteride) was checked by PALLAS-3.112 (Table-4 and Graph-1). Lipinski Status of designed analogs was checked (table-5). On the basis of binding energy, toxicity and Lipinski evaluation, three dutasteride drug analogs (-CH₂NH₂, -CH₂OH, -CF₂OH) are showing theoretically more superior results.

Future Directions

The present model can be further extended with some modifications, if necessary for analysis of other drug compounds which have more side effects. All these analysis and predictions are made on the basis of bioinformatics tools & techniques by statistical analysis. Theoretical calculation may varie from software to software but the protocol which is developed is up to mark for the new drug discovery. This knowledge will contribute positively to develop new drug analogs with more efficacies as well as less side effects.

Acknowledgement

We are grateful to Department of Bioinformatics, MANIT, Bhopal, India for support and cooperation.

References

r-0. [1] A. H. Salter. Large-scale databases in toxicogenomics. Pharmacogenomics, 6(7):749-754, 2005.
r-1. [2]A. M. Richard. DSSTox web site launch: Improving public access to databases for building structure-toxicity prediction models. Preclinica., 2(2):103-108, 2004.
r-2. [3] N. E. Levy, N. M. Bar, M. Frydman, E. Pras. Confirmation of the association between male pattern baldness and the androgen receptor gene. Eur. J. Dermatol., 15(5):339-40, 2005.
r-3. [4] A. Sali, T.L. Blundell. Comparative protein modelling by satisfaction of spatial restraints. In: Protein Structure by Distance Analysis. Ed: H. Bohr and S. Brunak. IOS Press, Amsterdam, 64-86., 1994.
r-4. [5] R. Ettrich, M. Melichercik, J. Teisinger, O. Ettrichova, R. Krumscheid, K. Hofbauerova, P. Kvasnicka, W. Schoner, E. Amler. Three-dimensional structure of the large cytoplasmic H-4-H-5loop of Na+/K+-ATPase deduced by restraint-based comparative modeling shows only one ATP binding site. J. Mol. Model., 7: 184-192, 2001.
r-5. [6] J. A. Ellis, M. Stebbing, S. B. Harrap. Polymorphism of the androgen receptor gene is associated with male pattern baldness. J Invest Dermatol. Mar; 116 (3):452-5, 2001.
r-6. [7] S.F. Altschul, L.M. Thomas, A.S. Alejandro, Z. Jinghui, Z. Zheng, M. Webb, J.L. David. Gapped BLAST and PSI BLAST: a new generation of protein database search programs. Nucleic Acids Res., 25 (17):3389–3402, 1997.
r-7. [8] K. B. Li. ClustalW-MPI: ClustalW analysis using distributed and parallel computing, Bioinformatics. 19:1585–1586, 2003.
r-8. [9] J. Bowie, R. Luthy, D. Eisenberg. A method to identify protein sequences that fold into a known three dimensional structure. Science., 253: 164-170, 1991.
r-9. [10] W.L. Jorgensen, R.J. Tirado. The OPLS Force Field for Proteins. Energy Minimizations for Crystals of Cyclic Peptides and Crambin. J. Am. Chem. Soc., 110:1657-1666, 1988.
r-10. [11] R. A. Laskowski, J. A. C. Rullmann, M.W. MacArthur, R. Kaptein , M. T. Janet. AQUA and PROCHECK-NMR: Programs for checking the quality of protein structures solved by NMR. Journal of Biomolecular NMR. 8, 477-486, 1996.
r-11. [12] J. Liu, J. Feng, S. Young. 2005. PowerMV: A software environment for statistical analysis, molecular viewing, descriptor generation and similarity search: National Institute of Statistical Sciences, Research Triangle Park, NC, USA, 2005. http://www.niss.org/PowerMV/.
r-12. [13] A. F. Fliri, W. T Loging, P. F. Thadelo, R. A. Volkmann. Biological spectra analysis: Linking biological activity profiles to molecular structure. PNAS, 102(2):261-266, 2005.
r-13. [14] S. Dimitrov, G. Dimitrova, T. Pavlov, N. Dimitrova, G. Patlewicz, J. Niemela, O. A. Mekenyan. Stepwise approach for defining the applicability domain of SAR and QSAR models. J. Chem. Inf. Model.,45: 839–849, 2005.
r-14. [15] N. Nikolova, J. Jaworska. Approaches to measure chemical similarity – a review. QSAR Comb. Sci., 22:1006–1026, 2004.
r-15. [16] P. Willett. Chemical Similarity Searching. J. Chem. Inf. Comput. Sci., 38:983-996, 1998.
r-16. [17] J. Ghuman, P. A. Zunszain, I. Petitpas, A. A. Bhat-tacharya, M. Otagiri, S. Curry. Structural basis of the drug-binding specicity of human serum albumin. J. Mol. Biol., 353(1):38-52, 2005.
r-17. [18] N. A. Kratochwil, W. Huber, F. Muller, M. Kansy, P. R. Gerber. Predicting plasma protein binding of drugs: a new approach. Biochem Pharmacol., 64(9):1355-1374, 2002.
r-18. [19] M. Wagener, V. Geerestein. Potential drugs and non-drugs: prediction and identification of important structural features. Journal of Chem. Inf. Comput. Sci., 40:280-292, 2000.
r-19. [20]Y. Liu. A comparative study on feature selection methods for drug discovery. Journal of Chem. Inf. Comput. Sci., 44:1823-1828, 2004.
r-20. [21] D. K. Bhattacharya, E. Clementi, W. Xue. Stochastic Dynamic Simulation of a Protein. Int. J. Quantum Chem., 42:1397-1408, 1992.
r-21. [22] Philip E. B. and Helge W. : Structural Boinformatics, Ist editions, pp-509-523,2003.

ISPUB.com

Internet
Scientific
Publications

Development of Efficient Drug Analogs for Dutasteride through Insilico Modeling

Keywords

Citation

Abstract

Introduction

Material & Methods

Modeling of drug targets

Analog generation Characterization and Docking

Toxicity Analysis of Selected analogs

Drug likeness (Lipinski Rules)

Results & Discussions

Modeling Results

Figure 1

Figure 2

Figure 3

Figure 4

Figure 5

RMSD and RMSF - Reliable Indicators to Check Variability.

Figure 6

Figure 7

Figure 8

Figure 9

Analog generation and optimization results

Figure 10

Figure 11

Figure 12

Figure 13

Figure 14

Conclusion

Future Directions

Acknowledgement

References

Author Information