2008 Virtual Screening For SHP-2 Specific Inhibitors Using Grid Computing

Virtual Screening for SHP-2 Specific Inhibitors Using Grid Computing
Simon X. Han (2), Marshall J. Levesque (2), Kohei Ichikawa (3), Susumu Date (1), Jason H. Haga (2)
(1) Cybermedia Center, Osaka University, Osaka, Japan
(2) Department of Bioengineering, University of California, San Diego, La Jolla, CA
(3) Research Institute of Socionetwork Strategies, Kansai University, Osaka, Japan
xhan@ucsd.edu, mlevesqu@ucsd.edu, ichikawa@ycss.kansai-u.ac.jp, date@ais.cmc.osakau.ac.jp, jhaga@bioeng.ucsd.edu
Abstract
SHP-2 is a protein tyrosine phosphatase (PTP) that plays an important role in many cellular functions such as development, growth, and death; thus SHP-2 has been hypothesized to play an important role in various diseases such as diabetes, neurodegeneration, and cancer. The individual roles of different PTPs are not well understood, and their study is complicated by the lack of specific inhibitors. In this study, we utilized the multi-institutional PRAGMA Grid computation resources to virtually screen the ZINC 7 database using the molecular docking software DOCK 6.2. Preliminary results suggest several SHP-2 specific inhibitors that can be further tested and validated under laboratory conditions. Complications encountered during these multiple virtual screenings on the grid, as well as potential improvements, are also discussed. These findings have future clinical significance in the creation of new drug therapies for the treatment of different diseases.
1. Introduction
SHP-2 is a ubiquitously expressed cytoplasmic PTP that contains two Src-homology-2 (SH2) domains, a catalytic PTP domain, and a C-terminal domain [6, 10, 11, 13]. Under non-stimulated conditions, the protein is in an inactive state, where the N-terminal SH2 domain blocks the catalytic site from being accessible [11]. When SHP-2 becomes active, the protein structure changes, exposing the catalytic site and allowing it to dephosphorylate other substrates [11]. Dephosphorylation of specific proteins can modulate cellular functions. SHP-2 has been found to be primarily a positive regulator in many different cellular functions including growth, death, and development [6, 8]. Some examples include enhancing the process of programmed cell death by dephosphorylating the STAT5 protein and promoting neural cell growth [6]. There is also evidence that SHP-2 plays a negative role in cellular functions [6]. Because of the important role of SHP-2 in cellular growth and death, it has implications in the progression of different diseases such as Alzheimer's disease, diabetes, and cancer [6]; however, the complexity of SHP-2 function makes it difficult to elucidate the signaling pathways that are regulated by SHP-2.
The objective of this study was to identify several potential inhibitors of SHP-2 function by performing grid-enabled virtual screening experiments with the crystal structure of SHP-2. Although the methods employed in this study were similar to those reported in a companion paper [20], the details of the results obtained and their biological significance are different. Complications of performing multiple, routine virtual screenings on the grid are also described and their potential solutions are discussed accordingly. The results of this study will provide important pharmacologic tools that will help to better understand SHP-2 function and provide promising leads to clinical treatments for various diseases.
2. Methods
This study employed virtual screening experiments to identify potential inhibitor compounds for a specific enzymatic target using molecular docking software. This method has proven to work successfully in drug discovery [12]. DOCK 6.2 was used to process a database of small compound structures and simulate their molecular interactions with the target protein structure [15]. A number of different scoring algorithms included in DOCK, such as the grid energy and AMBER scoring methods, were used since it has been shown that the most successful docking results are those that consult different scoring algorithms. The docking algorithms orient compound structures in the binding pocket of the protein molecule, and energy scores are calculated and assigned to the paired complex. These scores are used to rank the database of compounds, creating a list of potential inhibitor compounds ordered best to worst. An idealized experiment would screen an extensive chemical library of compounds with the most accurate docking and scoring methods available. However, more accurate scoring and larger databases require increasing amounts of computational resources. Deploying DOCK over grid resources makes this type of experiment a viable strategy for laboratories of any size.
2.1. Protein Crystal Structure

The three-dimensional crystal structure of the bound form of the SHP-2 protein is available in the Protein Data Bank (PDB ID: 3b7o) [4]. The crystal structure provides the chemical and structural representation of the protein binding site, which is used in the molecular docking software. The availability of the bound form of SHP-2 is especially important because the compounds are simulated to interact with the protein as it performs its enzymatic activity.

2.2. ZINC Database

The chemical compound database used in this study was the ZINC 7 database [1]. The ZINC 7 subsets screened were the drug-like (2,066,906 compounds) and lead-like (972,608 compounds) subsets. The databases are typically distributed to individual instances of DOCK in smaller compound lists called slices. These ready-made, dockable ligand files were provided and maintained by the University of California, San Francisco. The advantages of using ZINC were that the database is free and the compounds are commercially available for purchasing and testing in wet-bench experiments. The large number of compounds present in the database increases the probability of finding an inhibitor.

2.3. Grid Computing

The PRAGMA (Pacific Rim Application and Grid Middleware Assembly) Grid is a group of collaborative institutions spanning 15 countries and regions committing over 900 processors to research [2]. Although DOCK has built-in MPI for parallelization within a single computing cluster, it lacks inter-cluster communication capabilities. The detailed process of the whole virtual screening on PRAGMA Grid resources, including retrieval and storage of the input files, docking, and the subsequent gathering, summarizing, and analysis of the results has been described previously [18]. Briefly, a previously developed method of wrapping DOCK with the grid middleware application, Opal Op, to create accessible grid services solves many of the security, communication, and resource heterogeneity problems associated with grid computing. With the grid-enabled DOCK services tied together and automated with Perl scripts, an entire virtual screening experiment can be distributed from a central (master) cluster to be executed across the remote clusters making up the PRAGMA Grid [3, 18]. The simple and standardized software tools offer a highly flexible and customizable docking platform where the tremendous power and cost-efficiency of the grid can be utilized. However, the sheer number of compounds in the ZINC database and advanced docking methods can still take considerable resources. With this in mind, the screening was split into two phases to screen the database exhaustively and efficiently.

Table 1. Resources used
Cluster     Processors   Location
Rocks-52    28           SDSC, US
Tea01       80           Osaka U, JP
Cafe01      64           Osaka U, JP
Ocikbpra    32           U of Zurich, CH
Lzu         22           LanZhou U, CN

2.4. Input File Preparation

The SHP-2 protein molecule was prepared using Chimera [7] by adding hydrogens, calculating and adding charge, and removing solvents. The bound ligand that came with the crystal structure of SHP-2 was prepared by isolating it, adding hydrogens and charge, and saving it as a separate file [5]. This was required for all DOCK input ligands [3]. However, this step was omitted for ZINC compounds as they come in a DOCK-ready format. Using accessory programs included in the DOCK software package, a surface of the protein was generated and used to geometrically describe the SHP-2 binding site within 8.0 Å of the prepared ligand file. This generated 44 spheres in the binding site that characterized the site and were used to orient ligands [14]. For the first phase of the screening, the grid.nrg and grid.bmp files were generated with a grid_spacing value of 0.5 while keeping other parameters at their default values. For the second phase screening, the two grid files were re-generated using a grid_spacing value of 0.3. The smaller grid_spacing value used in the second phase resulted in more grid points and consequently more calculations in the docking algorithm, leading to a more resource-intensive screen. The AMBER scoring function was also used in the second phase of the screening and it requires many input files [14]. These files are generated prior to the execution of DOCK using automated scripts in the DOCK suite of programs.
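To give a sense of why the smaller grid_spacing described in Section 2.4 is more expensive, the sketch below counts lattice points in a rectangular box at both spacings. Only the spacing values 0.5 and 0.3 come from the text; the box dimensions are a hypothetical placeholder and not the box used in this study. For a fixed box the point count grows roughly as the inverse cube of the spacing, so the ratio approaches (0.5/0.3)^3, about 4.6, for large boxes.

```python
import math

def grid_points(box_dims, spacing):
    """Count lattice points in a rectangular box sampled at a uniform spacing."""
    # The small epsilon guards against floating-point results such as 99.999999.
    return math.prod(math.floor(d / spacing + 1e-9) + 1 for d in box_dims)

# Hypothetical box edge lengths in angstroms (assumption for illustration only).
box = (30.0, 30.0, 30.0)

coarse = grid_points(box, 0.5)  # first phase grid_spacing
fine = grid_points(box, 0.3)    # second phase grid_spacing
ratio = fine / coarse

print(coarse, fine, round(ratio, 2))
```

The ratio is only a rough proxy for cost, since docking time also depends on the sampling parameters and on the ligand.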
2.6. Flexible Energy Score

The energy scoring function in DOCK maps the energetic state of the binding site with a specified sampling spacing distance [17]. We chose the flexible ligand option for increased accuracy, as shown in other studies [9]. DOCK uses the anchor-and-grow algorithm, in which one part of the ligand is first identified as the anchor and the remaining parts are progressively added (grown) for the best fit [16]. Important DOCK parameters include pruning_max_orients, pruning_clustering_cutoff, simplex_anchor_max_iterations, and simplex_grow_max_iterations. The parameter pruning_max_orients indicates the number of anchor orientations to be used for the grow phase. pruning_clustering_cutoff is used to filter orientations: the lower the value, the more likely an orientation is to be removed, leaving fewer growth opportunities. The parameters simplex_anchor_max_iterations and simplex_grow_max_iterations limit the iterations of minimization the scoring algorithm uses when optimizing energy scores for each anchor and growth step [14].
The different sub-databases were screened in a two-phase screening as described in a previous grid-enabled DOCK virtual screening study [3]. Briefly, the first phase is designed to quickly screen the entire database to eliminate obviously unfit compounds. The compounds were screened using less stringent parameters, listed in Table 2, which resulted in docking times of 0.5-0.7 minutes per compound per processor (mpcpp). This unit translates to the average time required for DOCK to find and calculate the best scoring pose for a single compound bound to the protein's binding site.

Table 2. First phase screen parameters
pruning_max_orients              100
pruning_clustering_cutoff        12
simplex_anchor_max_iterations    10
simplex_grow_max_iterations      25

After the first phase, the results were collected, ranked, and used to make a new database to be used in the second phase.
This phase used more stringent parameters, listed in Table 3, and produced docking times averaging 3.95 mpcpp in the drug-like rescreen.

Table 3. Second phase energy parameters
pruning_max_orients              250
pruning_clustering_cutoff        30
simplex_anchor_max_iterations    100
simplex_grow_max_iterations      100

The parameters for each screen were also adjusted with respect to the availability of grid resources and experiment deadlines.
2.7. AMBER Score

AMBER scoring is based on the receptor and ligand interactions using solvent energy calculations [14], and allows both receptor and ligand to be flexible. AMBER screening also requires many input files that take considerable time to generate. The results from the rescreen were collected, ranked, and formed into a new database to be used in the AMBER screen. Important parameters were amber_score_before_md_minimization_cycles (before md), amber_score_md_steps (md steps), and amber_score_after_md_minimization_cycles (after md). Before md and after md describe score optimization cycles, while md steps is the number of steps in the molecular dynamics simulation [14].

Table 4. Second phase AMBER parameters
before md    50
md steps     750
after md     25

Tests using the parameters listed in Table 4 produced docking times of 7.65 mpcpp.
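The parameter sets in Tables 2 to 4 can be kept in one place and rendered as name/value lines, which is the general layout of a DOCK input file. The sketch below is a minimal illustration, not the scripts used in this study. The parameter names and values are taken from the tables; the dictionary names, the helper function, and the column width are assumptions, and a complete DOCK input file needs many more parameters than the ones shown.

```python
# Values copied from Tables 2, 3, and 4; the grouping into dictionaries is illustrative.
FIRST_PHASE = {
    "pruning_max_orients": 100,
    "pruning_clustering_cutoff": 12,
    "simplex_anchor_max_iterations": 10,
    "simplex_grow_max_iterations": 25,
}

SECOND_PHASE_ENERGY = {
    "pruning_max_orients": 250,
    "pruning_clustering_cutoff": 30,
    "simplex_anchor_max_iterations": 100,
    "simplex_grow_max_iterations": 100,
}

SECOND_PHASE_AMBER = {
    "amber_score_before_md_minimization_cycles": 50,
    "amber_score_md_steps": 750,
    "amber_score_after_md_minimization_cycles": 25,
}

def render_parameters(params, width=50):
    """Render a parameter dictionary as left-aligned 'name value' lines."""
    return "\n".join(f"{name:<{width}}{value}" for name, value in params.items())

if __name__ == "__main__":
    for label, params in [
        ("first phase", FIRST_PHASE),
        ("second phase energy", SECOND_PHASE_ENERGY),
        ("second phase AMBER", SECOND_PHASE_AMBER),
    ]:
        print(f"# {label}")
        print(render_parameters(params))
```

Keeping each phase as a separate dictionary makes it easy to see at a glance how the rescreen differs from the first pass.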
3. Results
In this experiment, the catalytic site of protein tyrosine phosphatase SHP-2 was successfully screened against the drug-like and lead-like databases and used up to 137 processors from 5 clusters.
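The mpcpp figures reported in the Methods can be turned into a rough wall-clock estimate by multiplying by the number of compounds and dividing by the number of processors. The sketch below does this under a strong assumption: perfect parallel efficiency with all processors busy for the whole run, with no queueing, restarts, or input file generation. The function name is hypothetical, and using the peak of 137 processors makes the result a lower bound rather than a prediction.

```python
def estimated_days(n_compounds, mpcpp, n_processors):
    """Idealized wall-clock days for a screen, assuming perfect parallel scaling."""
    total_minutes = n_compounds * mpcpp / n_processors
    return total_minutes / (60 * 24)

# Drug-like AMBER screen: 34,008 compounds at 7.65 mpcpp on the peak 137 processors.
amber_days = estimated_days(34008, 7.65, 137)

# Drug-like first phase: 2,066,906 compounds at the 0.5-0.7 mpcpp range reported.
first_phase_low = estimated_days(2066906, 0.5, 137)
first_phase_high = estimated_days(2066906, 0.7, 137)

print(f"AMBER screen: about {amber_days:.1f} days")
print(f"First phase: about {first_phase_low:.1f} to {first_phase_high:.1f} days")
```

Actual runs used fewer processors for much of the time, so real durations would be longer than these idealized figures.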
3.1. Inhibitors for SHP-2

The resulting energy scores from the second phase energy and AMBER scoring methods were combined, and a final ranked list of compounds, from best binding to worst binding, was generated. Compounds that failed to be ranked by AMBER are labeled with n/a. Visual inspection was required to confirm proper binding. Some ranked compounds had questionable scores, such as the -5x10^5 score of ZINC3097907. Of the 2,066,906 compounds in the drug-like database, 1,391,066 were ranked in the first phase. In order to complete the rescreening in a reasonable amount of time, 2.5% of the ranked compounds (34,776) were gathered into new database slices. 34,008 compounds were subsequently ranked in the rescreen and all were used in the AMBER screen where 33,942 were ranked. The 20 best binding
compounds from the drug-like database are presented in Table 5.
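The selection of the best 2.5% of first-phase compounds for rescreening can be sketched as a sort followed by a cut. This is an illustration under stated assumptions, not the Perl scripts used in the study: the function name and the example scores are hypothetical, lower (more negative) energy is treated as better, and the cut is truncated to a whole number of compounds. Truncating 2.5% of 1,391,066 gives 34,776, which agrees with the count reported in the text.

```python
def top_fraction(scored, fraction):
    """Return the best-scoring fraction of (compound_id, score) pairs.

    Lower (more negative) energy scores are treated as better binding.
    """
    ranked = sorted(scored, key=lambda pair: pair[1])
    keep = int(len(ranked) * fraction)  # truncate to a whole number of compounds
    return ranked[:keep]

# Hypothetical compound identifiers and scores, for illustration only.
example = [("cmpd_a", -60.0), ("cmpd_b", -90.5), ("cmpd_c", -45.2), ("cmpd_d", -70.1)]
selected = top_fraction(example, 0.5)
print(selected)

# Size of the cut for the drug-like first phase reported in the text.
rescreen_size = int(1391066 * 0.025)
print(rescreen_size)
```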
Table 5. Top 20 drug-like compounds

              RANKINGS                SCORES
Rank  ZINC ID   Total  Dock  Amber    Dock    Amber
1     1717339   14     3     11       -114    -902
2     2260256   20     11    9        -103    -1119
3     3097907   30     28    2        -90.1   -5x10^5
4     4892457   163    163   n/a      -70.7   n/a
5     4025466   465    424   41       -66.8   -97.8
6     1532056   499    499   n/a      -66.2   n/a
7     3105888   571    507   64       -66.1   -82.7
8     1733678   624    326   298      -67.9   -65.2
9     1130175   636    629   7        -65.1   -1486
10    2042297   686    451   235      -66.5   -67.6
11    3876071   764    265   499      -68.8   -60.8
12    3875553   767    613   154      -65.2   -71.9
13    0040093   791    409   382      -66.9   -63.2
14    4872616   805    689   116      -64.6   -75.4
15    6736725   820    665   155      -64.7   -71.9
16    4872627   831    769   62       -64.2   -83.6
17    2843737   859    275   584      -68.7   -59.4
18    2637980   919    564   355      -65.6   -63.7
19    3871718   929    170   759      -70.6   -57.0
20    1760558   959    656   303      -64.8   -65.1
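The Total ranking in Table 5 is consistent with adding each compound's Dock rank and Amber rank, with compounds that AMBER failed to rank (n/a) contributing nothing to the sum. The sketch below reproduces that arithmetic for the first five rows of Table 5. It is an inference from the tabulated values rather than a description of the authors' ranking script, and the function name and the handling of n/a as zero are assumptions.

```python
def consensus_ranking(dock_rank, amber_rank):
    """Combine two rankings by summing ranks; a missing AMBER rank adds zero."""
    totals = {}
    for compound, dock in dock_rank.items():
        amber = amber_rank.get(compound)  # None when AMBER produced no rank (n/a)
        totals[compound] = dock + (amber if amber is not None else 0)
    # Smaller totals are better; sort ascending.
    return sorted(totals.items(), key=lambda item: item[1])

# Dock and Amber ranks for the first five rows of Table 5.
dock = {
    "ZINC1717339": 3,
    "ZINC2260256": 11,
    "ZINC3097907": 28,
    "ZINC4892457": 163,
    "ZINC4025466": 424,
}
amber = {
    "ZINC1717339": 11,
    "ZINC2260256": 9,
    "ZINC3097907": 2,
    "ZINC4025466": 41,  # ZINC4892457 is n/a and therefore absent
}

ranking = consensus_ranking(dock, amber)
for compound, total in ranking:
    print(compound, total)  # totals: 14, 20, 30, 163, 465
```

Treating n/a as zero favors compounds that AMBER could not score, which is one reason visual inspection of the top entries remains necessary.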
Figure 1 is an image of the first ranked compound from the drug-like screening. It appears to interact very well with the catalytic pocket (denoted by the purple box) of SHP-2, however, careful inspection revealed a subtle irregularity where one atom intersected with the SHP-2 surface, as indicated in the red circle. This may have contributed to the relatively high score for this interaction. The next compound that reasonably interacted with the SHP-2 catalytic pocket is shown in Figure 2. This compound was ranked fifth and fit well in the binding pocket of SHP-2. Intensive interaction was demonstrated by the numerous hydrogen bonds (green lines) connecting oxygen atoms (red) of the binding compound to amino acid residues (orange sticks) within the catalytic site of SHP-2. Similar to the drug-like database screening, 22,938 of the 972,608 lead-like compounds were ranked and the best 20 binding compounds are presented in Table 6. It is interesting to note that two of these compounds (ZINC3097907 and ZINC1532056) appear in both of these lists with the same energy scores, but have different rankings. This is due to the fact that these compounds have both drug-like and lead-like properties.

Figure 1. Visualization of the first ranked compound (ZINC 1717339) from the drug-like screening.
Figure 2. Visualization of the fifth ranked compound (ZINC 4025466) from the drug-like screening.

Table 6. Top 20 lead-like compounds
              RANKINGS                SCORES
Rank  ZINC ID   Total  Dock  Amber    Dock    Amber
1     5518020   20     16    4        -95.9   -4x10^4
2     3097907   25     23    2        -90.1   -5x10^5
3     0405809   30     27    3        -82.8   -2x10^5
4     1532056   169    169   n/a      -66.2   n/a
5     5478334   249    238   11       -64.8   -868
6     5413470   418    401   17       -62.8   -214
7     5413467   450    437   13       -62.5   -231
8     3953252   496    495   1        -62.1   -2x10^6
9     3115745   587    400   187      -62.8   -63.4
10    2116249   702    153   549      -66.4   -54.8
11    8030102   719    667   52       -60.9   -73.3
12    2431301   799    86    713      -68.7   -52.2
13    5290065   854    90    764      -68.6   -51.4
14    0157925   884    884   n/a      -59.6   n/a
15    0132550   932    122   810      -67.4   -50.7
16    2139830   953    182   771      -65.7   -51.3
17    6941347   959    278   681      -64.3   -52.6
18    5134922   1032   811   221      -60     -62.6
19    0074861   1104   1004  100      -59     -68.3
20    3293108   1115   878   237      -59.7   -61.9
Visual inspection of the top five ranked compounds again showed erroneous or no interaction with the SHP-2 catalytic pocket. When ZINC3097907 was visualized as shown in Figures 3A and 3B, it was apparent that the compound was partially embedded in the protein, again causing the extraordinarily high AMBER score (-5x10^5). The sixth compound had reasonable binding to SHP-2 as illustrated in Figure 4. Again, good interaction was evident with the presence of numerous hydrogen bonds (green lines) connecting oxygen atoms (red) of the compound to various amino acids in the catalytic site (orange sticks). This provides a good alternative compound with a different structure and chemical properties that may inhibit SHP-2 effectively. The diverse interaction locations of ZINC5413470 suggest it may have a different level of specificity with SHP-2, compared to the compounds identified in the drug-like screen.
Our results showed that of the top 20 compounds from each database, sulfonic acid (Fig 5A) motifs stand out in the rankings. There are 9 sulfonic acids compared to 5 carboxylic acids (the next most frequent; Fig 5C). It is interesting to note that phosphinic acids, shown in Fig 5F, are ranked the highest but have a lower frequency (4 excluding duplicates). Other compounds include 4 propanoic acids (Fig 5D) and 4 phosphonic acids (Fig 5E). The highest ranked visually confirmed compounds in the drug-like and lead-like databases are phosphonic acids, suggesting that phosphinic and sulfonic acids are more prone to generate false positives if docked with phosphatases.

Figure 3A and 3B. Visualization of the second ranked compound (ZINC 3097907) from the lead-like screening.
Figure 4. Visualization of the sixth ranked compound (ZINC 5413470) from the lead-like screening.
Figure 5. Chemical motifs of the top ranked drug-like and/or lead-like compounds.
3.2. Grid performance

Tables 7 and 8 present a summary of the computational resources used during the screening. Tea01 had the slowest processing time, yet screened the most compounds. This can be attributed to the numerous processors on Tea01 as well as availability of resources. The unit mpcpp offered a useful way of looking at cluster performance as it disregarded grid-related variables and focused on computational times, which were directly related to input parameters and processing power.

Table 7. Cluster performance during second phase energy score screen
Cluster     CPUs     Compounds screened   mpcpp
Rocks-52    6-16     7642                 3.33
Tea01       28-48    37914                4.55
Cafe01      9-26     8600                 2.48
Ocikbpra    6-26     3885                 2.01
Lzu         14-21    5678                 2.01

Table 8. Cluster performance during second phase AMBER screen
Cluster     CPUs     Compounds screened   mpcpp
Rocks-52    6-16     9490                 7.58
Tea01       28-48    20254                12.68
Cafe01      9-26     4959                 6.55
Ocikbpra    6-26     9871                 6.55
Lzu         14-21    16795                6.10

Energy and AMBER scores calculated using the rescreen parameters in Tables 3 and 4 produced satisfactory results without consuming large amounts of grid resources. Test sets have shown that energy scores produced by the second phase energy parameters were very similar to those produced by more stringent parameters (a four-fold increase in the first screen parameters) while completing docking in 2/5 the time. Comparison of scores produced by AMBER parameters to scores produced by a four-fold increase in the same parameters showed that not only did Table 4's parameters produce much more minimized scores, but also only took the time. Thus, the four-fold increase in AMBER parameters was not implemented. It was also found that emphasizing the initial score minimization (before md) resulted in better score optimization.
DOCK issues: Segmentation faults of DOCK were observed in this experiment; however, contrary to the previous study by Levesque et al. [3], compounds causing segmentation faults in this experiment did not share a common trait and the faults only occurred during the AMBER screen. This error was found to be independent of grid resources because the same fault occurred when the job was re-screened on different clusters. Removing the problematic compounds eliminated the fault and the other compounds completed without errors. Currently, the input file preparation cannot resume automatically after the faulty ligand is removed and must be restarted manually. These compounds (30/3,039,514) represent less than 0.001% of the compounds screened and should therefore not be thought of as a deterrent to using virtual screening to identify potential inhibitors.
Disk storage issues: The screening of SHP-2 against two databases totaling 3,039,514 compounds produced a great deal of data and results. The total amount of disk space used is summarized in Table 10. AMBER screening requires many input files, and these files can take as long to generate as the AMBER energy score takes to compute.

Table 10. Summary of disk space used
Cluster     Space used
Rocks-52    38GB
Tea01       94GB
Cafe01      111GB
Ocikbpra    30GB+ (compressed to 11GB)
Lzu         52GB
Total       325GB+

In a slice of 573 compounds, the input files amounted to 1.5 GB. For three AMBER screens totaling 56,980 compounds, at least 150GB of data can be expected. This can interfere with the logistics of a run, especially on older clusters where disk limitations may prevent data gathering. Additionally, since all users share the same disk allocation on each cluster, a single user with unrestricted disk usage can inconvenience other users. Thus, in some cases, the data was compressed using standard zip commands. For instance, the data for a slice of 243 compounds required 620 MB of uncompressed space, but after compression the data only required 115 MB. Although the compression reduced the disk space usage, it added a layer of complexity to the collecting, ranking, and overall organization of the data.
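The disk space figures above follow from simple linear scaling of the measured slice. The sketch below reproduces the arithmetic: 1.5 GB for 573 compounds extrapolates to roughly 149 GB for 56,980 compounds, in line with the 150GB figure in the text, and the 620 MB to 115 MB example corresponds to a compression ratio of about 5.4. The function name is hypothetical, and applying the single-slice compression ratio to the whole data set is an assumption made only for illustration.

```python
def scale_linearly(known_amount, known_count, target_count):
    """Extrapolate a measured quantity to a different number of compounds."""
    return known_amount * target_count / known_count

# Measured: 1.5 GB of AMBER input files for a slice of 573 compounds.
amber_input_gb = scale_linearly(1.5, 573, 56980)

# Measured: a 243-compound slice shrank from 620 MB to 115 MB when compressed.
compression_ratio = 620 / 115

# Assumption: the same ratio would hold across all AMBER input files.
compressed_gb = amber_input_gb / compression_ratio

print(f"Estimated AMBER input files: {amber_input_gb:.0f} GB")
print(f"Compression ratio: {compression_ratio:.1f}x")
print(f"Estimated size if fully compressed: {compressed_gb:.0f} GB")
```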
4. Conclusions
Virtual docking on the grid is an effective and efficient method to screen compound databases for biomedical purposes such as drug discovery. Our experiment has produced a list of potential SHP-2 specific inhibitors that are in the process of being validated in wet-bench experiments and will be used to study SHP-2 signaling pathways in cells. These compounds have potential clinical applications. However, further testing must be performed to verify that these compounds are membrane permeable, i.e., able to enter the cell, and are effective inhibitors of SHP-2 inside the cell. Although DOCK is an established program capable of delivering results, Figures 2 and 4 show that DOCK is not foolproof. It is ultimately a tool aimed at aiding scientific discoveries and requires further experimental verification. In the AMBER run, a small portion of the input ligands failed to prepare and caused a segmentation fault. The preparation generally fails because of missing force field parameters, charge issues, bond issues, or atom issues. In the case of the missing parameters, they are recoverable with the auxiliary program Antechamber. In the remaining three cases, the problem may lie in an improper structure of the ligand file. Because of the very small number of problem molecules, it was not considered a significant issue during the screening process. Moreover, a complete diagnosis of the problem would have required substantial knowledge of the workings of AMBER auxiliary programs [19] and is beyond the scope of this project. From Tables 7 and 8 we can see that all clusters contributed greatly to the virtual screening experiment and the absence of even one of these clusters can significantly increase the virtual screening time. This highlights the importance of the collaborative nature of grid computing. Consideration of other grid users becomes an issue during routine virtual screening experiments, especially with regard to disk space requirements.
During the virtual screening experiment, the data cannot exist in compressed form since the AMBER input files must be read by DOCK. Additionally, the current scripts designed for the collection and ranking of the screening results are not compatible with compressed data, hence requiring the data to be uncompressed manually in order to locate the files of interest. During the course of these virtual screens, an updated version of the ZINC drug-like database was released with over 5 million compounds [1]. The input files alone would require an estimated 13 TB of total disk space. With the increasing size of chemical compound databases, other solutions should be considered, such as the addition of more disk space to the grid environment. If multiple users are performing virtual screenings using the same chemical database, placement of one set of DOCK input files on a cluster system that could be accessed by each user is another option, but security and access privileges would have to be addressed in this scenario.
Deployment of DOCK on a grid environment remains feasible, but still offers some challenges. The drug-like AMBER screen, for example, was expected to take 4 days to complete (2 days of docking and 2 days of input file generation). However, the screening required 11 days, with much of the time lost to restarting and continuing the screening due to cluster-specific grid resource errors. A Java error on the Rocks-52 cluster related to zombie processes resulted in the discontinuation of the managing Perl script on the master cluster and required manually restarting the screening. This is discussed in a companion paper [20]. There were also grid resource situations beyond the user's control. Cluster maintenance, for example, disconnected a cluster, affecting the jobs running there. An intriguing aspect of grid resource availability was the high number of users. It was observed that some DOCK jobs were in queue for many days before finally running or being cancelled. On these busy clusters, such as Rocks-52 and Cafe01, the minimum number of processors to be used in the virtual screen was lowered to take advantage of any processors that may become available. Since DOCK is an inherently processor-heavy application, busy clusters may not have enough free processors to allow screenings, as freed processors can be taken immediately by the more versatile single-processor jobs. This suggests the need for more advanced schedulers where the relative queuing times of different jobs are taken into consideration [21].
It can be expected that improvements will continue to be built on the current platform to further advance the role of the Grid and computer technology in the field of biomedical sciences.
5. Acknowledgments
The authors would like to acknowledge support from the UCSD Pacific Rim Experiences for Undergraduates program (PRIME NSF INT 0407508 and NSF OISE 0710726), the California Institute for Telecommunication and Information Technology (Calit2), and Osaka University's Fostering of Globally-leading Researchers in Integrated Sciences program funded by MEXT. We appreciate PRAGMA for the use of the Grid testbed and technical support. Molecular graphics images were produced using the UCSF Chimera package from the Resource for Biocomputing, Visualization, and Informatics at the UC San Francisco (supported by NIH P41 RR-01081).
6. References
[1] J.J. Irwin and B.K. Shoichet, ZINC - A Free Database for Virtual Screening, 2006, http://zinc.docking.org/.
[2] PRAGMA Grid Resources, http://pragma-goc.rocksclusters.org/pragma-doc/resources.html.
[3] M.J. Levesque, K. Ichikawa, S. Date, J.H. Haga, Bringing Flexibility to Virtual Screening for Enzymatic Inhibitors on the Grid, Grid 2008, In press.
[4] Protein Data Bank, http://www.rcsb.org/pdb/explore.do?structureId=3B7O.
[5] P.T. Lang and S. Brozell, Preparing Molecules for DOCKing, 2007, http://dock.compbio.ucsf.edu/DOCK_6/tutorials/struct_prep/prepping_molecules.htm.
[6] Z.Z. Chong and K. Maiese, The Src homology 2 domain tyrosine phosphatase SHP-1 and SHP-2: diversified control of cell growth, inflammation, and injury, Histol. Histopathol., Cellular and Molecular Biology, pp. 1-3.
[7] UCSF Chimera, http://www.cgl.ucsf.edu/chimera/.
[8] D. Barford and B.G. Neel, Revealing mechanisms for SH2 domain mediated regulation of the protein tyrosine phosphatase SHP-2, Structure, Current Biology Ltd, 1998, pp. 1.
[9] S. Makino and I.D. Kuntz, Automated flexible ligand docking method and its application for database search, J. Comp. Chem., 1997, pp. 1812-1825.
[10] H. Wheadon, N.R.D. Paling, and M.J. Welham, Molecular interactions of SHP1 and SHP2 in IL-3 signalling, Cell. Signalling, Elsevier, 2002, pp. 1.
[11] N.K. Tonks, Protein tyrosine phosphatase: from genes, to function, to disease, Mol. Cell Biol., Nature Publishing Group, 2006, pp. 1-11.
[12] W.L. Jorgensen et al., The Many Roles of Computation in Drug Discovery, Science, AAAS, Washington DC, 2004, pp. 3.
[13] M. Stein-Gerlach, C. Wallasch, and A. Ullrich, SHP-2, SH2-containing protein tyrosine phosphatase-2, The International Journal of Biochemistry & Cell Biology, Pergamon, 1998, pp. 1.
[14] P.T. Lang, D. Moustakas et al., DOCK 6.1 Users Manual, 2007, http://dock.compbio.ucsf.edu/DOCK_6/dock6_manual.htm.
[15] UCSF DOCK, http://dock.compbio.ucsd.edu/.
[16] P.T. Lang, D. Moustakas et al., DOCK 6.1 Users Manual, 2007, http://dock.compbio.ucsf.edu/DOCK_6/tutorials/ligand_sampling_dock/ligand_sampling_dock.html.
[17] P.T. Lang, D. Moustakas et al., DOCK 6.1 Users Manual, 2007, http://dock.compbio.ucsf.edu/DOCK_6/tutorials/grid_generation/generating_grid.html.
[18] M.J. Levesque, K. Ichikawa, S. Date, and J.H. Haga, Design of a Grid Service-based Platform for In Silico Protein-Ligand Screenings, Comp. Meth. Prog. Biomed., 2008, doi:10.1016/j.cmpb.2008.07.005.
[19] Personal communication, Scott Bozell, Scripps Research Institute, 2008.
[20] P.D. Pham, M.J. Levesque, K. Ichikawa, S. Date, and J.H. Haga, Identification of a Specific Inhibitor for the Dual-Specificity Enzyme SSH-2 via Docking Experiments on the Grid, 4th IEEE International Conference on eScience, 2008, In press.
[21] Personal communication, Blair Bethwaite, 2008.
