SlideShare una empresa de Scribd logo
1 de 64
Tuesday September 18th 2012
DEVELOPMENT OF METHODS FOR DOCKING AND
DESIGNING
SMALL MOLECULES WITHIN THE ROSETTA CODE
FRAMEWORK
A doctoral dissertation defense presented by
GORDON HOWARD LEMMON
ROSETTA
Outline of presentation
A. What is structural biology?
B. Protein modeling and ligand docking
C. Introduction to Rosetta software
D. HIV-1 PR/PI binding affinity prediction
E. Rosetta software development
F. Ligand docking with waters using improved
Rosetta ligand docking code
2
Outline of presentation
A. What is structural biology?
B. Protein modeling and ligand docking
C. Introduction to Rosetta software
D. HIV-1 PR/PI binding affinity prediction
E. Rosetta software development
F. Ligand docking with waters using improved
Rosetta ligand docking code
3
What is structural biology?
ProteinsDNA
Structural Biology is the study of structure and function of
biological molecules such as DNA, RNA, and proteins
4
How big are proteins?
5
Water
1.51 Å
HH
O
Amprenavir
~17 Å 72 atoms
HIV-1 Protease (PR)
~54 Å 3163 atoms
1 Angstrom (Å) = 1 ten millionth of a millimeter
Proteins consist of amino acid chains
6
Protein sequence determines
structure7
Protein structure determines function
HIV-1 protease cleaves poly-protein precursors to
form functional proteins
8
Peptide chain
HIV-1 protease
Proteins are dynamic
9
Outline of presentation
A. What is structural biology?
B. Protein modeling and ligand docking
C. Introduction to Rosetta software
D. HIV-1 PR/PI binding affinity prediction
E. Rosetta software development
F. Ligand docking with waters using improved
Rosetta ligand docking code
10
What is protein modeling?
 Prediction of protein structure from
1. Sequence alone (de novo folding)
HIV-1 PR
Amino Acid Sequence
ANPCCSNPCQNRGECMSTGFDQ
YKCDCTRTGFYGENCTTPEFLTRI
KLLLKPTPNTVHYILTHFKGVWNIV
NNIPFLRSLIMKYVLTSRSYLIDSP
PTYNVHYGYKSWEAFSNLSYYTR
ALPPVADDCPTPMGVKGNKELPD
SKEVLEKVLLRREFIPDPQGSNM
MFAFF…
11
What is protein modeling?
 Prediction of protein structure from
2. Sequence similarity (Comparative modeling)
HIV-1 PR Sequence
PQITLWKRPLVTIRIGGQL
KEALLDTGADDTVLEEMN
LPGRWKPKMIGGIGGFIK
VRQYDQIPIEICGHKAIGT
VLVGPTPTNVIGRNLLTQI
GCTLNF…
HIV-2 PR
HIV-1 PR
12
+
What is ligand docking?
 Prediction of structure of protein/ligand interface
 Prediction of ligand binding affinity
13
+
Outline of presentation
A. What is structural biology?
B. Protein modeling and ligand docking
C. Introduction to Rosetta software
D. HIV-1 PR/PI binding affinity prediction
E. Rosetta software development
F. Ligand docking with waters using improved
Rosetta ligand docking code
14
Rosetta protein modeling consists
of sampling and scoring15
RosettaLigand docking consists of
sampling and scoring16
RosettaLigand docking consists of
sampling and scoring17
RosettaLigand docking consists of
sampling and scoring18
RosettaLigand score function
 Knowledge-based score terms
19
Score term
Default
weight
attractive 0.8
repulsive 0.4
solvation 0.6
dunbrack 0.4
pair 0.8
hbond_lr_bb 2.0
hbond_bb_sc 2.0
hbond_sc 2.0
Outline of presentation
A. What is structural biology?
B. Protein modeling and ligand docking
C. Introduction to Rosetta software
D. HIV-1 PR/PI binding affinity prediction
E. Rosetta software development
F. Ligand docking with waters using improved
Rosetta ligand docking code
20
21
HIV-1 PR is flexible
Simmerling 2005
22
HIV-1 PR becomes rigid upon
PI binding23
HIV-1 protease mutations
WHO drug resistance
mutations in red
24
Mutation leads to conformational diversity
FDA approved protease inhibitors (PIs)
Tipranavir
Darunavir
Atazanavir
Lopinavir
25
Previous PR/PI ΔΔG
predictions failed
Cheng (2009)
Score Function
Correlation
N=112
Number of non-hydrogen atoms 0.172
X-Score (HPScore) 0.341
SYBYL (ChemScore) 0.276
DS (PMF04) 0.183
DrugScore (PairSurf) 0.225
AutoDock 0.38
Jenwitheesuk E Samudrala R. (2003)
26
Experimental vs Predicted HIV-1 PR ΔΔG
Defining ΔΔG and ΔΔΔG
27
176 experimental PR/PI ΔΔGs
171 PR template structures28
 176 PR/PI ΔΔGs
 sequence but not structure
 34 sequences
 10 distinct protease inhibitors
 171 PR structures represent PR flexibility
RosettaLigand PR/PI ΔΔGs predictions
29
0.1 Å 5˚ PI movements
Side chain and ligand rotamer sampling
Minimization of PR side chain and PI
torsion angles
MC Accept
Minimize Backbone torsion angles
Energy filter
Random 5 Å Translation complete
rotation of PI
171 PR template
structures
176 Sequence/PI
pairs
10 Rosetta relaxed models per
input (300,960 models)
30,096 Rosetta inputs
1000 RosettaLigand docked
models per relaxed model
(300,960,000 docked models)
Top 10% of models by total score
for each Sequence/PI pair
Top models by interface score for
each Sequence/PI pair
RosettaLigand DockingPR/PI ΔΔGs prediction workflow
x6
Reweighting score terms improves
HIV-1 PR/PI ΔΔG predictions
Score term
Default
weight
Optimized weights
ΔΔG ΔΔΔG
attractive 0.8 0.71 0.31
repulsive 0.4 -0.01 0.17
solvation 0.6 0.68 0.15
dunbrack 0.4 0.29 0.43
pair 0.8 0.80 0.80
hbond_lr_bb 2.0 0.85 0.11
hbond_bb_sc 2.0 0.09 -0.20
hbond_sc 2.0 -0.35 1.71
CORRELATIONS (R) 0.16 0.38 0.51
30
Assuming constant unbound ΔG
improves PR/PI ΔΔG predictions
Standard approachConstant unbound approach
31
Correlation plots
Experimental on X Predicted on Y
Default weights:
R=0.16
32
Previous PR/PI ΔΔG
predictions failed
Score Function
Correlation
N=112
Number of non-hydrogen atoms 0.172
X-Score::HPScore 0.341
SYBYL::ChemScore 0.276
DS::PMF04 0.183
DrugScorePDB::PairSurf 0.225
AutoDock 0.38
RosettaLigand 0.71
33
Experimental vs Predicted HIV-1 PR ΔΔG
Outline of presentation
A. What is structural biology?
B. Protein modeling and ligand docking
C. Introduction to Rosetta software
D. HIV-1 PR/PI binding affinity prediction
E. Rosetta software development
F. Ligand docking with waters using improved
Rosetta ligand docking code
34
35
Fragment the Ligand Search database for fragments
Assemble rotamer librariesSample from libraries
during docking
Flexibility through fragments
36
Ligand fragment rotamers allow
efficient flexibility37
Ligand rotamer docking
38
Ligand docking with interface design
A54R
L50Y
C9R
DHT
DHT: Dihydrotestosterone
HisF: imidazole glycerol phosphate synthase
HisF
DHT
Enlarged prostate gland
prostate cancer
RosettaLigand prediction
39
Fragment based screening can greatly
expand sampling space
Congreve, M. et al. Drug Discov.Today 2003,8, 876-877
Traditional Screening Fragment based screening
40
Common drug based Fragments
Hartshorn M.J. Murray C.W.et.al. J. Med. Chem. 2005 48 403-413
H
N
N
N
N
N
N
H
N
N
S
O
O
NH2
NH
NH2
O
N
H
OH
OH
N
H
N N
NH
N
O
N
N NH
O
41
RosettaLigandDesign
Library of small
molecule fragments
Place fragments in protein binding site
-10
-12
3
-7
-5
Select low
energy
models for
refinement
Dock ligand with flexible protein
side-chains and backbone
42
RosettaLigandDesign
Library of small
molecule fragments
Place fragments in protein binding site
-8
-15
-18
-10
-12
Select low
energy
models for
refinement
Dock ligand with flexible protein
side-chains and backbone
43
Examples of fragments
Carbon
Oxygen Nitrogen
1 connection
2 connections
CH2 connections Ntrp connections
Core fragment
44
Random assembly of fragments
45
Rosetta ligand design in action
46
A. Low-res search for starting fragment
B. Refine (dock) starting fragment
C. Grow small-molecule using fragment library
D. Refine (dock) 2-fragment complex
E. Grow small-molecule using fragment library
F. Refine (dock) 3-fragment complex
G. Add Hydrogens to unsatisfied connection
points
Protein binding sites are complex
Dethiobiotin
(DTB)
Inorganic
phosphate
Mg
Ions ADP
47
Multiple Ligand docking may
capture induced fit effects
Serial Docking
Simultaneous Docking
48
Rosetta multiple ligand docking
49
Outline of presentation
A. What is structural biology?
B. Protein modeling and ligand docking
C. Introduction to Rosetta software
D. HIV-1 PR/PI binding affinity prediction
E. Rosetta software development
F. Ligand docking with waters using
improved Rosetta ligand docking code
50
Binding of HIV-1 protease
inhibitors involves H2O51
Translation of water and PI
52
Rotation of water and PI
53
RMSD measures accuracy of
docked models54
6 Angstrom (Å) RMSD 2 Angstrom (Å) RMSD
6 Angstrom (Å) RMSD 2 Angstrom (Å) RMSD
“Root mean square deviation”
Protein-centric waters improve
HIV-1 protease placement55
Ligand-centric waters improve
CSAR inhibitor placement
 “Community Structure-Activity Resource”
 299 protein/ligand structures with interface waters
56
RMSDs vs Rosetta scores
57
Waters improve docking in non-
crowded interfaces
58
Interface crowdedness correlates
with helpfulness of water docking59
Conclusions
 Binding affinity predictions can be improved by
 Optimizing Rosetta score term weights
 Ignoring the unbound state
 New RosettaLigand code allows
 Multiple ligand docking
 Fragment based rotamers for greater flexibility
 Fragment based design of ligands
 Docking with waters helps in spacious binding
cavities, hurts in crowded binding cavities
60
Professional acknowledgements
Meiler Lab
Jens Meiler
Kristian Kaufmann
Sam Deluca
Steven Combs
Committee
David Tabb
Richard DAquila
Brian Bachmann
Jarrod Smith
Molecular Biophysics Training Grant (NIH)
RosettaCommons
61
Personal acknowledgments
Church Friends
62
Personal acknowledgements
63
Personal acknowledgements
64

Más contenido relacionado

Destacado

Applications of computational chemistry in drug designing
Applications of computational chemistry in drug designingApplications of computational chemistry in drug designing
Applications of computational chemistry in drug designingaabia zafar
 
Protein-ligand docking
Protein-ligand dockingProtein-ligand docking
Protein-ligand dockingbaoilleach
 
Accurate protein-protein docking with rapid calculation
Accurate protein-protein docking with rapid calculationAccurate protein-protein docking with rapid calculation
Accurate protein-protein docking with rapid calculationMasahito Ohue
 
Homology modeling and molecular docking
Homology modeling and molecular dockingHomology modeling and molecular docking
Homology modeling and molecular dockingRangika Munaweera
 
molecular docking
molecular dockingmolecular docking
molecular dockingKOUSHIK DEB
 
MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS
MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS
MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS santosh Kumbhar
 
Basics Of Molecular Docking
Basics Of Molecular DockingBasics Of Molecular Docking
Basics Of Molecular DockingSatarupa Deb
 
Computer Aided Drug Design and Discovery : An Overview (2006)
Computer Aided Drug Design and Discovery : An Overview (2006)Computer Aided Drug Design and Discovery : An Overview (2006)
Computer Aided Drug Design and Discovery : An Overview (2006)Girinath Pillai
 
Protein-Ligand Docking
Protein-Ligand DockingProtein-Ligand Docking
Protein-Ligand Dockingbaoilleach
 
Computer aided drug designing
Computer aided drug designingComputer aided drug designing
Computer aided drug designingMuhammed sadiq
 

Destacado (20)

Applications of computational chemistry in drug designing
Applications of computational chemistry in drug designingApplications of computational chemistry in drug designing
Applications of computational chemistry in drug designing
 
Protein-ligand docking
Protein-ligand dockingProtein-ligand docking
Protein-ligand docking
 
MOLECULAR DOCKING
MOLECULAR DOCKINGMOLECULAR DOCKING
MOLECULAR DOCKING
 
Docking Tutorial
Docking TutorialDocking Tutorial
Docking Tutorial
 
Dock Sem
Dock SemDock Sem
Dock Sem
 
Molecular docking
Molecular dockingMolecular docking
Molecular docking
 
Accurate protein-protein docking with rapid calculation
Accurate protein-protein docking with rapid calculationAccurate protein-protein docking with rapid calculation
Accurate protein-protein docking with rapid calculation
 
Credit Rating
Credit RatingCredit Rating
Credit Rating
 
Homology modeling and molecular docking
Homology modeling and molecular dockingHomology modeling and molecular docking
Homology modeling and molecular docking
 
molecular docking
molecular dockingmolecular docking
molecular docking
 
MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS
MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS
MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS
 
Computer Aided Drug Design
Computer Aided Drug DesignComputer Aided Drug Design
Computer Aided Drug Design
 
Basics Of Molecular Docking
Basics Of Molecular DockingBasics Of Molecular Docking
Basics Of Molecular Docking
 
Computer Aided Drug Design and Discovery : An Overview (2006)
Computer Aided Drug Design and Discovery : An Overview (2006)Computer Aided Drug Design and Discovery : An Overview (2006)
Computer Aided Drug Design and Discovery : An Overview (2006)
 
Computer aided Drug designing (CADD)
Computer aided Drug designing (CADD)Computer aided Drug designing (CADD)
Computer aided Drug designing (CADD)
 
docking
docking docking
docking
 
Protein-Ligand Docking
Protein-Ligand DockingProtein-Ligand Docking
Protein-Ligand Docking
 
Computer aided drug designing
Computer aided drug designingComputer aided drug designing
Computer aided drug designing
 
Protein docking
Protein dockingProtein docking
Protein docking
 
MOLECULAR MODELLING
MOLECULAR MODELLINGMOLECULAR MODELLING
MOLECULAR MODELLING
 

Similar a Docking & Designing Small Molecules within Rosetta Code Framework

Basler modellers.210126reduced
Basler modellers.210126reducedBasler modellers.210126reduced
Basler modellers.210126reducedOlivier Bignucolo
 
Structural Systems Pharmacology
Structural Systems PharmacologyStructural Systems Pharmacology
Structural Systems PharmacologyPhilip Bourne
 
Structure based computer aided drug design
Structure based computer aided drug designStructure based computer aided drug design
Structure based computer aided drug designThanh Truong
 
FORENSIC EPIGENETICS FOR BODILY FLUID TYPING, SUSPECT AGE, AND PHENOTYPING
FORENSIC EPIGENETICS FOR BODILY FLUID TYPING, SUSPECT AGE, AND PHENOTYPINGFORENSIC EPIGENETICS FOR BODILY FLUID TYPING, SUSPECT AGE, AND PHENOTYPING
FORENSIC EPIGENETICS FOR BODILY FLUID TYPING, SUSPECT AGE, AND PHENOTYPINGiQHub
 
3-D QSAR studies on maslinic acid analogs
3-D QSAR studies on maslinic acid analogs3-D QSAR studies on maslinic acid analogs
3-D QSAR studies on maslinic acid analogsNeha Dhiman
 
Structure Modeling of Disordered Protein Interactions
Structure Modeling of Disordered Protein InteractionsStructure Modeling of Disordered Protein Interactions
Structure Modeling of Disordered Protein InteractionsPurdue University
 
Dr Luke Alphey - DNA Sequencing (Introduction to Biotechniques)-Garland S
Dr Luke Alphey - DNA Sequencing (Introduction to Biotechniques)-Garland SDr Luke Alphey - DNA Sequencing (Introduction to Biotechniques)-Garland S
Dr Luke Alphey - DNA Sequencing (Introduction to Biotechniques)-Garland SEdizonJambormias2
 
Dynamic Residue Interaction Network Analysis of D30N/L90M dual mutation in Pr...
Dynamic Residue Interaction Network Analysis of D30N/L90M dual mutation in Pr...Dynamic Residue Interaction Network Analysis of D30N/L90M dual mutation in Pr...
Dynamic Residue Interaction Network Analysis of D30N/L90M dual mutation in Pr...Norifumi Yamamoto
 
Griffen MedChemica Virtual Tox Panel
Griffen MedChemica Virtual Tox PanelGriffen MedChemica Virtual Tox Panel
Griffen MedChemica Virtual Tox PanelEd Griffen
 
Accessing genetically tagged heterocycle libraries via a chemoresistant DNA s...
Accessing genetically tagged heterocycle libraries via a chemoresistant DNA s...Accessing genetically tagged heterocycle libraries via a chemoresistant DNA s...
Accessing genetically tagged heterocycle libraries via a chemoresistant DNA s...Laura Berry
 
Improving enrichment rates
Improving enrichment ratesImproving enrichment rates
Improving enrichment ratesbaoilleach
 
Towards semantic systems chemical biology
Towards semantic systems chemical biology Towards semantic systems chemical biology
Towards semantic systems chemical biology Bin Chen
 
Dynamic residue interaction network analysis of the neuraminidase H274Y mutan...
Dynamic residue interaction network analysis of the neuraminidase H274Y mutan...Dynamic residue interaction network analysis of the neuraminidase H274Y mutan...
Dynamic residue interaction network analysis of the neuraminidase H274Y mutan...Norifumi Yamamoto
 
Presentation1.pptx
Presentation1.pptxPresentation1.pptx
Presentation1.pptxmalik830559
 

Similar a Docking & Designing Small Molecules within Rosetta Code Framework (20)

CHEM3204_PRAC_Manual_2016
CHEM3204_PRAC_Manual_2016CHEM3204_PRAC_Manual_2016
CHEM3204_PRAC_Manual_2016
 
Basler modellers.210126reduced
Basler modellers.210126reducedBasler modellers.210126reduced
Basler modellers.210126reduced
 
Structural Systems Pharmacology
Structural Systems PharmacologyStructural Systems Pharmacology
Structural Systems Pharmacology
 
Structure based computer aided drug design
Structure based computer aided drug designStructure based computer aided drug design
Structure based computer aided drug design
 
presentation
presentationpresentation
presentation
 
FORENSIC EPIGENETICS FOR BODILY FLUID TYPING, SUSPECT AGE, AND PHENOTYPING
FORENSIC EPIGENETICS FOR BODILY FLUID TYPING, SUSPECT AGE, AND PHENOTYPINGFORENSIC EPIGENETICS FOR BODILY FLUID TYPING, SUSPECT AGE, AND PHENOTYPING
FORENSIC EPIGENETICS FOR BODILY FLUID TYPING, SUSPECT AGE, AND PHENOTYPING
 
3-D QSAR studies on maslinic acid analogs
3-D QSAR studies on maslinic acid analogs3-D QSAR studies on maslinic acid analogs
3-D QSAR studies on maslinic acid analogs
 
Rosa_PhD
Rosa_PhDRosa_PhD
Rosa_PhD
 
Structure Modeling of Disordered Protein Interactions
Structure Modeling of Disordered Protein InteractionsStructure Modeling of Disordered Protein Interactions
Structure Modeling of Disordered Protein Interactions
 
Dr Luke Alphey - DNA Sequencing (Introduction to Biotechniques)-Garland S
Dr Luke Alphey - DNA Sequencing (Introduction to Biotechniques)-Garland SDr Luke Alphey - DNA Sequencing (Introduction to Biotechniques)-Garland S
Dr Luke Alphey - DNA Sequencing (Introduction to Biotechniques)-Garland S
 
Dynamic Residue Interaction Network Analysis of D30N/L90M dual mutation in Pr...
Dynamic Residue Interaction Network Analysis of D30N/L90M dual mutation in Pr...Dynamic Residue Interaction Network Analysis of D30N/L90M dual mutation in Pr...
Dynamic Residue Interaction Network Analysis of D30N/L90M dual mutation in Pr...
 
Griffen MedChemica Virtual Tox Panel
Griffen MedChemica Virtual Tox PanelGriffen MedChemica Virtual Tox Panel
Griffen MedChemica Virtual Tox Panel
 
Accessing genetically tagged heterocycle libraries via a chemoresistant DNA s...
Accessing genetically tagged heterocycle libraries via a chemoresistant DNA s...Accessing genetically tagged heterocycle libraries via a chemoresistant DNA s...
Accessing genetically tagged heterocycle libraries via a chemoresistant DNA s...
 
Improving enrichment rates
Improving enrichment ratesImproving enrichment rates
Improving enrichment rates
 
ABRCMS Poster2012
ABRCMS Poster2012ABRCMS Poster2012
ABRCMS Poster2012
 
Towards semantic systems chemical biology
Towards semantic systems chemical biology Towards semantic systems chemical biology
Towards semantic systems chemical biology
 
Dynamic residue interaction network analysis of the neuraminidase H274Y mutan...
Dynamic residue interaction network analysis of the neuraminidase H274Y mutan...Dynamic residue interaction network analysis of the neuraminidase H274Y mutan...
Dynamic residue interaction network analysis of the neuraminidase H274Y mutan...
 
Vls
VlsVls
Vls
 
Presentation1.pptx
Presentation1.pptxPresentation1.pptx
Presentation1.pptx
 
Abhishek RBF final
Abhishek RBF finalAbhishek RBF final
Abhishek RBF final
 

Último

Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfSumit Kumar yadav
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsSumit Kumar yadav
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptxRajatChauhan518211
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfSumit Kumar yadav
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 sciencefloriejanemacaya1
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptMAESTRELLAMesa2
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...jana861314
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfSumit Kumar yadav
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Broad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptxBroad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptxjana861314
 

Último (20)

Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questions
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 science
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.ppt
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Broad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptxBroad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptx
 

Docking & Designing Small Molecules within Rosetta Code Framework

  • 1. Tuesday September 18th 2012 DEVELOPMENT OF METHODS FOR DOCKING AND DESIGNING SMALL MOLECULES WITHIN THE ROSETTA CODE FRAMEWORK A doctoral dissertation defense presented by GORDON HOWARD LEMMON ROSETTA
  • 2. Outline of presentation A. What is structural biology? B. Protein modeling and ligand docking C. Introduction to Rosetta software D. HIV-1 PR/PI binding affinity prediction E. Rosetta software development F. Ligand docking with waters using improved Rosetta ligand docking code 2
  • 3. Outline of presentation A. What is structural biology? B. Protein modeling and ligand docking C. Introduction to Rosetta software D. HIV-1 PR/PI binding affinity prediction E. Rosetta software development F. Ligand docking with waters using improved Rosetta ligand docking code 3
  • 4. What is structural biology? ProteinsDNA Structural Biology is the study of structure and function of biological molecules such as DNA, RNA, and proteins 4
  • 5. How big are proteins? 5 Water 1.51 Å HH O Amprenavir ~17 Å 72 atoms HIV-1 Protease (PR) ~54 Å 3163 atoms 1 Angstrom (Å) = 1 ten millionth of a millimeter
  • 6. Proteins consist of amino acid chains 6
  • 8. Protein structure determines function HIV-1 protease cleaves poly-protein precursors to form functional proteins 8 Peptide chain HIV-1 protease
  • 10. Outline of presentation A. What is structural biology? B. Protein modeling and ligand docking C. Introduction to Rosetta software D. HIV-1 PR/PI binding affinity prediction E. Rosetta software development F. Ligand docking with waters using improved Rosetta ligand docking code 10
  • 11. What is protein modeling?  Prediction of protein structure from 1. Sequence alone (de novo folding) HIV-1 PR Amino Acid Sequence ANPCCSNPCQNRGECMSTGFDQ YKCDCTRTGFYGENCTTPEFLTRI KLLLKPTPNTVHYILTHFKGVWNIV NNIPFLRSLIMKYVLTSRSYLIDSP PTYNVHYGYKSWEAFSNLSYYTR ALPPVADDCPTPMGVKGNKELPD SKEVLEKVLLRREFIPDPQGSNM MFAFF… 11
  • 12. What is protein modeling?  Prediction of protein structure from 2. Sequence similarity (Comparative modeling) HIV-1 PR Sequence PQITLWKRPLVTIRIGGQL KEALLDTGADDTVLEEMN LPGRWKPKMIGGIGGFIK VRQYDQIPIEICGHKAIGT VLVGPTPTNVIGRNLLTQI GCTLNF… HIV-2 PR HIV-1 PR 12 +
  • 13. What is ligand docking?  Prediction of structure of protein/ligand interface  Prediction of ligand binding affinity 13 +
  • 14. Outline of presentation A. What is structural biology? B. Protein modeling and ligand docking C. Introduction to Rosetta software D. HIV-1 PR/PI binding affinity prediction E. Rosetta software development F. Ligand docking with waters using improved Rosetta ligand docking code 14
  • 15. Rosetta protein modeling consists of sampling and scoring15
  • 16. RosettaLigand docking consists of sampling and scoring16
  • 17. RosettaLigand docking consists of sampling and scoring17
  • 18. RosettaLigand docking consists of sampling and scoring18
  • 19. RosettaLigand score function  Knowledge-based score terms 19 Score term Default weight attractive 0.8 repulsive 0.4 solvation 0.6 dunbrack 0.4 pair 0.8 hbond_lr_bb 2.0 hbond_bb_sc 2.0 hbond_sc 2.0
  • 20. Outline of presentation A. What is structural biology? B. Protein modeling and ligand docking C. Introduction to Rosetta software D. HIV-1 PR/PI binding affinity prediction E. Rosetta software development F. Ligand docking with waters using improved Rosetta ligand docking code 20
  • 21. 21
  • 22. HIV-1 PR is flexible Simmerling 2005 22
  • 23. HIV-1 PR becomes rigid upon PI binding23
  • 24. HIV-1 protease mutations WHO drug resistance mutations in red 24 Mutation leads to conformational diversity
  • 25. FDA approved protease inhibitors (PIs) Tipranavir Darunavir Atazanavir Lopinavir 25
  • 26. Previous PR/PI ΔΔG predictions failed Cheng (2009) Score Function Correlation N=112 Number of non-hydrogen atoms 0.172 X-Score (HPScore) 0.341 SYBYL (ChemScore) 0.276 DS (PMF04) 0.183 DrugScore (PairSurf) 0.225 AutoDock 0.38 Jenwitheesuk E Samudrala R. (2003) 26 Experimental vs Predicted HIV-1 PR ΔΔG
  • 27. Defining ΔΔG and ΔΔΔG 27
  • 28. 176 experimental PR/PI ΔΔGs 171 PR template structures28  176 PR/PI ΔΔGs  sequence but not structure  34 sequences  10 distinct protease inhibitors  171 PR structures represent PR flexibility
  • 29. RosettaLigand PR/PI ΔΔGs predictions 29 0.1 Å 5˚ PI movements Side chain and ligand rotamer sampling Minimization of PR side chain and PI torsion angles MC Accept Minimize Backbone torsion angles Energy filter Random 5 Å Translation complete rotation of PI 171 PR template structures 176 Sequence/PI pairs 10 Rosetta relaxed models per input (300,960 models) 30,096 Rosetta inputs 1000 RosettaLigand docked models per relaxed model (300,960,000 docked models) Top 10% of models by total score for each Sequence/PI pair Top models by interface score for each Sequence/PI pair RosettaLigand DockingPR/PI ΔΔGs prediction workflow x6
  • 30. Reweighting score terms improves HIV-1 PR/PI ΔΔG predictions Score term Default weight Optimized weights ΔΔG ΔΔΔG attractive 0.8 0.71 0.31 repulsive 0.4 -0.01 0.17 solvation 0.6 0.68 0.15 dunbrack 0.4 0.29 0.43 pair 0.8 0.80 0.80 hbond_lr_bb 2.0 0.85 0.11 hbond_bb_sc 2.0 0.09 -0.20 hbond_sc 2.0 -0.35 1.71 CORRELATIONS (R) 0.16 0.38 0.51 30
  • 31. Assuming constant unbound ΔG improves PR/PI ΔΔG predictions Standard approachConstant unbound approach 31
  • 32. Correlation plots Experimental on X Predicted on Y Default weights: R=0.16 32
  • 33. Previous PR/PI ΔΔG predictions failed Score Function Correlation N=112 Number of non-hydrogen atoms 0.172 X-Score::HPScore 0.341 SYBYL::ChemScore 0.276 DS::PMF04 0.183 DrugScorePDB::PairSurf 0.225 AutoDock 0.38 RosettaLigand 0.71 33 Experimental vs Predicted HIV-1 PR ΔΔG
  • 34. Outline of presentation A. What is structural biology? B. Protein modeling and ligand docking C. Introduction to Rosetta software D. HIV-1 PR/PI binding affinity prediction E. Rosetta software development F. Ligand docking with waters using improved Rosetta ligand docking code 34
  • 35. 35
  • 36. Fragment the Ligand Search database for fragments Assemble rotamer librariesSample from libraries during docking Flexibility through fragments 36
  • 37. Ligand fragment rotamers allow efficient flexibility37
  • 39. Ligand docking with interface design A54R L50Y C9R DHT DHT: Dihydrotestosterone HisF: imidazole glycerol phosphate synthase HisF DHT Enlarged prostate gland prostate cancer RosettaLigand prediction 39
  • 40. Fragment based screening can greatly expand sampling space Congreve, M. et al. Drug Discov.Today 2003,8, 876-877 Traditional Screening Fragment based screening 40
  • 41. Common drug based Fragments Hartshorn M.J. Murray C.W.et.al. J. Med. Chem. 2005 48 403-413 H N N N N N N H N N S O O NH2 NH NH2 O N H OH OH N H N N NH N O N N NH O 41
  • 42. RosettaLigandDesign Library of small molecule fragments Place fragments in protein binding site -10 -12 3 -7 -5 Select low energy models for refinement Dock ligand with flexible protein side-chains and backbone 42
  • 43. RosettaLigandDesign Library of small molecule fragments Place fragments in protein binding site -8 -15 -18 -10 -12 Select low energy models for refinement Dock ligand with flexible protein side-chains and backbone 43
  • 44. Examples of fragments Carbon Oxygen Nitrogen 1 connection 2 connections CH2 connections Ntrp connections Core fragment 44
  • 45. Random assembly of fragments 45
  • 46. Rosetta ligand design in action 46 A. Low-res search for starting fragment B. Refine (dock) starting fragment C. Grow small-molecule using fragment library D. Refine (dock) 2-fragment complex E. Grow small-molecule using fragment library F. Refine (dock) 3-fragment complex G. Add Hydrogens to unsatisfied connection points
  • 47. Protein binding sites are complex Dethiobiotin (DTB) Inorganic phosphate Mg Ions ADP 47
  • 48. Multiple Ligand docking may capture induced fit effects Serial Docking Simultaneous Docking 48
  • 50. Outline of presentation A. What is structural biology? B. Protein modeling and ligand docking C. Introduction to Rosetta software D. HIV-1 PR/PI binding affinity prediction E. Rosetta software development F. Ligand docking with waters using improved Rosetta ligand docking code 50
  • 51. Binding of HIV-1 protease inhibitors involves H2O51
  • 52. Translation of water and PI 52
  • 53. Rotation of water and PI 53
  • 54. RMSD measures accuracy of docked models54 6 Angstrom (Å) RMSD 2 Angstrom (Å) RMSD 6 Angstrom (Å) RMSD 2 Angstrom (Å) RMSD “Root mean square deviation”
  • 55. Protein-centric waters improve HIV-1 protease placement55
  • 56. Ligand-centric waters improve CSAR inhibitor placement  “Community Structure-Activity Resource”  299 protein/ligand structures with interface waters 56
  • 57. RMSDs vs Rosetta scores 57
  • 58. Waters improve docking in non- crowded interfaces 58
  • 59. Interface crowdedness correlates with helpfulness of water docking59
  • 60. Conclusions  Binding affinity predictions can be improved by  Optimizing Rosetta score term weights  Ignoring the unbound state  New RosettaLigand code allows  Multiple ligand docking  Fragment based rotamers for greater flexibility  Fragment based design of ligands  Docking with waters helps in spacious binding cavities, hurts in crowded binding cavities 60
  • 61. Professional acknowledgements Meiler Lab Jens Meiler Kristian Kaufmann Sam Deluca Steven Combs Committee David Tabb Richard DAquila Brian Bachmann Jarrod Smith Molecular Biophysics Training Grant (NIH) RosettaCommons 61

Notas del editor

  1. Do NOT explain!
  2. The meilerlab focuses on proteins…There are 1000s of different proteins that all have a unique role to play – these include proteins that form muscle, hair, and skin, to proteins that perform chemical reactions, forming and breaking chemical bonds.
  3. Explain here that most drugs that you pick up at the pharmacy work by binding to specific proteins.Proteins are very large. How is a molecule this large constructed?
  4. How are molecules as large as proteins created?
  5. This protein has 198 amino acids – it is actually two chains of 99 AA eachHow can the sequence determine something as complex as 3-D structure? It has to do with the way that amino acids interact with each other.
  6. Sequence determines structure, which determines function.These mature proteins plays a role in the activity of the HIV virus
  7. Determining sequence is easy, determining structure is hard. If we can predict structure we can understand function.
  8. Using EXPERIMENTAL structures as comparison
  9. Structure means the position of the small molecule with respect to the ligand.Predicting binding affinity is more difficult.If we can predict ligand binding affinity, then we can make predictions about how tight a potential drug will bind to its target and how specific that binding will be.
  10. Point out that this is
  11. The lowest scoring model we predict will be closest to the true position that the small molecule will assume.
  12. I’ve talked about H-bonding but there are many terms and each has a default weight.
  13. Mutations lead to drug resistance. WHO keeps track of these mutations…
  14. Medicine: As HIV-1 PR mutates, a patient being treated with one of these PRs stops responding to treatment. So they are switched to a different PR.
  15. Experimental vs Predicted!
  16. dG = Gibbs free energy.ddG = Binding affinity. Relative binding affinity w/respect to mutation.
  17. Explain the hypothesis about effect of mutation on flexible vs. rigid structure.
  18. Experimental vs Predicted!
  19. Explain that the ligand moves as well. This is very important!
  20. The idea is that instead of screening libraries of millions of larger compounds, one could screen libraries of several hundred fragments for several independent fragments, then link these together.
  21. for example a protein binding pocket can have…
  22. Induced-fit means that the protein changes its shape as it interacts with the small molecule.Enzymes that catalyze chemical reactions, either creating or breaking bonds are good examples.
  23. RMSD is an average distance over all pairs of atoms.
  24. Talk about how important these results are for PI development
  25. RMSD on X axis and Rosetta Interface Score on Y axisWith water we are consistently producing low scoring models below 2 A RMSD