SlideShare una empresa de Scribd logo
1 de 49
The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of the U.S. EPA
Cheminformatics tools supporting
dissemination of data associated with US EPA
Clean Water Act hazardous substances
Antony Williams1, Fran Kremer2, Jason Lambert1, Jace Cuje3 and Valery Tkachenko4
1. Center for Computational Toxicology and Exposure, US-EPA
2. Center for Environmental Solutions and Emergency Response, US-EPA
3. Office of Science Advisor, Policy and Engagement. US-EPA
4. ScienceDataExperts Inc.
March 2024: Spring Fall Meeting, New Orleans, LA
Data, Model and Tool Development
• I work for the Center for Computational Toxicology and Exposure
in the Computational Chemistry and Cheminformatics Branch
• There are many tools developed by our cheminformatics team and
across other centers in EPA. I will represent ours only…
• We have production level public-facing tools, proof-of-concept
public-facing tools, and many tools in development…
• We focus on FAIR data releasing it to the community and making
it available on Public APIs
1
Free-Access Cheminformatics Tools
• The Center for Computational Toxicology and Exposure has
delivered many tools including
– CompTox Chemicals Dashboard
– Proof-of-Concept cheminformatics modules
• Chemicals Hazard Profiling
• Chemical Transformations Database
• Analytical Methods and Spectra
• Chemical Safety Profiling
2
Research Projects we apply them to
4
Research Projects we apply them to
5
Research Projects we apply them to
Curating Chemistry into the DSSTox Database
6
• Chemistry underpins all of our tools
• Data assembly and curation is critical
• DSSTox assembled over 25 years
CompTox Chemicals Dashboard
https://comptox.epa.gov/dashboard/
The Charge for the Dashboard
• Develop a “first-stop-shop” for environmental chemical data to
support EPA and partner decision making:
– Centralized location for relevant chemical data
– Chemistry, exposure, hazard and dosimetry
– Combination of existing data and predictive models
– Publicly accessible, periodically updated, curated
• Easy access to data improves efficiency and ultimately
accelerates chemical risk assessment
Detailed Chemical Pages
Experimental and Predicted Data
• Physchem and Fate & Transport
experimental and predicted data
• Data can be downloaded as Excel,
TSV and CSV files
Chemical Hazard Data
Hazard Data for Copper
• 2246 rows of human/eco hazard data harvested with three
clicks
Sources of Exposure to Chemicals
Chemical Lists
Chemical Lists
• Chemical lists are focused on regulations, specific research
efforts and categories
• 425 lists and growing
– TSCA Inventory
– Clean Water Act Hazardous Substances
– Consumer Products database
– Chemicals of Emerging Concern
– PFAS lists
– Extractables and Leachables
– …lists are versioned and updated and new lists added
15
Remember those Research Projects?
Some Research Projects…
Some Research Projects…
Clean Water Act Hazardous Substances
19
Harvesting Data en masse
• Harvesting data for CWAHS related chemicals
–Physicochemical properties
–Fate and transport
–Toxicity values
–Exposure data
–Chemical identifiers
–Links to regulatory assessments
Batch Searching
Batch Searching is a big enabler
https://pubs.acs.org/doi/10.1021/acs.jcim.0c01273
22
Batch Search
Batch Search – Excel, CSV, SDF file
Batch Search
We supply predicted data for many endpoints
• Property prediction – e.g., water solubility, vapor pressure
• Fate and Transport – e.g., bioaccumulation, bioconcentration
• Bioactivity – e.g., endocrine disruption
• Models are constantly updated with fresh data, are transparent
in their data, and are open source
26
QSAR Modeled Data are available
• We build models then apply then to our curated datasets
for release, PLUS deliver the models for realtime use
27
Where do we use predictions like this?
• Models are used in many places in our computational
toxicology research
• They are used in the analytical labs to help guide non-
targeted analysis
• By stakeholders for Hazard
profiling of chemicals
28
Where do we use predictions like this?
• Models are used in many places in our computational
toxicology research
• They are used in the analytical labs to help guide non-
targeted analysis
• By stakeholders for Hazard
profiling of chemicals
• Predictions for breakdown
products in the environment
29
Lots of “proof-of-concept” tools in development
• PoCs are research software builds to prove approaches
before moving into production software environments
• PoCs are to figure out how to address specific questions
• Assemble data, develop data model(s), test user interface
approaches, work with test user base to garner feedback
• Since PoCs are internal access data refreshes and application
updates can be more
• Underlying APIs are being used in our research
30
How to compare Hazard Data?
31
How to compare Hazard Data?
NOT Easy to interpret…
32
Hazard Profile
33
• Hazard Comparison module profiles toxicity across chemicals
https://www.epa.gov/chemical-research/cheminformatics
Hazard Profile
On-Hover view of trumping scheme call
34
Hazard Profile
On-click view of underlying data
35
Data to Excel in <60s
36
Linked to Chemical Transformation Simulator
37
Linked to Chemical Transformation Simulator
38
Where can our tools be applied
• Emergency Response utility is obvious…
• Consider East Palestine
39
https://www.cleveland19.com/2023/
02/14/ntsb-announces-preliminary-
malfunction-that-caused-east-
palestine-train-derailment/
POLYPROPYLENE
POLYETHYLENE
Residue lube oil
VINYL CHLORIDE
DIPROPYLENE GLYCOL
PROPYLENE GLYCOL
DIETHYLENE GLYCOL
COMBUSTIBLE LIQ., NOS (ETHYLENE GLYCOL MONOBUTYL ETHER)
SEMOLINA
COMBUSTIBLE LIQ., NOS (ETHYLHEXYL ACRYLATE)
POLYVINYL
PETROLEUM LUBEOIL
POLYPROPYL GLYCOL
ISOBUTYLENE
BUTYL ACRYLATES, STABILIZED
PETRO OIL, NEC
ADDITIVES, FUEL
BALLS,CTN,M EDCL
SHEET STEEL
VEGTABLE, FROZEN
BENZENE
PARAFFIN WAX
FLAKES, POWDER
HYDRAULIC CEMENT
AUTOS PASSENGER
MALT LIQUORS
Hazard Comparison Profiling
40
Hazard Comparison Profiling
Perfect Example of FAIR Data and APIs
• We owe a lot to FAIR data and availability of information
• We curate a lot of our chemistry data using public resources
such as PubChem, ChEBI, Common Chemistry and others
• The availability of Public APIs takes things to another level!
• We have been using the PubChem API to harvest data so
we can build new applications, like the Safety Module
42
Cheminformatics Safety Module (NOT PUBLIC)
Integrate multiple data streams…
43
Some Related Publications of Interest
You want to know more…
• Lots of resources available
– Presentations: https://tinyurl.com/w5hqs55
– Communities of Practice Videos: https://rb.gy/qsbno1
– Manual: https://rb.gy/4fgydc
– Latest News: https://comptox.epa.gov/dashboard/news_info
45
This talk is an overview
• This talk is a high-level overview only. We
can provide trainings into the individual
modules and data as required
• LOTS of training materials are available
https://www.epa.gov/chemical-research/new-approach-methods-nams-training
Conclusions
• Underpinning chemistry data is from the DSSTox database
• CompTox Chemicals Dashboard is public access to DSSTox
and other related databases
• Proof-of-Concept (PoC) tools are built to prove approaches
• Everything is increasingly API driven and APIs are now public
47
Contact Information
• Contact info: williams.antony@epa.gov
• Slides available at: https://www.slideshare.net/AntonyWilliams/
• Obtain articles from Google Scholar Profile
48

Más contenido relacionado

Similar a Cheminformatics tools supporting dissemination of data associated with US EPA Clean Water Act hazardous substances

The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...
The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...
The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...Andrew McEachran
 

Similar a Cheminformatics tools supporting dissemination of data associated with US EPA Clean Water Act hazardous substances (20)

The EPA Comptox Chemistry Dashboard: A Web-Based Data Integration Hub for Env...
The EPA Comptox Chemistry Dashboard: A Web-Based Data Integration Hub for Env...The EPA Comptox Chemistry Dashboard: A Web-Based Data Integration Hub for Env...
The EPA Comptox Chemistry Dashboard: A Web-Based Data Integration Hub for Env...
 
How to place your research questions or results into the context of the "Lega...
How to place your research questions or results into the context of the "Lega...How to place your research questions or results into the context of the "Lega...
How to place your research questions or results into the context of the "Lega...
 
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
 
The EPA CompTox Chemistry Dashboard and Underpinning Software Architecture
The EPA CompTox Chemistry Dashboard and Underpinning Software Architecture The EPA CompTox Chemistry Dashboard and Underpinning Software Architecture
The EPA CompTox Chemistry Dashboard and Underpinning Software Architecture
 
Chemistry data delivery from the US-EPA to support environmental chemistry
Chemistry data delivery from the US-EPA to support environmental chemistryChemistry data delivery from the US-EPA to support environmental chemistry
Chemistry data delivery from the US-EPA to support environmental chemistry
 
TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...
TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...
TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...
 
Environmental Chemistry Compound Identification Using High Resolution Mass Sp...
Environmental Chemistry Compound Identification Using High Resolution Mass Sp...Environmental Chemistry Compound Identification Using High Resolution Mass Sp...
Environmental Chemistry Compound Identification Using High Resolution Mass Sp...
 
Progress in delivering transparency in research data
Progress in delivering transparency in research dataProgress in delivering transparency in research data
Progress in delivering transparency in research data
 
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
 
Utilizing US-EPA Data Dashboards to Support Exposomics research
Utilizing US-EPA Data Dashboards to  Support Exposomics researchUtilizing US-EPA Data Dashboards to  Support Exposomics research
Utilizing US-EPA Data Dashboards to Support Exposomics research
 
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted AnalysisThe US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
 
Accessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data Dashboards Accessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data Dashboards
 
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental scienceUS-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
 
The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...
The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...
The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...
 
New developments in delivering public access to data from the National Center...
New developments in delivering public access to data from the National Center...New developments in delivering public access to data from the National Center...
New developments in delivering public access to data from the National Center...
 
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental scienceUS-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
 
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
 
The EPA Comptox Chemicals Dashboard as a Data Integration Hub for Environment...
The EPA Comptox Chemicals Dashboard as a Data Integration Hub for Environment...The EPA Comptox Chemicals Dashboard as a Data Integration Hub for Environment...
The EPA Comptox Chemicals Dashboard as a Data Integration Hub for Environment...
 
Delivering access to chemistry and bioassay data from the National Center for...
Delivering access to chemistry and bioassay data from the National Center for...Delivering access to chemistry and bioassay data from the National Center for...
Delivering access to chemistry and bioassay data from the National Center for...
 
Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
 

Último

Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bSérgio Sacani
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfSumit Kumar yadav
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.Nitya salvi
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptxRajatChauhan518211
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfrohankumarsinghrore1
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencySheetal Arora
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINsankalpkumarsahoo174
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡anilsa9823
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 

Último (20)

Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 

Cheminformatics tools supporting dissemination of data associated with US EPA Clean Water Act hazardous substances

  • 1. The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of the U.S. EPA Cheminformatics tools supporting dissemination of data associated with US EPA Clean Water Act hazardous substances Antony Williams1, Fran Kremer2, Jason Lambert1, Jace Cuje3 and Valery Tkachenko4 1. Center for Computational Toxicology and Exposure, US-EPA 2. Center for Environmental Solutions and Emergency Response, US-EPA 3. Office of Science Advisor, Policy and Engagement. US-EPA 4. ScienceDataExperts Inc. March 2024: Spring Fall Meeting, New Orleans, LA
  • 2. Data, Model and Tool Development • I work for the Center for Computational Toxicology and Exposure in the Computational Chemistry and Cheminformatics Branch • There are many tools developed by our cheminformatics team and across other centers in EPA. I will represent ours only… • We have production level public-facing tools, proof-of-concept public-facing tools, and many tools in development… • We focus on FAIR data releasing it to the community and making it available on Public APIs 1
  • 3. Free-Access Cheminformatics Tools • The Center for Computational Toxicology and Exposure has delivered many tools including – CompTox Chemicals Dashboard – Proof-of-Concept cheminformatics modules • Chemicals Hazard Profiling • Chemical Transformations Database • Analytical Methods and Spectra • Chemical Safety Profiling 2
  • 4. Research Projects we apply them to
  • 5. 4 Research Projects we apply them to
  • 6. 5 Research Projects we apply them to
  • 7. Curating Chemistry into the DSSTox Database 6 • Chemistry underpins all of our tools • Data assembly and curation is critical • DSSTox assembled over 25 years
  • 9. The Charge for the Dashboard • Develop a “first-stop-shop” for environmental chemical data to support EPA and partner decision making: – Centralized location for relevant chemical data – Chemistry, exposure, hazard and dosimetry – Combination of existing data and predictive models – Publicly accessible, periodically updated, curated • Easy access to data improves efficiency and ultimately accelerates chemical risk assessment
  • 11. Experimental and Predicted Data • Physchem and Fate & Transport experimental and predicted data • Data can be downloaded as Excel, TSV and CSV files
  • 13. Hazard Data for Copper • 2246 rows of human/eco hazard data harvested with three clicks
  • 14. Sources of Exposure to Chemicals
  • 16. Chemical Lists • Chemical lists are focused on regulations, specific research efforts and categories • 425 lists and growing – TSCA Inventory – Clean Water Act Hazardous Substances – Consumer Products database – Chemicals of Emerging Concern – PFAS lists – Extractables and Leachables – …lists are versioned and updated and new lists added 15
  • 20. Clean Water Act Hazardous Substances 19
  • 21. Harvesting Data en masse • Harvesting data for CWAHS related chemicals –Physicochemical properties –Fate and transport –Toxicity values –Exposure data –Chemical identifiers –Links to regulatory assessments
  • 23. Batch Searching is a big enabler https://pubs.acs.org/doi/10.1021/acs.jcim.0c01273 22
  • 25. Batch Search – Excel, CSV, SDF file
  • 27. We supply predicted data for many endpoints • Property prediction – e.g., water solubility, vapor pressure • Fate and Transport – e.g., bioaccumulation, bioconcentration • Bioactivity – e.g., endocrine disruption • Models are constantly updated with fresh data, are transparent in their data, and are open source 26
  • 28. QSAR Modeled Data are available • We build models then apply then to our curated datasets for release, PLUS deliver the models for realtime use 27
  • 29. Where do we use predictions like this? • Models are used in many places in our computational toxicology research • They are used in the analytical labs to help guide non- targeted analysis • By stakeholders for Hazard profiling of chemicals 28
  • 30. Where do we use predictions like this? • Models are used in many places in our computational toxicology research • They are used in the analytical labs to help guide non- targeted analysis • By stakeholders for Hazard profiling of chemicals • Predictions for breakdown products in the environment 29
  • 31. Lots of “proof-of-concept” tools in development • PoCs are research software builds to prove approaches before moving into production software environments • PoCs are to figure out how to address specific questions • Assemble data, develop data model(s), test user interface approaches, work with test user base to garner feedback • Since PoCs are internal access data refreshes and application updates can be more • Underlying APIs are being used in our research 30
  • 32. How to compare Hazard Data? 31
  • 33. How to compare Hazard Data? NOT Easy to interpret… 32
  • 34. Hazard Profile 33 • Hazard Comparison module profiles toxicity across chemicals https://www.epa.gov/chemical-research/cheminformatics
  • 35. Hazard Profile On-Hover view of trumping scheme call 34
  • 36. Hazard Profile On-click view of underlying data 35
  • 37. Data to Excel in <60s 36
  • 38. Linked to Chemical Transformation Simulator 37
  • 39. Linked to Chemical Transformation Simulator 38
  • 40. Where can our tools be applied • Emergency Response utility is obvious… • Consider East Palestine 39 https://www.cleveland19.com/2023/ 02/14/ntsb-announces-preliminary- malfunction-that-caused-east- palestine-train-derailment/ POLYPROPYLENE POLYETHYLENE Residue lube oil VINYL CHLORIDE DIPROPYLENE GLYCOL PROPYLENE GLYCOL DIETHYLENE GLYCOL COMBUSTIBLE LIQ., NOS (ETHYLENE GLYCOL MONOBUTYL ETHER) SEMOLINA COMBUSTIBLE LIQ., NOS (ETHYLHEXYL ACRYLATE) POLYVINYL PETROLEUM LUBEOIL POLYPROPYL GLYCOL ISOBUTYLENE BUTYL ACRYLATES, STABILIZED PETRO OIL, NEC ADDITIVES, FUEL BALLS,CTN,M EDCL SHEET STEEL VEGTABLE, FROZEN BENZENE PARAFFIN WAX FLAKES, POWDER HYDRAULIC CEMENT AUTOS PASSENGER MALT LIQUORS
  • 43. Perfect Example of FAIR Data and APIs • We owe a lot to FAIR data and availability of information • We curate a lot of our chemistry data using public resources such as PubChem, ChEBI, Common Chemistry and others • The availability of Public APIs takes things to another level! • We have been using the PubChem API to harvest data so we can build new applications, like the Safety Module 42
  • 44. Cheminformatics Safety Module (NOT PUBLIC) Integrate multiple data streams… 43
  • 46. You want to know more… • Lots of resources available – Presentations: https://tinyurl.com/w5hqs55 – Communities of Practice Videos: https://rb.gy/qsbno1 – Manual: https://rb.gy/4fgydc – Latest News: https://comptox.epa.gov/dashboard/news_info 45
  • 47. This talk is an overview • This talk is a high-level overview only. We can provide trainings into the individual modules and data as required • LOTS of training materials are available https://www.epa.gov/chemical-research/new-approach-methods-nams-training
  • 48. Conclusions • Underpinning chemistry data is from the DSSTox database • CompTox Chemicals Dashboard is public access to DSSTox and other related databases • Proof-of-Concept (PoC) tools are built to prove approaches • Everything is increasingly API driven and APIs are now public 47
  • 49. Contact Information • Contact info: williams.antony@epa.gov • Slides available at: https://www.slideshare.net/AntonyWilliams/ • Obtain articles from Google Scholar Profile 48