Lhasa Limited: Shared Knowledge, Shared Progress
Agenda and Abstracts

NEW HORIZONS IN TOXICITY PREDICTION

 

Sponsored by: LHASA Limited:

Cambridge University, December 8 -9, 2008

 

Session Chairs:

Professor Robert Glen, Unilever Center for Molecular Informatics

Dr Alan Wilson, Lexicon Genetics Incorporated

Dr David Hawkins, Lhasa Limited Board of Directors

 

DAY ONE:

 

9:30 am Welcome and Opening Remarks
Dr David Hawkins & Professor Robert Glen (Chair)

 

Session 1: Current Approaches for Toxicity Prediction

 

        Current Status of Toxicity Prediction

        What is currently available?

        How are these tools being used by Regulators,   Industry & Academia ?

        Unmet needs?


9:45 am Opening Remarks

Dr David Hawkins (Chair)

 

9:50 am Pharmaceutical Regulatory Perspective
Dr Edwin J. Matthews, Food & Drug Administration


Abstract
US FDA and pharmaceutical industry are under increasing political, scientific, and fiscal pressure to modernize the pharmaceutical development and the regulatory review processes (see FDA's Critical Path). Predictive software programs are now recognized as cost-effective tools to facilitate this modernization and to improve upon the safety and efficacy of drug products. The programs are expected to provide rapid and reliable decision support information on the activities of substances detected in pre-clinical toxicology tests; absorption, distribution, metabolism, and excretion (ADME) properties of lead chemicals; and adverse effects observed in human clinical trials and post market surveillance.

Expectations for predictive software program performance are very high; they must have: (1) a high domain of applicability for FDA regulated substances, (2) both highly specific and highly sensitive predictions of activities, (3) a reasonable interpretation of the mode of action (MOA) of predicted activities, (4) measures of model performance through internal and external validation studies, (5) transparency of the prediction paradigm and weight of evidence employed for the training data-set algorithm, and (6) documentation of the data sources employed. These high expectations can best be met when the predictive software utilizes the collective knowledge of toxicology and clinical studies present in both publicly available sources and in proprietary studies in agency and industry archives.

We at the FDA are meeting these objectives through collaborations with a consortium of software developers. Harvesting the knowledge in FDA and EPA archival data is in progress and non-proprietary portions of these data have already been made publicly available in part. Methods to achieve high domain of applicability, specificity, sensitivity, and accurate MOA predictions in validated QSAR models and expert systems have been reported and new improvements are continuing to be realized.


10:25 am In Silico Tools & Guidance developed by the European Chemicals Bureau
Dr
Andrew Worth, European Chemicals Bureau


Abstract
To promote the availability of reliable computer-based estimation methods for the regulatory assessment of chemicals, the European Chemicals Bureau (ECB) within the European Commission's Joint Research Centre (JRC) has been developing a range of user-friendly and publicly accessible software tools.

Toxtree predicts various kinds of toxic effect by applying decision tree approaches. The set of decision trees currently includes the Cramer classification scheme, the Verhaar scheme, the BfR rulebases for irritation and corrosion, and the Benigni-Bossa scheme for mutagenicity and carcinogenicity. Additional rulebases are under development.

Toxmatch generates quantitative measures of chemical similarity. These can be used to compare datasets and to calculate pairwise similarity between compounds. Consequently, Toxmatch can be used to compare model training and test sets, to facilitate the fomration of chemical categories, and to support the application of chemical read-across.

DART (Decision Analysis by Ranking Techniques) was developed to make ranking methods available to scientific researchers. DART is designed to support the ranking of chemicals according to their environmental and toxicological concern and is based on the most recent ranking theories. Different kinds of order ranking methods, roughly classified as total and partial-order ranking methods are implemented.

Finally, the ECB is developing a web-based inventory of (Q)SAR models (the JRC QSAR Model Database) which will help to identify relevant (Q)SARs for chemicals undergoing regulatory review. The JRC QSAR Model Database will provide publicly-accessible information on QSAR models and will enable any developer or proponent of a (Q)SAR model to submit this information by means of a QSAR Model Reporting Format (QMRF). The in silico tools developed by the ECB can be found on the ECB website (http://ecb.jrc.it/qsar/).

 

11:00 am Break

 

11:30 am Modelling and informatics support for safety an metabolism studies in early drug discovery projects

Dr Scott Boyer, Astra Zeneca

 

Abstract
Access to metabolism and toxicology data is critical to effective decision making in early drug discovery projects. Often in such projects little is known about the therapeutic target and usually even less is known about potential metabolism or adverse effects of the chemical series being investigated. Simply providing unstructured metabolism- and safety-related information on targets and chemical series to project teams trying to make decisions is not adequate due to the varied nature and quality of metabolism and toxicology data. This presentation gives examples of how relevant data can be structured, mined and in some cases modelled to enhance decision-making. Project examples will be presented of QSAR models and their interpretation, including characterization of the underlying assay error for better interpretation of the model results, development of SAR systems that support decision-making and enhance awareness around such endpoints as metabolism/P450 activation, mutagenesis, hERG and reactive intermediates. In general, metabolism and toxicology data should be structured depending on, 1) its intended use, 2) its overall quality and 3) its internal data structure (text vs. numerical) to assure its optimum use. Brief examples of the varying data types and their usage in project decision making will be presented along with some strategies for hypothesis generation around adverse events using a combined approach of molecular modelling/virtual screening and text mining. Together, these tools, built to be appropriate to the various data types, represent a basic toolkit for the toxicologist and drug metabolism scientist needing to make meaningful contributions to chemical design decisions made in early drug discovery projects.

12:05 am Recent Developments in Toxico-Cheminformatics:                 Supporting a New Paradigm for Predictive Toxicology

Dr Ann Richard, US Environmental Protection Agency


Abstract
EPA's National Center for Computational Toxicology is building capabilities to support a new paradigm for toxicity screening and prediction through the harnessing of legacy toxicity data, creation of data linkages, and generation of new high-content and high-thoughput screening data. In association with EPA's ToxCastTM, ToxRef DB, and ACToR projects, the DSSTox project provides cheminformatics support and, in addition, is improving public access to quality structure-annotated chemical toxicity information in less summarized forms than traditionally employed in SAR modeling, and in ways that facilitate data-mining and data read-across. The latest DSSTox version of the Carcinogenic Potency Database file (CPDBAS) illustrates ways in which various summary definitions of carcinogenic activity can be employed in modeling and data mining. The DSSTox Structure-Browser provides structure searchability across all published DSSTox toxicity-related inventory, and is enabling linkages between previously isolated toxicity data resources associated with environmental and industrial chemicals. The public DSSTox inventory also has been integrated into PubChem, allowing a user to take full advantage of PubChem structure-activity and bioassay clustering features. Phase I of the ToxCastTM project is generating high-throughput screening data from several hundred biochemical and cell-based assays for a set of 320 chemicals, mostly pesticide actives with rich toxicology profiles. Incorporating traditional SAR concepts into this new data-rich world poses conceptual and practical challenges, but also holds great promise for improving predictive capabilities. This work was reviewed by EPA and approved for publication, but does not necessarily reflect EPA policy.

 

12:40 pm Lunch

 

Session 2: Strengths and Limitations of Current Toxicity Prediction Systems

 

        How do different approaches compare (QSAR,   Knowledge-Based, Neural Networks)

        Pros and Cons of different systems

        Toxicity databases and tool boxes

 

2:00 pm Opening Remarks

Professor Robert Glen (Chair)

 

2:10 pm The Role of Predictive Data Mining in Understanding Toxicity

Professor Chihae Yang, Leadscope Inc.

 

Abstract
Toxicity data from various sources are integrated into a rigorously designed database using the ToxML schema. The public database sources include FDA submission data from approved new drug applications, food contact notifications, generally recognized as safe food ingredients, as well as chemicals from NTP and CCRIS databases. The data from various sources are then combined according to ToxML criteria. The resulting database is used for data mining analysis to investigate, for example, genotoxic and non-genotoxic carcinogens. Data analysis methods include both knowledge building and applying informatics, leading to predictive data mining. Extraction and validation of structural alerts, grouping compounds by mode-of-action driven alerts, and building predictive models within this space using chemical and biological descriptors are discussed.


2:45 pm Consensus QSAR Models

Dr Mark Cronin, Liverpool John Moores University


Abstract
In silico approaches to predicting toxicity can range from the relatively simple to the remarkably complex. At the simplest level are structure-activity relationships (SARs) or the qualitative relationship between a chemical sub-structure and a toxicological event. An understanding of the structural basis for toxicology may allow for the formation of (mechanistically based) categories, and hence allow for read across or extrapolations.

Quantitative structure-activity relationships (QSARs) vary from simplistic linear regression analyses to wholly multivariate and non-linear models such as neural networks. Within all these approaches, there is a capability to form a consensus from predictions both within models (the so-called consensus-QSAR approaches) and from between models. Forming a consensus from populations of (regression-based) QSAR models can have advantages in improving statistical fit by bringing in outliers, but the loss of transparency of models may be viewed as a disadvantage. Particularly exciting for the prediction of toxicity is the formation of a consensus between different modelling approaches, e.g. combination of information from read-across, (Q)SARs, experts systems as well as other information such as in chemico reactivity or in vitro data.

Approaches to predict toxicity, in the context of forming a consensus using weight-of-evidence within integrated testing strategies, will be presented.

 

3:20 pm Break

 

3:55 pm QSAR Approaches, Models & Statistics Relating to Toxicity Prediction

Professor Douglas Hawkins, University of Minnesota


Abstract
Quantitative Structure Activity/Property/Toxicity Relationship modelling has a long history. The first statistical tools used were multiple linear regression for a ordinal measure of activity, and logistic regression and linear or quadratic discriminant analysis for a binary measure of activity. These methods generally work quite well when n, the number of available compounds for calibrating the model, is large, and p, the number of predictors, is small. They do rely on additivity and linearity assumptions which, with small p, can be diagnosed and remedied graphically.

More recently, many QSAR data sets have used values of p in the hundreds to thousands. At this point, traditional graphical methods of diagnosing interaction and non-linearity are ineffective and, even if the traditional models can be fitted, they tend to work poorly because of the impact of collinearity.

Current QSAR methodologies for 'large p' problems address different deficiencies of the traditional methods and in different ways. Partial least squares (PLS), ridge regression (RR), the elastic net (EN) and principal component regression (PCR) keep the assumption of a linear relationship, but address the collinearity issues associated with large p, particularly if coupled with small n. Neural nets and support vector machines (SVMs) concentrate on curing possible nonlinearities in the relationship between the dependent and the predictors, but are impacted by the 'large p' problem. Nearest neighbours models have minimal assumptions about the nature of the relationship, but rely on compounds being relatively evenly spread around in the predictor space. Recursive partitioning (RP) methods assume neither additivity nor linearity, and can handle arbitrarily large p values, but require large n.

There is therefore no 'one size fits all' statistical methodology.

Feature selection - picking out a relatively few predictors that account for the relationship - is desirable both for making better predictions and for understanding underlying mechanisms, but is a potential source of large biases.

Having fitted a model, one needs to evaluate its usefulness for predicting the toxicity of as-yet-unstudied compounds whose chemical structure is known. Unless very large samples are available for the fitting and verification, this is best done using some form of cross-validation which allows all available compounds to be used for both the modelling and the model assessment steps.

 

4:30 pm Knowledge-Based Approaches

Dr Nigel Greene, Pfizer

 

5:05 pm Models and Databases for Genetic/Carcinogenic Toxicity

Romualdo Benigni, National Institute of Health, Italy


Abstract
The Structure-Activity Relationships paradigm provides a wide range of tools that can be exploited in the research on, and regulation of toxic chemicals. Such tools have different degrees of approximation / uncertainty, and apply to different scopes.

On one side, there are coarse-grain approaches such as the Structure Alerts (SA) (often accompanied by modulating factors). Together with being a repository of the science on the chemical biological interactions at the basis of chemical carcinogenicity, the SAs have a crucial role in practical applications for risk assessment, for: a) description of sets of chemicals; b) preliminary hazard characterization; c) formation of categories for e.g., regulatory purposes; d) generation of subsets of congeneric chemicals to be analyzed subsequently with Quantitative Structure-Activity Relationships (QSAR) methods; e) priority setting .

On the other side, there are the fine-tuned QSARs for congeneric classes of chemicals. Recently, a range of good quality, local QSARs for mutagenicity and carcinogenicity have been assessed in our laboratory, and challenged for their predictivity in respect to real external test sets (i.e., chemicals never considered by the authors while developing their models). The QSARs for potency (applicable only to toxic chemicals) generated predictions 30 to 70 % correct, whereas the QSARs for discriminating between active and inactive chemicals were 70 to 100 % correct in their external predictions: thus the latter can be used with good reliability for applicative purposes. The same study showed that internal, statistical validation methods, which are often assumed to be good diagnostics for predictivity, did not correlate well with the predictivity of the QSARs when challenged in external prediction tests.

Non local QSARs, designed for non congeneric sets of chemicals, showed a high degree of variability supposedly linked mainly to the definition of the applicability domain.

The crucial role of mechanistic knowledge in the process of applying Structure-Activity considerations to risk assessment should be strongly emphasized. Mechanistic knowledge provides a ground for interaction and dialogue between model developers, toxicologists and regulators, and permits the integration of the (Q)SAR results into a wider regulatory framework, where different types of evidence and data concur or complement each other as a basis for making decisions and taking actions. Whenever possible, (Q)SARs with clear links to the underlying toxicity mechanisms should be preferably applied.

 

5:40 pm- End -

 

7.30pm Symposium Dinner: Peterhouse: College Campus

 


DAY TWO:

 

Session 3 Emerging Areas and Technologies in Toxicity Prediction

 

       

8:30 am Opening Remarks

Dr Alan Wilson (Chair)

 

8:40 am Use of Biosimulation and Toxicogenomics to Predict the Efficacy of Antiatherogenic Therapies, Cardiovascular Toxicity and Outcomes

Dr Héctor De León, Entelos


Abstract
Despite the progress made in the treatment of cardiovascular (CV) disease, the high failure rate of therapies entering late-stage clinical development and the unexpected appearance of adverse CV events highlight the need for predictive technologies to complement drug discovery and clinical trial design. We have developed a unique set of biosimulation and gene expression profiling tools aimed at the early identification of effective drug candidates with low toxicity. We assembled the CV PhysioLab® platform, a large-scale mathematical model of human lipid metabolism and CV pathology, to evaluate the potential efficacy of alternate therapeutic approaches. The model uses differential equations to represent interactions of cells and biomolecules linked to key CV clinical outcomes (e.g. myocardial infarction). Finite-element modeling is used to simulate the temporal changes in the structure of atherosclerotic plaques that lead to rupture. The structural stability of the plaque can be linked to an estimated risk of a CV event. Virtual patients and patient populations are used to represent different pathophysiological hypotheses and to analyze the impact of phenotypic variability in response to therapies and drug-induced toxicity. We also developed DrugMatrix®, an extensive toxicogenomic database of microarray expression data linked to classic preclinical and clinical toxicology measurements, to identify predictive gene expression profiles. Hypotheses generated from these profiles can be simulated in the CV PhysioLab platform to identify biomarkers predictive of adverse events. Together, these technologies can significantly increase the likelihood of identifying effective compounds with low toxicity profiles early in the drug development process.

 

9:15 am Emerging Areas in Toxicity Prediction: NIHS Perspective

Dr Akihiko Hirose, National Institute of Health Sciences,Japan



Abstract
Huge kinds of chemicals exist in our environment, and the risks of majority industrial chemicals have not been evaluated. We urgently need to develop a high throughput evaluation system for the human risk of these chemicals. The (quantitative) structure activity relationships ((Q)SAR) approach has possibility to resolve this issue, but individual QSAR system has not so powerful to judge administratively in the default settings. Recently, NIHS has started to develop the flow to assess chemical genotoxicity in combination of three in silico systems. We selected DEREK Windows, MultiCASE and ADMEWorks, because of different modes for analyses. At first, we customized each system for mutagenisity prediction, i.e., bacterial gene mutation and in vitro chromosomal aberration assays. In combination approach, the concordance between in vitro and in silico assays on bacterial gene mutation reached to around 94%, although the applicability decreased to 55%. Next, we tried the similar approach for developing the chromosomal aberration predicting system. The performance of chromosomal aberration prediction is still lower than that of bacterial mutagenicity prediction and the further development are required.

In addition to these genotoxicity studies, repeated dose rat toxicity studies are commonly used as screening tests for evaluating the human risk of industrial chemicals. However, no appropriate in silico general toxicity evaluating system is available at present. We analyzed the toxicity profiles of hundreds of the twenty-eight days repeated dose studies. The hepato- and/or renal-toxicities were selected as major endpoints. We decided to focus to develop the predicting system of these two types of toxicity, by searching new sub-structural alerts for DEREK Windows and using the discriminant-based QSAR model builder, such as the Leadscope Predictive Data Miner (Leadscope Inc.).

NIHS also joined to the multi-institutional project of the developing for the repeated-dose toxicity knowledge-base system, which could assist toxicological expert judgment, or support preliminary governmental decisions. The system consists of three parts, the detailed sub-chronic toxicity studies database, the toxicity mechanisms database, and the metabolite predicting system. The project is lead by National Institute of Technology and Evaluation (NITE), Tohoku University, Kwansei Gakuin University, Fujitsu Co. Ltd., and NIHS. The project is financially supported by the New Energy and Industrial Technology Development Organization (NEDO). The parts of this in silico knowledge-base system will be integrated to the OECD (Q)SAR Application Tool Box, and will support the categorical approach on HPV chemicals evaluation. We are also developing repeated-dose toxicity (Q)SAR system in future.

 

9:50 am Coffee Break

 

10:20am Consumer Products Industries Perspective

Dr Cameron MacKay, Unilever - Safety & Environmental Assurance Centre

 


Abstract
Assuring the safety of consumer products without the need to conduct animal tests is a considerable challenge. The mouse local lymph node assay (LLNA) is now used widely to generate data for assessing the risk of chemical-induced skin sensitization. However, changes in EU legislation [i.e. 7th Amendment to the EU Cosmetics Directive] have made developing non-animal approaches to provide the data for skin sensitization risk assessment a key business need. In collaboration with Entelos Inc, Unilever has developed a large-scale in silico model (comprised of nonlinear ODEs) of skin sensitization induction to characterise and quantify the contribution of implicated pathways to the overall biological response. Such knowledge is crucial in guiding the development of in vitro assay development for use in consumer safety risk assessment.

The model describes the response in mouse over a 7-day period following exposure to dinitrochlorobenzene (a well known contact allergen) and includes both epidermal and lymph node cellular processes implicated in skin sensitisation such as cytokine responses, cell surface marker regulation, cellular migration and proliferation. In order to populate the model, in vivo and in vitro data from the published literature were used. Some of the data, such as epidermal cytokine release in response to chemical insult, were used to build focused submodels of the biology. Mouse local lymph node assay data (an assay for the identification of contact allergens that covers topical exposure through to measurement of the resulting immune response) were used in order to ensure that, acting together, these submodels could model the full system response effectively.

The modelling uncovered a previously underappreciated pathway in skin sensitisation and showed it to be key to the sensitisation response. Additionally, the modelling revealed a number of gaps in both the current mechanistic knowledge and the available data. Unilever is using the model to focus and guide its future research in the area of skin sensitisation.

10:55 am Challenges In Predicting Metabolism & Toxicity with Known & Abstract Targets

Dr Fred Guengerich, Vanderbilt University


Abstract
The costs of drug development and environmental risk assessment continue to increase, and the availability of better in silico and in vitro methods has the potential to yield better and more economical predictions. Metabolism issues are key to some toxicities, e.g. acetaminophen. However, an accurate assessment of the fraction of all drug and other toxicities is due to metabolism is unavailable and, even when metabolism is agreed to be central, the ensuing biological events are not well understood.

A need exists for better biomarkers and assays to predict not only bioactivation but also off-target toxicity, immune-related toxicity, and idiosyncratic reactions. Although much has been learned about the enzymes involved in the metabolism of many drugs and other xenobiotics, predictions are not trivial. Actual protein structures are very preferable to homology models, but even when these are available they often do not predict products accurately.

Issues include non-product complexes and the limitations on obtaining structures of enzymes relevant to the key catalytic steps. Predicting toxicity in silico is difficult in that relatively few molecules are recognized in off-target and other toxicities. Even when known, the structures may still be models (e.g. hERG channel). In most toxicity events, the mechanism of action is unknown at the molecular level and relatively few biological structures can be used. Thus, many aspects of predictive toxicology still involve structure-activity relationships among chemicals, but the issues of relevance of the parameters used in comparisons is critical in judging the usefulness of the analyzer.
(Supported in part by U.S. Public Health Service grants R01 ES010546 and P30 ES000267.)

 

11:30 am Panel Discussion: Future Needs and Challenges*

 

        Identification of Gaps, Future Needs and   Challenges

        Increasing application of in silico modeling

        Barriers to implementation

 

1:00 pm Closing Remarks & End of Conference

This event precedes the Lhasa Limited European ICGM, which will also be hosted by the University of Cambridge and provides an opportunity to meet Lhasa Limited scientists and members of the Sales team, learn about Lhasa Limited's software and services, and see demonstrations of our programs. Click here for more information.

University of Cambridge University of Cambridge