National Cancer Institute
The cancer Biomedical Informatics Grid (caBIG):
Enabling the patient-centric molecular medicine revolution
J. Robert Beck, M.D. Fox Chase Cancer Center R. Mark Adams, Ph.D. Booz Allen Hamilton
0
National Cancer Institute
Talk Agenda
caBIG™, A Cancer Center Perspective
Program Motivations Development of the Program
Program Structure and Process Program Timeline and Products
Example Program Product: C3D Panel Discussion
1
National Cancer Institute
Talk Agenda
caBIG™, A Cancer Center Perspective
Program Motivations Development of the Program
Program Structure and Process Program Timeline and Products
Example Program Product: C3D Panel Discussion
2
National Cancer Institute
A Focus on Interaction
3
National Cancer Institute
Scenario, 2008
Small Molecules
Cell Type
A researcher involved in a phase II clinical trial of a new molecularly targeted therapeutic for brain tumors observes that cancers derived from one specific tissue progenitor appear to be strongly affected. The trial has been generating proteomic and microarray data. The researcher would like to identify potential biochemical and signaling pathways that might be different between this cell type and other potential progenitors in cancer, deduce whether anything similar has been observed in other clinical trials involving agents known to affect these specific pathways, and identify any studies in model organisms involving tissues with similar pathway activity.
Pathways
Clinical Trials
Therapeutics
Animal Models
4
Homologous Proteins
National Cancer Institute
Today vs 2008?
In 2005 such a study would involve immense manual work getting information locally and from other sites, precluding the possibility of identifying the required data and thus being unable to deduce the likely significance of the trial observation.
However, with caBIG™ compliant components now under development, the researcher would be able to perform the analysis routinely, with data flowing through systems and analysis being automatic. This analysis will yield biomarkers and potential drug targets gathered from multiple workspaces and make it possible to develop treatment modalities faster, less expensively, and more effective for patients.
5
National Cancer Institute
Discovery utilizing caBIG™ Integrated Cancer Research Tools
FunctionE xpress
Promoter DB Identify recurring promoter elements
Gene Pattern
Gene annotation
Analysis Identify upregulated genes in specific pathways
Clinical Trials Tumor Samples
400 brain tumor tissue samples acquired
Gene expression profiling
caArray
Pathways Tool
caTIES
Pathology reports caTissue
Discrete and manual annotation on tissues
Clinical Annotation Modules
Potential Drug Targets and Biomarkers
Mutation identification
TrAPSS
Proteomics LIMS
Analysis
Q5
Annotation
PIR
6
National Cancer Institute
Overall Goals for caBIG™
Three-year (mid-2008)
•
Develop sufficient research tools and standards to have a positive impact on the cancer research community, as measured by adoption of relevant caBIG principles in project proposals.
•
Ensure widespread adoption of developer standards so that funded developer projects are operating under the Gold standard of compatibility.
Adopt and use caBIG interoperable tools and data sets within the caBIG community.
•
•
Develop mechanisms for engaging and promoting caBIG compliant technologies and established datasets within the oncology research community.
7
National Cancer Institute
Overall Goals for caBIG™
Five-year (mid-2010)
•
Ensure widespread adoption, dissemination, and use of caBIG interoperable tools, standards, and data sets within the larger cancer community, to include the biopharmaceutical industry, non-NCI cancer centers, and the national cancer research enterprise. Begin to see results of caBIG-compliant interdisciplinary and inter-institutional research affecting clinical oncology care.
•
8
National Cancer Institute
What does caBIG mean to the Cancer Centers?
• • • Product—in CTMS, ICR, Tissue Banking
Support for large scale collaborative research No need to ―go it alone‖ with bioinformatics or other quantitative support tools
9
National Cancer Institute
Clinical Trials Management: Three Levels of Interaction Based on Institutional Readiness
No Existing System
C3D Adoption exchange data
Additional Functionality
Existing System: Change Imminent
New caBIG modules
caBIG Open Source Core Functionality
Interoperable caBIG Community
Existing System: No Change
caBIG compatibility efforts with vendor
Additional Functionality
10
National Cancer Institute
What does caBIG offer the Cancer Center for Clinical Trials Management?
–
–
Cancer Center Clinical Database or C3D – available today
Adverse Event Reporting or caAERS – available 1st Qtr 2006
–
–
Clinical Trial Structured Protocol Representation (BRIDG) – available (in draft) now
Key is to understand the workflows to support clinical trials and choose a starting point for use of caBIG tools
National Cancer Institute
Tools for Translational Research
• caBIG Compatibility Guidelines
– 3 year plan: all tool development at Gold level
•
•
Successful Developer-adopter relationships
Extension to other research areas
– Imaging – Population Science
12
National Cancer Institute
ICR Software Maturity Table
Maturity Level
1 Basic research/Design
Description
The key concepts for the system have been defined, algorithms outlined, use cases documented, requirements and specification document(s) prepared. A semi-functional system has been implemented for the purpose of getting feedback from potential end-users on the execution of the design. System has gone through at least one round of testing by end users and some feedback has been incorporated. System is in trial use by a limited number of users. Only intended for limited distribution by users in close contact with the development team. System has gone though a thorough beta test and is deployed and in use at at least one site. Additional sites may deploy the software with possible assistance from the development team. Initial user and developer guides are available. Known defects are documented. System is readily deployed by any interested party with no assistance from the development team. Known defects are documented. Mature user and developer guides are available as well as training materials.
2
Prototype/Proof of Principle
3
First working implementation (―beta‖)
4
Deployed and in use
5
Post deployment and maintenance
13
National Cancer Institute
Cancer Center Roles in caBIG
–
Developer (20% of centers) • Key is to create an environment for sharing tools with other centers • One of the most important issues is not to ignore the need for common data elements and vocabulary services Adopter (20% of centers) • Key is to understand the needs at local center (and be vocal) • Don’t abandon other development efforts; think modular • When adopting tools make sure they “talk” to legacy systems Working Group & Strategic Planning (60% of centers) • These are not “soft” roles • Critical to the success of the program • White paper development will guide caBIG successes • Make sure to communicate internally to all parts of the Cancer Center
–
–
National Cancer Institute
caBIG Compatibility
•
What does caBIG ―compatibility‖ mean to CC – caBIG is all about Interoperability • Key is to create tools for sharing info with other centers • Not an easy task due to “silo” mentality • CDEs and EVS enable common language and data values • Software design principles ensure cross compatibility of developed software – Extensible infrastructure • Expandable and modular software to plug in to existing systems so current development efforts are not wasted • Scalable so that efficiency is gained by software engineering – Ensures partnerships • Built on commercial software development principles • Encourages relationships between academia, government and industry • Scalable so that commercial partners can utilize “open source”
15
National Cancer Institute
Talk Agenda
caBIG™, A Cancer Center Perspective
Program Motivations
Development of the Program Program Structure and Process
Program Timeline and Products Example Program Product: C3D
Panel Discussion
16
National Cancer Institute
The NCI 2015 challenge goal:
… eliminate suffering and death due to cancer
―When I look into the eyes of a patient losing the battle with cancer, I say to myself, It doesn’t have to be this way.‖
Dr. A.C. von Eschenbach, M.D. Director, National Cancer Institute
The Nation’s Investment in Cancer Research (2003)
17
National Cancer Institute
NCI biomedical informatics
Goal: A virtual web of interconnected data, individuals, and organizations redefines how:
• • • research is conducted care is provided patients/participants interact with the biomedical research enterprise
18
National Cancer Institute
Biomedical information tsunami
• overwhelming volume of data multitude of sources
•
19
National Cancer Institute
Informatics tower of Babel
• Each cancer research community speaks its own scientific ―dialect‖
•
Integration critical to achieve promise of molecular medicine
20
National Cancer Institute
Talk Agenda
caBIG™, A Cancer Center Perspective
Program Motivations
Development of the Program Program Structure and Process
Program Timeline and Products Example Program Product: C3D
Panel Discussion
21
National Cancer Institute
cancer Biomedical Informatics Grid
• • • Common, widely distributed infrastructure permits cancer research community to focus on innovation Shared vocabulary, data elements, data models facilitate information exchange Collection of interoperable applications developed to common standard Raw cancer research data is available for mining and integration
•
22
National Cancer Institute
caBIG principles
• Open source
• Open access
• Open development • Federated
23
National Cancer Institute
Current caBIG community
• NCI-designated Cancer Centers (50)
– Academic Centers (integrated into broader biomedical infrastructure) – Stand-alone (community leaders)
– Community outreach
• • • •
Government Industry International Groups >700 active participants
24
National Cancer Institute
caBIG action plan
• Establish pilot network of NCI Cancer Center – Groups agreeing to caBIG principles – Mixture of capabilities – Mixture of contributions
• •
Expanding collection of participants Establish consortium development process – Collecting and sharing expertise
– Identifying and prioritizing community needs – Expanding development efforts
•
Moving at the speed of the internet…
25
National Cancer Institute
caBIG: a new way of doing business…
• Coordinated development
• Active management • Community directed • Common services
26
National Cancer Institute
27
National Cancer Institute
Common needs helped shape priority areas for the caBIG pilot activities
Database & Datasets Imaging Tools & Databases Integration High Performance Computing Pathways Licensing Issues LIMS Meeting Microarray & Gene Expression Tools Proteomics Remote/Bandwidth Visualization & Front-End Tools Statistical Data Analysis Tools Vocabulary & Ontology Tools & Databases Meta-Project Common Data Elements & Architecture Center Integration & Management Tissue & Pathology Tools Access to Data Translational Research Tools Distributed Data Sharing/Analysis Tools Staff Resources Clinical Data Management Tools
0
5 10 15 20 25 Number of Needs Reported
Clinical Trial Management Systems
Tissue Banks & Pathology
Integrative Cancer Research
30
35
28
National Cancer Institute
Talk Agenda
caBIG™, A Cancer Center Perspective
Program Motivations
Development of the Program Program Structure and Process
Program Timeline and Products Example Program Product: C3D
Panel Discussion
29
National Cancer Institute
30
National Cancer Institute
Four Domain Workspaces and two Cross Cutting Workspaces have been launched
DOMAIN WORKSPACE 1 Clinical Trial Management Systems
DOMAIN WORKSPACE 2 Integrative Cancer Research DOMAIN WORKSPACE 3 Tissue Banks & Pathology Tools DOMAIN WORKSPACE 4 Imaging addresses the need for consistent, open and comprehensive tools for clinical trials management. provides tools and systems to enable integration and sharing of information. provides for the integration, development, and implementation of tissue and pathology tools. provides for the sharing and analysis of in vivo imaging data. CROSS CUTTING WORKSPACE 1 Vocabularies & Common Data Elements CROSS CUTTING WORKSPACE 2 Architecture
responsible for evaluating, developing, and integrating systems for vocabulary and ontology content, standards, and software systems for content delivery developing architectural standards and architecture necessary for other workspaces.
31
National Cancer Institute
Strategic Level Planning Groups
Data Sharing and Intellectual Capital
Addresses issues related to the sharing of data, applications and infrastructure both within the consortium and in the larger cancer research community.
Training
Developing strategies for providing training in the use of the caBIG developed resources including on-line turtorials, workshops, training programs.
caBIG Strategic Planning
Assists in identifying strategic priorities for the development and evolution of the caBIG effort.
32
National Cancer Institute
The NCI’s General Contractor
•
•
Responsible for coordination of Workspace activities
Provides funding through contracts to participating centers
• •
•
Provides a channel for communication of guidance and priorities Provides a measure of participant progress (e.g., contract milestones)
Fosters accountability and provides mechanisms for conflict resolution
The NCI is investing resources for administration and support of the caBIG project, decreasing the burden on the participating community
33
National Cancer Institute
How does caBIG work?
Develop Project Plan, Cost Proposal, and Deliverable/ Payment Schedule Put Contract Mechanism in Place with each Center (one time activity) Begin Work Deliver Product/ Service Submit Invoice to General Contractor
Centers
Provide Monthly Progress Reports to General Contractor
General Contractor
Develop SOW & Deliverables/ Milestones
Review Project Plan & Cost Proposal
Notify Center of Acceptance Issue Task Order to Center Review Product/ Service
Pay Center Invoice
NCI
Accept Product/ Service & Authorize Payment
34
National Cancer Institute
How can my Center Participate?
•Participation in Workspaces
•Development of Tools
•Sharing of Data •Adoption of Tools
•etc.
The main mechanism is through RFPs issued by the Project on relevant topics- Workspace participation is a key way to stay abreast of relevant opportunities..
35
National Cancer Institute
How can my Center get Funding?
•
•
Your Center may be already funded- as are 50 Designated Cancer Centers nationwide
The caBIG™ Point of Contact for your organization is a key resource
•
Those with projects that may be of interest to the program are encouraged to participate in the Workspaces, and be on the project mailing lists where RFPs for project activities are distributed
36
National Cancer Institute
How can my Center benefit from caBIG™ Tools?
• Everything developed by the program is open source and freely available
•
The latest versions of all the software developed as part of the project can be obtained from the caBIG™ CVS site:
– http://cabigcvs.nci.nih.gov/viewcvs/viewcvs.cgi/
•
Commercial-grade documentation is provided as part of the project, which will be located at the project gforge site:
– http://gforge.nci.nih.gov
37
National Cancer Institute
How can I get support for these tools?
NCICB Applications Support will coordinate support for caBIG™ tools:
• Live Support: Monday – Friday 8 am – 8 pm Eastern Time
– Telephone support is available Monday to Friday, 8 am – 8 pm Eastern Time, excluding government holidays. – You may leave a message, send an email or submit a support request via the Web at any time.
• • • •
Email: ncicb@pop.nci.nih.gov Phone: 301-451-4384 Toll-free: 888-478-4423 Web: http://ncicbsupport.nci.nih.gov
38
National Cancer Institute
Talk Agenda
caBIG™, A Cancer Center Perspective
Program Motivations
Development of the Program Program Structure and Process
Program Timeline and Products Example Program Product: C3D
Panel Discussion
39
National Cancer Institute
caBIG Deliverables
• The project is providing useful products in a variety of areas:
– Clinical Trials (C3D, caAERS, etc.)
– Tissue Banks and Pathology (caTISSUE, caTIES, etc.) – Bioinformatics (caArray, caWorkbench, etc.)
•
Tools are developed by, and with the direct specification of, the Cancer Center community, and reflect the needs and capabilities of that community.
40
National Cancer Institute
41
National Cancer Institute
42
National Cancer Institute
43
National Cancer Institute
44
National Cancer Institute
45
National Cancer Institute
A Sample of the caBIG Participant Community
9Star Research Albert Einstein Ardais Argonne National Laboratory Burnham Institute California Institute of Technology-JPL City of Hope Clinical Trial Information Service (CTIS) Cold Spring Harbor Columbia University-Herbert Irving Consumer Advocates in Research and Related Activities (CARRA) Dartmouth-Norris Cotton Data Works Development Department of Veterans Affairs Drexel University Duke University EMMES Corporation First Genetic Trust Food and Drug Administration Fox Chase Fred Hutchinson GE Global Research Center Georgetown University-Lombardi IBM Indiana University Internet 2 Jackson Laboratory Johns Hopkins-Sidney Kimmel Lawrence Berkeley National Laboratory Massachusetts Institute of Technology Mayo Clinic Memorial Sloan Kettering Meyer L. Prentis-Karmanos Ohio State University-Arthur G. James/Richard Solove Oregon Health and Science University Roswell Park Cancer Institute St Jude Children's Research Hospital Thomas Jefferson University-Kimmel Translational Genomics Research Institute Tulane University School of Medicine University of Alabama at Birmingham University of Arizona University of California Irvine-Chao Family University of California, San Francisco University of California-Davis University of Chicago University of Colorado University of Hawaii University of Iowa-Holden University of Michigan University of Minnesota University of Nebraska University of North Carolina-Lineberger University of Pennsylvania-Abramson University of Pittsburgh University of South Florida-H. Lee Moffitt University of Southern California-Norris University of Vermont University of Wisconsin Vanderbilt University-Ingram Velos Virginia Commonwealth University-Massey Virginia Tech Wake Forest University Washington University-Siteman Wistar 46 Yale UniversityNorthwestern University-Robert H. Lurie
National Cancer Institute
Talk Agenda
caBIG™, A Cancer Center Perspective
Program Motivations
Development of the Program Program Structure and Process
Program Timeline and Products Example Program Product: C3D
Panel Discussion
47
National Cancer Institute
•
C3D is a web-based clinical data management solution. – – The result of a partnership between NCICB, Center for Cancer Research (CCR) and the Oracle Corporation. Goal is to provide the cancer research community with the infrastructure to collect and manage clinical trial data in a manner that supports reuse and efficiency in building studies that are consistent with the caBIG compatibility guidelines and requirements. Originally developed for NCI's intramural research program, now available as both a web-accessible application service provider (ASP) solution hosted at the NCICB, or as a system entirely deployed at participating Cancer Centers.
–
•
C3D supports data standardization, reuse, sharing, and interoperability
48
National Cancer Institute
Where Are We Now?
•
•
70+ Active Oncology Studies
20+ Sites
– Support for NIH intramural and extramural research
• •
•
Automated CDE Migration from caDSR 2,000+ Patients; 120,000+ Completed eCRFs
700,000+ Lab Results Automatically Loaded ; Automated Grading Based on Common Toxicity Criteria (CTC)
•
Regular Electronic Submissions to Monitors and Agencies
49
National Cancer Institute
Why Use C3D?
• Decrease individual protocol set up time
• • • •
Increase use of standards – data elements and CRFs Provide remote data capture across study sites Provide data collection in compliance with regulations (21CFR11) Automate regulatory submission – CTEP IND – CTMS – biweekly incremental – CDUS – quarterly cumulative
• •
Decrease local support costs – ASP model Decrease Oracle Clinical cost of ownership – both ASP and local models – share HHS level Oracle discount
50
National Cancer Institute
What Are the Benefits of Using RDC?
• Eliminate duplicate data collection efforts.
– Study team can enter and review their own data directly from the medical record and/or the patient’s chart.
– Study team does not have to fill out paper case report forms (CRFs) and hand off them to data entry team.
•
Speed up data collection time.
– You eliminate the time required to ship paper CRFs to the data entry team. – You eliminate the time required for deciphering handwriting. – You eliminate the telephone/email tag involved in answering questions and discrepancies between your study site and your data entry team.
•
Improve data quality.
— Study team can enter and review their own data, eliminating the chance for errors due to handwriting or misunderstanding notes.
— Study team can see and resolve discrepancies quickly.
51
National Cancer Institute
Licensing C3D
• The license fee is a one time fee, which is a significantly discounted Oracle Clinical License fee of $3000 for each protocol builder account and $2000 for each data entry account.
There will be an additional 22% annual maintenance. These licenses can be moved from one person to another if participants leave.
• •
52
National Cancer Institute
The NCICB C3D Support Team – At Your Service
•
•
Technical Director
Global Librarian / Data Curator
• •
• • •
Protocol Builders Database Administrators
C3D Developers Clinical Application Trainers Dedicated Application Support
53
National Cancer Institute
Talk Agenda
caBIG™, A Cancer Center Perspective
Program Motivations
Development of the Program Program Structure and Process
Program Timeline and Products Example Program Product: C3D
Panel Discussion
54