Example CCL Universe Data Set Progress and Recommendations

Click to download
Reviews
Shared by: aaba272ccfbce297
Stats
views:
0
rating:
not rated
reviews:
0
posted:
6/18/2009
language:
English
pages:
0
Example CCL Universe Data Set: Progress and Recommendations Presentation to NDWAC CCL Work Group Washington, DC July 15, 2003 Review Example CCL Universe Data Set/Analysis Purpose/Intent of Example CCL Universe Data Set Development of the Data Set Findings What insights do we have about this data set? Availability Quality Other Will this data set be sufficient to test the proposed “gate” screening? What can we say about the use of this data set for attribute/scoring and classification? Overview: Purpose The NRC has recommended a CCL process requiring data for the following steps: identify the “universe” of potential drinking water contaminants; select a Preliminary CCL from the “universe” by a screening process; develop a training set of compounds to train a prototype algorithm to prioritize drinking water contaminants; apply a prototype algorithm to classify the PCCL into a CCL; and conduct an expert review of the algorithm results. Overview: Purpose of the Example CCL Universe Data Set Gather some data to support NDWAC’s process Test the proposed approaches: Building the CCL Universe “Gates” screening approach Attribute Scoring Use of a prototype algorithm to classify the PCCL into a CCL Gain insights about available data on contaminants, identify data elements, issues and gaps. Overview: Structure As the process moves from “universe” to CCL, the data requirements of each contaminant will grow, Number of contaminants Data per contaminant while the pool of contaminants will shrink. Universe PCCL CCL Reg. Determination Data Retrieval Began with list of more than 200 data sources Chose 23 sources based on the following criteria: high-quality data easily accessible electronically contained information relevant to building the CCL Universe (i.e., occurrence data, etc.) Downloaded tables from 23 sources a total of 87 tables All tables formatted as Microsoft Excel files Performed QA/QC review on Excel files to ensure accurate reproduction of data from original source Linking Data To link data across tables, contaminants need a unique identifier Unique identifier was first assigned to a CAS number If the CAS number was not listed or available, a unique identifying number was assigned to the contaminant by name A table was created to list each contaminant with its corresponding identifier Not a perfect process Data Sources HPVMS HPV HA CPP CPH CERCLA ATSDR Candidate Name CAS RN & Name SuperUniqueID ITER JMPR NCOD-6 NCOD-1&2 NIRS OEHHA RAI RBC CEDI EAFUS RIVM CADW GRAS PRG TRI WHO Classifying Data Sources The 23 data sources were classified according to the type(s) of data and/or information available in each First it was determined what data elements were in each data sources Then the data elements were classified, and the data sources were classified according to what kinds of data elements were available in the source Data sources could be classified into one or more categories: HE Data, HE Info, Occ Data, Occ Info, or No Data or Info Types of Data and/or Information in the Data Sources Example Data Set Sources With HE Data ATSDR MRLs CADW CPH CPP HA ITER WHO OEHHA PRGs RBCs RIVM WHO CEDI JMPR HE Info ATSDR MRLs CADW CEDI HA ITER OEHHA PRGs RBCs RIVM Occ Data NIRS NCOD Round 1 and 2 NCOD Six Year Review Occ Info CADW PRGs RAIS RBCs TRI No Data or Info ATSDR CERCLA HPV HPV Master Summary Table GRAS EAFUS The World Health Organization (WHO): Classification of Pesticides by Hazard (CPH) CPH Class Ia ID Candidate Name Type CAS RN UN No Chem Type Phys State Main Use LD50 mg/kg LD50 Note Remarks SuperUniqueID CPH Class Ib ID Candidate Name Type CAS RN UN No Chem Type Phys State Main Use LD50 mg/kg LD50 note Remarks SuperUniqueID CPH Fumigants ID Candidate CASRN Remarks SuperUniqueID CPH PIC subject pesticides ID Class CAS RN Candidate SuperUniqueID CPH Class II ID Candidate Name Type CAS RN UN No Chem Type Phys State Main Use LD50 mg/kg LD50 Low mg/kg LD50 High mg/kg LD50 Note Remarks SuperUniqueID CPH Unlikely Hazards ID Candidate Name Type CAS RN Other CAS no listed Chem Type Phys State Main Use LD50 mg/kg LD50 Note Remarks SuperUnique ID CPH Class III update ID Candidate Name Type CAS RN UN No Chem Type Phys State Main Use LD50 mg/kg LD50 Low LD50 High LD50 Note Remarks SuperUniqueID CPH Key ID Abbreviation Name SuperUniqueID CPH Obsolete or Discontinued ID Candidate CAS RN SuperUniqueID Classifying Data Elements NRC 2001 definition demonstrated measurements in natural waters occurrence occurrence data other information about contaminant occurrence potential occurrence occurrence information health effects measurements from toxicological studies that relate to exposures via drinking demonstrated health water effects health effects data other information about adverse health effects potential health effects health effects information Type of Data NDWAC CCL CP Workgroups draft definition Data Mapping All data elements (848 total) were mapped to 5 categories: Health Effects Data (108) Occurrence Data (49) Health Effects Information (67) Occurrence Information (69) Other (555) Classification of data elements to these categories allows summarizing the candidates by each of the screening 'gates' Data Set Findings: Overview Overview 1,500 records (out of 30,000, or 5 percent) had a chemical name but no CAS RN Assigned CAS RN to 721 (~50 percent) of these 11,000 records had multiple chemical names assigned a single CAS RN 108 records had multiple CAS RNs assigned to one chemical name Example CCL Universe Data Set Summary Statistics Chemicals D atabase Subset D Elem ata ents Data Available Total (data, Sources Contam inants Data or Info info, other) Only¹ 23 10,360 293 848 Entire Exam Data ple Set All Chemicals with H ealth Effects D or ata Inform ation AND O ccurrence Data or Inform ation C icals with H hem ealth Effects Data AN D O ccurrence Data 18 774 293 848 16 62 157 <848 Data Set Summary Statistics Compared With CCL Che micals Data base Subse t Da ta Source s Conta mina nts Out of 262 on Dra ft CCL 244 Out of 50 on Final CCL E ntire Ex am ple Data S et A ll Chemic als with Health Effec ts Data or Inform ation AND Occ urrenc e Data or Inform ation Chem ic als with Health E ffec ts data AND Occ urrenc e data 23 10,360 45 18 774 188 40 16 62 35 18 Screening Gates “Gates” will be used to screen contaminants from the Universe to the PCCL based on varying criteria: Gate 1: Contaminants have Health Effects Data and Occurrence Data Gate 2: Contaminants have Health Effects Information and Occurrence Data Gate 3: Contaminants have Health Effects Data and Occurrence Information Gate 4: Contaminants have Health Effects Information and Occurrence Information Gate 5: Expert Judgement Number of Chemicals with Data/Information Number of chemicals in various categories related to data/information elements in the Example CCL Universe Data Set Category or Criteria of Data/Information Available; Gate Screening Criteria Number of Chemicals in Example Data Set Meeting the Criteria 62 0 695 17 774 Percentage of Chemicals in Example Data Set Meeting the Criteria 0.6% 0.0% 6.7% 0.2% 7.5% Health Effects DATA AND Occurrence DATA; Gate 1 Health Effects Information AND Occurrence DATA; Gate 2 Health Effects DATA AND Occurrence Information; Gate 3 Health Effects Information AND Occurrence Information; Gate 4 Subtotal of Candidates Through Gates 1-4: The Example CCL Data Set and the NRC’S Venn Diagram 19 Occurrence Data 0 1,605 Health Effects 62 Data 695 CCL Universe 6,245 Health Effects Info 813 17 Occurrence Info 904 Quantitative “GATE 1” Screening Results Ratio of Maximum Occurrence to Minimum Health Effect Level Occ MAX/HE MIN 1000000000 100000000 10000000 1000000 100000 10000 1000 100 10 1 0.1 0.01 0.001 0.0001 0.00001 0.000001 0.0000001 0.00000001 BUTACHLOR Occ MAX/HE MIN RADIUM-226 1,2,3-TRICHLOROPROPANE PHOSPHORUS, WHITE TIN •Most GATE 1 chemical occurrence is within two orders of magnitude (1 to 100) of lowest health effect level with exceptions noted. Ratio of Max Occurrence to Reference Dose: screen out ~ 60% of Gate 1 if 1:1 100000 RfD/Cancer Value to Max OCC (N=53/11, 85%) 10000 1000 100 Maximum Measured Occurrence ( g/L) 10 1 0.1 0.01 0.001 0.001 0.01 0.1 1 10 100 1000 10000 100000 Reference Dose/Cancer Value Converted to µg/L Ratio of Max Occurrence to Reference Dose : screen out ~ 60% of Gate 1 if 1:1 100000 RfD to Max OCC (N=53, 85%) 10000 1000 Maximum Measured Occurrence ( g/L) 100 10 1 0.1 0.1 1 10 100 1000 10000 100000 Reference Dose Converted to µg/L Max. Occurrence to NOAEL/LOAEL 10 ratio of Max occurrence value to LOAEL 1 0.1 0.01 0.001 0.0001 0.00001 0.000001 0.0000001 ratio of Max occurrence value to NOAEL E E AN AN E IN IN YL N M ET H M ET H IE LD AL D TA D AR B AR BU O O D O R O R C O IF LU FL U LO R O O D LO R R LO H IC TR D IC H H EX AC H N AP H TH A LE N R R IE E 0.000001 0.00001 100000 0.0001 10000 0.001 1000 100 10 1 0.01 0.1 BU PR OP AC HL O R LO TA CH EL NI CK 10-34 17 of 23 (74%) of Max. Occ. to LD50 *10-6 is greater than 1 ratio of occurrence value to health effect value Ratio of Max Occ to LD50(*10-6) DI R CA ET M BA O LA 1, CH 2, 4LO M 1, TR ET 1, R IM 1, RI 2ET BU TE HY ZI TR N LB AC EN HL ZE OR NE OE TH AN DI E QU DI AT M CH ET CH LO HO LO RO M RO YL DI FL ET UO HA RO NE M HE ET XA H C A AN CH RB E LO 1, AR RO 3DI YL BU CH TA LO 1, R O D IE TR 1NE DI PR IC CH HL OP LO O EN 1, RO RO 1, E 2, FL ET 2UO HA TE RO TR NE M AC ET HL OR H A N E OE TH AN AL E NA DR PH IN TH AL EN E DI 1, BR EL 2, DR OM 3TR IN OM IC ET HL HA OR NE OP RO PA NE M Summary of Gate 1 Quantitative Screening Gate 1 Screening Max Max Criteria Ratio Occurrence/ Occurrence/ (Occ:HE) Lowest HE Lowest RfD 1:1 or greater 65% 42% 1:0.1 81% 64% 1:0.01 97% 83% 1:0.001 98% 98% 1:0.0001 98% 100% >1:0.00001 100% 100% N 62 53 Max Occurrence/High Max est NOAEL/ Occurrence/ Lowest LOAEL Lowest LD50 0% 74% 0% 83% 14% 96% 43% 96% 86% 96% 100% 100% 7 23 Insights Data Availability is better for health effects than for water occurrence. Lack of contaminants for Gate 2 an indicator that water occurrence data may be a limiting factor Used high quality sources – few additional sources will be of similar quality – need data quality measures for additional sources Relationships among Data Sources: Use of prioritized lists informative for screening, but may want background data for attribute scoring (iterative process) E.g. ATSDR data sources HazDat Data CERCLA Priority List Toxicity Profiles Minimal Risk Levels Another Example: ITER Derived endpoints e.g. Reference Dose Can get background data from RAIS, EPA, etc. ITER useful for screening, may want additional data for attribute scoring How Representative are the 23? 10% of reviewed data sources included 10,360 contaminants (~10% of anticipated CCL Universe) 750 (about 7.5%) get through “qualitative” “gate” screening 90% of CCL 1998 chemicals included 80% of CCL 1998 through “qualitative” “gates” (40 of 50) Is the Example Data Set Adequate for Screening? Sufficient data are available for screening Gate 1 screening demonstrates adequacy for decision making Qualitative screening is simple, and effective for this example With larger numbers ( e.g. Actual CCL Universe), quantitative screening may be needed to limit size of PCCL Other gates probably require a quantitative screening Would increase representativeness to include additional occurrence data (e.g. NAWQA, NREC) in Example CCL Data Set RECOMMENDATION: Use Qualitative Screening for NDWAC Example PCCL Add more Occurrence Data sources Findings and Recommendations Using tabular sources maximizes the number of elements for screening Have many derived health effect endpoints and redundancies Have included national drinking water surveys Could identify additional Gate 1 contaminants with other natural water surveys (e.g. NAWQA) Additional data sources would be helpful for attributes scoring What can we say about the use of this data set for attribute/scoring and classification? Additional sources may provide needed elements and perhaps should be added to data set Additional elements may be helpful for attribute scoring. For example, critical effect information to score severity from derived “potency” endpoints may not be available. Additional sources may require manual manipulation and judgment for use in PCCL to CCL E.g., additional occurrence data is in raw data format, so statistics must be calculated

Related docs
CCL Universe Example Data Set - Update
Views: 0  |  Downloads: 0
Building the CCL Universe
Views: 19  |  Downloads: 0
Building the CCL Universe
Views: 31  |  Downloads: 0
The Universe ? or Nothing
Views: 50  |  Downloads: 0
Other docs by aaba272ccfbce2...
Breach of Duty
Views: 891  |  Downloads: 8
Praise the Lord
Views: 230  |  Downloads: 0
National Chemistry Week Experiments: Bubbles
Views: 524  |  Downloads: 12
Holy Holy Holy
Views: 184  |  Downloads: 0
ch102
Views: 124  |  Downloads: 0
Worthy is the Lamb (new)
Views: 465  |  Downloads: 7
English and its Relationship with French
Views: 583  |  Downloads: 12
He is Exalted
Views: 410  |  Downloads: 3
Installment sale and security agreement
Views: 282  |  Downloads: 7
O Keeffe v Snyder _ADv of chattels
Views: 868  |  Downloads: 12
dv105c
Views: 145  |  Downloads: 0
Be With Me Lord
Views: 244  |  Downloads: 2
Arms of Love
Views: 336  |  Downloads: 9
Mortgage accounting inc escrow
Views: 365  |  Downloads: 12
Con Law IR outline
Views: 303  |  Downloads: 7