CSG 2007 Data Management and Governance Survey
MIT Fall 2007
Agenda
• 8:00 Introduction
• 8:15 Survey Results - Shel Waggener
• 8:45 MIT’s Data Governance Perspectives - Mary Weisse
• 9:30 U-Washington Data Governance Perspectives - Bill Yock
• 10:15 BREAK
• 10:45 Data Governance at Berkeley - Shel Waggener
• 11:10 Discussion - Jim Phelps
CSG 2007 Data Management Survey
Q: How critical of an issue is data management to your institution?
A: Highly critical
• “Top Priority in Past Year”
• “Provost …tackle urgent issue”
• “University wide needs”
• “Substantial Resources (Staff, Time, $$)”
Which data management areas are most critical?
[Bar chart: vertical axis 0 to 18; categories: Metadata, Archiving, Security, Schemas, Interfaces, Access Control, Storage, Institutional Data, Changing Landscape. Bar values not recoverable.]
Criticality of data classification for control of:
[Bar chart: vertical axis 0 to 9; categories: Extremely Critical, Critical, Somewhat Critical, Not Critical; series: Access by Risk Level, Retention Time. Bar values not recoverable.]
Classification by Risk Level
• Do you have a system for classifying data by risk level (e.g. low risk, LOA 1 access; medium risk, LOA 2 access, etc.)? Yes: 13, No: 3
• If yes, is there a formal process to assess risk level and assign access rights? Yes: 8, No: 4
• Is there a formal audit process to verify compliance? Yes: 7, No: 6
Classification by Risk Level (cont’d)
Yes
If you don’t have a system for classifying data by risk level, is there a plan to develop such a system?
Is such a system considered to be a critical need?
No 5
2
3
2
Classification by Retention Requirements
Yes
Do you have a system for classifying data by retention requirements (e.g. never keep, keep for seven years, etc.)?
Is there a formal process to assign retention times?
Is there a formal audit process to verify compliance?
No 10
9
4 2
1 3
Classification by Retention Level (cont’d)
Yes
If you don’t have a system for classifying data by retention level, is there a plan to develop such a system?
Is developing such a system considered to be a critical need?
4
No 6
6
3
Data Warehouse
• Do you have an Enterprise Data Warehouse? Yes: 16, No: 0
• Do you have more than one? Yes: 5, No: 11
• Are you amazed at the things people will do with data from the data warehouse? Yes: 10, No: 6
Data Warehouse (cont’d)
• Are you concerned about misinterpretation of data from the data warehouse? Yes: 13, No: 3
• Do you have a dedicated group creating reports? Yes: 9, No: 7
• Do you have policies on correct use? Yes: 10, No: 6
Data Access
Yes
Do you have a governance structure for setting Data Access Policy?
Do you have a formal/well understood process for getting access to data?
9
No
10
9
6
Does your process for granting access to data cover:
[Bar chart: vertical axis 0 to 14; data labels: 0, 0, 0, 1, 12, 13; category labels, partly recoverable: All Data, Some Data, ERPs, Major systems, Warehouse(s), ad hoc selection, Others. The mapping of values to categories is not recoverable.]
Data Repository
Yes No 9
Do you have a data repository?
Governance around getting new data into the repository?
Standardized data representations for key data?
4
3 4
7 6
Web Services/SOA
Yes Are you deploying Web Services?
Has this changed the discussion around data management?
No 2
13
9
11
2
4
Are you implementing Service Oriented Architecture?
Has this changed the discussion around data management?
6
3
Research Data
Yes
Do you have a plan for hosting Research Data?
Is there meta-data that lets you determine that this data is research data?
Is there a governance structure that handles the requests to host research data?
No 11
5
2
13
4
10
Data Storage Fees
Yes No
Do you charge end-users for "enterprise storage" (storage that is separate from the disks in a server)?
Are there different fees for different types of storage?
10
6
11
2
Data Storage Fee Factors
4 Uptime 6 Redundancy Offsite Colocation Other
9 4
Data Storage Fee Factors: Other
1 1
• Performance/Speed (e.g. FC vs. SATA; fast access vs. archival)
• Type of disk
• Cost
• Premium storage beyond basic allocation
• Backup, capacity
5 2
1