Introduction to Metadata
Lynda Wayne
GeoMaxim / FGDC FGDC CAP Kickoff Meeting Denver, CO
September 28-29, 2004
1
Introductions
Name Organization Metadata Experience Workshop Expectations
2
Objectives
After the workshop, students can:
develop a metadata template that uses a range of mandatory, conditional, & optional CSDGM elements effectively read and comprehend metadata
make the business case for metadata
discriminate between minimal and quality metadata locate and access online resources
3
What IS Metadata?
Data ‘reporting’
WHO created the data? WHAT is the content of the data? WHEN was it created? WHERE is it geographically? HOW was the data developed?
WHY was the data developed?
4
What IS Metadata?
title supplemental information abstract
time period author
sources
(file) size
5
What IS Metadata?
entity
attributes
view actual metadata record
6
Let’s Make Metadata
Turn to your neighbor and document the following:
Title (name) Theme Keywords (work, play, life) Supplemental Information
7
Value of Metadata
Preserves investments in geospatial data development
people forget…………..people move on
8
Value of Metadata
Serves as a valuable resource to data use and analysis
9
Value of Metadata
Provides data consumers & stewards: • a common language • a context for data resources • needed information about the data
Projection? Attributes? Time Period? Albers, I think? Yes
I’ll have get back to you…
10
Value of Metadata
Supports data management • maintenance and update • project status • value assessment / cba • project estimates • deliverable and performance measures
11
Value of Metadata
12
Who creates metadata?
Officially
Federal organizations producing digital geospatial data Federally funded projects that produce digital geospatial data Anybody that creates digital data
Morally and practically
It’s the right thing to do!
13
Why metadata?
Be a part of the BIG picture
Federal Geographic Data Committee (FGDC) National Spatial Data Infrastructure (NSDI) Global Spatial Data Infrastructure (GSDI)
- creation and distribution of seamless global geospatial data
- coordinates national geospatial data development
- creation and distribution of seamless national geospatial data
14
Why metadata?
National Spatial Data Infrastructure
Old perspective………………….
……………………………….New perspective
15
Why metadata?
NSDI Framework Data
- cooperatively-designed data dictionaries for seven key ‘reference’ themes
• Elevation and Bathymetry
• Hydrography • Geodetic Control • Transportation • Governmental Units
• Cadastral
• Orthoimagery
16
Why metadata?
NSDI Geospatial Data Clearinghouse
- international network of metadata distribution ‘nodes’
user
internet
Clearinghouse
• FGDC
• EROS • ESRI • NRCS • NOAA CSC • ALASKA GDC
query available metadata records
nodes international GINA AGDC ASGDC Anchorage state / local national
17
Why metadata?
NSDI Geospatial One Stop
metadata and links to data webmapping services data acquisition plans data category communities
…..fast tracking the NSDI
18
Why Metadata?
National Map
Geospatial One-stop
policies • standards • partnerships
coordination
FGDC
19
How Do I Create Metadata?
the metadata gospel…
It’s not pretty
It’s not easy But sure is thorough…..
20
CSDGM
1. Identification Information
General bibliographic information including: title, originator, data contact, status, date, time period of content, abstract, purpose, keywords, geographic location
2. Data Quality Information
Lineage and data assessments
sources, process methods, accuracy, data processing contact
21
CSDGM
3. Spatial Data Organization
Data format: vector, point, raster
4. Spatial Reference Information
Coordinate system parameters: horizontal / vertical coordinate system, projection, datum
22
CSDGM
5. Entity and Attribute Information
Database design entities, attributes, domains, description of data values
6. Distribution Information
How to acquire the data
distribution contact, available formats, online distribution website, costs
23
CSDGM
7. Metadata Reference Information General information about the metadata
record itself metadata contact, metadata standard used, metadata creation date, metadata review date
24
CSDGM Elements
25
CSDGM Elements
26
CSDGM Elements
27
CSDGM Elements
Turn to page 35 in the Green Book
28
CSDGM Elements
29
CSDGM Elements
Calendar Dates
YYYYMMDD
Time of Day
HHMMSSSS
Coordinates
Lat/Lon Decimal Degrees
Network addresses & file names
Service://hostname:port/path/filename
30
CSDGM Production Rules
31
CSDGM Production Rules
32
CSDGM Production Rules
33
CSDGM Production Rules
34
CSDGM Production Rules
35
CSDGM
36
CSDGM
Section One: Identification
Elements you use to ‘shop’ for a data set of interest:
WHO ? WHAT ? WHERE ? WHEN ? title, originator, publication date abstract, keywords, native data set (software) environment geographic extent (mbr), browse graphic time period of content
Availability? access/use constraints, status
37
CSDGM
38
CSDGM
Section Two: Data Quality
How was the data set developed?
• •
Source Information Data processing Logical Consistency Positional Accuracy Attribute Accuracy
What checks were made of the data set?
• • •
39
CSDGM
40
CSDGM
Section Three: Spatial Data Organization
Point, Line, or Vector?
41
CSDGM
42
CSDGM
Section Four: Spatial Reference Information
Coordinate System
43
CSDGM
44
CSDGM
Section Five: Entity & Attribute Information
Database Description
45
CSDGM
46
CSDGM
Section Six: Distribution Information
How can I get the data set?
• • • • •
Distribution Contact Information Available Formats (digital and hardcopy) Standard and Custom Order Processes Online option Offline media
•
•
Distribution Liability
Fees
47
CSDGM
48
CSDGM
Section Seven: Metadata Information
Elements that describe the metadata record itself:
WHO wrote the metadata record WHAT metadata standard was used?
WHEN was the metadata created/updated?
Is the metadata available? metadata access and use constraints
49
CSDGM
50
CSDGM
Supporting Sections
Eight: Citation Information
Title, Originator, Publisher
Nine: Time Period Information
Date and Hour
Ten: Contact Information
Name, Job Title, Address, Phone, Email
51
CSDGM
52
Metadata Collection Tools Shareware
corpsmet - USACE tkme / xtme - USGS/FGDC NPSmeta (ArcCatalog Extension) MetaD - ISO
53
Metadata Collection Tools GIS Internal
ESRI ArcCatalog Intergraph SMMS for Geomedia
54
Metadata Collection Tools Software
Commercial
SMMS Data Logger Blue Marble
55
Metadata Collection Tools
Forms hardcopy or online
56
Metadata Validation Tools
mp - Metadata Parser
Checks for CSDGM syntax
cns – Chew ‘N Spit
Embeds tags for NSDI Clearinghouse Distribution Rectifies indentation Inserts capitalization and underbars
Element names Mandatory elements Element content (domains and logical consistency)
FGDC ‘compliant’ metadata must pass mp!
57
Metadata Resources: The Greenbook
An easy to use guide to implementing the CSDGM (is not the standard itself)
Background and Descriptive Information CSDGM Elements
NWI and DGL CSDGM history, value of meta, formats
Sample Metadata
definitions, domains, FAQs, common values
Glossary of Terms
58
Metadata Resources: Online Resources
http://www.fgdc.gov/metadata/ /online_resources
tutorials metadata guidance websites software publications thesauri sources
59
Making Metadata Part of the Process
New tools enable us to better integrate metadata creation into the data development process
60
Making Metadata Part of the Process
If metadata were collected throughout the data process… 1. more accurate (no guessing) metadata 2. more details 3. better decision-making.
61
Making Metadata Part of the Process
But what can I do to better incorporate metadata into the data development process?
62
step one: build adminstrative support
Make the business case:
• • • • • • preserve data investments limit data liability manage data resources find new data resources easier data transfer more efficient data distribution
63
step two: build technical support
Emphasize individual benefits:
• • • reduce workload..in the long term field fewer data inquiries document personal contributions
64
step two: build technical support
Support your staff
• include in job descriptions & performance measures provide staff support tools training
65
•
step three: create organizational templates
For each common data type:
identify pertinent fields populate fixed fields
standardized language
distribution methods standards used
build source and contact libraries
66
step four: distribute the effort
map fields to the work flow establish and assign responsibilities
technicians - lineage
analyst - process methodology
field scientists - accuracy assessments IT managers - tools, collection, management
67
step four: distribute the effort
Data Planning:
Section 1: Identification Info
title theme keywords originator abstract purpose content time period
68
step four: distribute the effort
Data Planning
Section 3: Data Organization
Indirect Spatial Reference
FIPS codes, county monikers, etc
Object-type
point, vector, or raster
69
step four: distribute the effort
Data Planning Section 5: Data Organization
Entities and Attributes
database design and configuration
Highways name surface [asphalt, concrete, unpaved] year_built restrictions [haz, 2axle, flood] district [A, B, C, D, E]
70
step four: distribute the effort
Data Processing
Section 2: Data Quality
completeness report
missing or omitted data
positional accuracy
digitize rms error, GPS triangulation, survey/photogrammetry/IP method
71
step four: distribute the effort
Data Processing
Section 2: Data Quality
lineage
data set source files data compilation process
maps photos dbases GIS
72
step four: distribute the effort
Data Analysis
Section 2: Data Quality
attribute accuracy
field checks, cross-checks, statistical analyses
lineage
data analysis process
73
step four: distribute the effort
Data Analysis
Section 3: Data Organizations
object type
number of points, pixels, lines
Section 5: Entity and Attributes
attribute values
74
step four: distribute the effort
Data Analysis
Section 7: Metadata Reference
metadata contact and dates
75
step five: lead by example
Managers Can WRITE metadata
data planning fields
and manage the effort
title abstract purpose geographic extent keywords and many more…
metadata coordination metadata enforcement
76
step six: policies and procedures
Establish Policies:
mandate use of standards and templates develop boilerplate metadata deliverable language for data contractors require units to publish their metadata publish metadata SOP to document policies and procedures
77