Metadata
Document Sample


FGDC Biological Data Profile
As it maps to
Dublin Core
What is Interoperability?
• It is people, standards, and machines that
understand each other
• It is all about breaking down the barriers and
silos
• It is not complicated to achieve, but it does
require commitment and perseverance
• Is it desirable?
… you tell us!
InterOperability
InterDependencies
People
• the will to make interoperability a reality
Standards
• data, metadata, web services
Nuts and Bolts
• distributed network
• Internet/Intranet
• development, testing, and deployment
• integrated search environment
• enabling tools
International Standards
Enable Interoperability
Metadata Standards
• international standards complement science-based
nature of Environment Canada
• FGDC - CSDGM
• FGDC - Biological Data Profile
• Station Level metadata
• Dublin Core
• permits interoperability across diverse information
holdings
• permits horizontal integration of business processes
Controlled vocabularies
• data owners use the same language, which enables data
alignment and exchange
• searchers use the same language as data owners,
which enables consistent discovery
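The shared-language idea above can be sketched in a few lines. This is only an illustration: the three vocabulary terms and the function name `normalize_keyword` are hypothetical and are not drawn from any EC authority list.

```python
# Hypothetical controlled vocabulary; a real authority list would be far larger
# and would normally be served by a thesaurus or web service.
CONTROLLED_TERMS = {"water quality", "air quality", "migratory birds"}

def normalize_keyword(raw):
    """Return the controlled term matching raw, or None if it is not in the vocabulary."""
    # Collapse case and whitespace so owners and searchers land on the same term.
    candidate = " ".join(raw.strip().lower().split())
    return candidate if candidate in CONTROLLED_TERMS else None

print(normalize_keyword("  Water   Quality "))
print(normalize_keyword("H2O quality"))
```

Because data owners and searchers pass through the same normalization, a term either resolves to one agreed spelling or is rejected, which is what makes discovery consistent.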
Locating the Repositories and
Searching the Metadata
FGDC CSDGM and the FGDC Biological Data Profile
• generated metadata reside in detailed
repositories
• users need to know where to go in order to
search repositories separately or
simultaneously through a node (NBII node of
FGDC Clearinghouse, Discovery portal, ...)
Searching the Metadata and
Locating the Repositories
For Dublin Core
• Government OnLine adopts Dublin Core as
part of Common Look and Feel Standard
• enables high level discovery of web
resources
… but what about all of the other
types of information assets that
are not web based?
Searching the Metadata and
Locating the Repositories
For Dublin Core
• desirable to develop high level repositories
of scientific information/data holdings
• enable high level discovery of data
sources
• alternate point of entry to specialized
repositories for more detailed searching
• need to map FGDC CSDGM and Biological
Data Profile elements to Dublin Core to
make this happen
• approximately 18 Dublin Core elements
Dublin Core and the FGDC Biological Data Profile
Map It!
Is anyone else doing this?
NBII ?
University of Louisiana - CORC (Cooperative Online
Resource Catalog) project to enhance access to
Federal Geographic Data Committee (FGDC) data
sets
• alternate clearinghouse model
• convert existing metadata to more widely used standards for
inclusion in other clearinghouses
• (http://www.dlib.org/dlib/january00/chandler/01chandler.html)
Interoperability Links
Diverse Information Holdings
• Business logic
• Authority lists
• Life cycle management
Apply Content Management for Business Processes
• Find
• Use
• Share
Apply Metadata for consistency
Holdings: Research Reports, Libraries, Tabular Data, Maps, Spatial Data, Web
Existing situation: discrete holdings with inconsistent metadata
Dublin Core and the FGDC Biological Data Profile
Map It!
DC Element                         Bio Profile                                  Domain
Title                    maps to   Title                                        None
Date                     maps to   Single Date (or) Ending Date (or) last       None
                                   date entered in a multiple date range
Date.Created             maps to   Metadata Date Created                        None
Date.Modified            maps to   Metadata Review Date                         None
Temporal (DCMI Period)   maps to   Geologic Age Estimate                        None
Dublin Core and the FGDC Biological Data Profile
Map It!
DC Element              Bio Profile                      Domain
Creator       maps to   Originator                       None
Description   maps to   Abstract                         None
Publisher     maps to   Primary Contact Organization     None
Contributor   maps to   Data Set Credit                  None
Identifier    maps to   Online Linkage                   None
Source        maps to   Lineage                          None
Dublin Core and the FGDC Biological Data Profile
Map It!
DC Element                                  Bio Profile                         Domain
Coverage (DCMI Point, DCMI Box)   maps to   Bounding Box
Place Name                        maps to   Place Keyword                       gcgeonames
Format                            maps to   Non-digital form (or) Format Name   Format Name (as required)
Rights Access                     maps to   Access Constraints                  None
Rights Use                        maps to   Use Constraints                     None
Ec.type                           maps to   Geospatial Presentation Form        Geospatial Presentation Form
                                                                                ++ additional type elements
                                                                                to be determined
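A minimal sketch of how the crosswalk in the mapping slides could be applied in code. The element pairs come from the slides; the flat-dictionary record layout, the `Multiple Dates` key, the function names, and the sample values are all assumptions made for illustration. Coverage and Format are left out because they need structured handling (bounding boxes, "as required" domains) that the slides do not spell out.

```python
# Bio Profile element -> Dublin Core element, as listed on the mapping slides.
BIO_TO_DC = {
    "Title": "Title",
    "Metadata Date Created": "Date.Created",
    "Metadata Review Date": "Date.Modified",
    "Originator": "Creator",
    "Abstract": "Description",
    "Primary Contact Organization": "Publisher",
    "Data Set Credit": "Contributor",
    "Online Linkage": "Identifier",
    "Lineage": "Source",
    "Place Keyword": "Place Name",
    "Access Constraints": "Rights Access",
    "Use Constraints": "Rights Use",
    "Geospatial Presentation Form": "Ec.type",
}

def pick_date(record):
    """Date rule from the slides: Single Date, else Ending Date, else the last
    date in a multiple date range ('Multiple Dates' is a hypothetical key)."""
    if record.get("Single Date"):
        return record["Single Date"]
    if record.get("Ending Date"):
        return record["Ending Date"]
    multiple = record.get("Multiple Dates") or []
    return multiple[-1] if multiple else None

def to_dublin_core(record):
    """Translate a flat Bio Profile record into Dublin Core; unmapped keys are dropped."""
    dc = {BIO_TO_DC[key]: value for key, value in record.items() if key in BIO_TO_DC}
    date = pick_date(record)
    if date is not None:
        dc["Date"] = date
    return dc

# Hypothetical sample record.
sample = {
    "Title": "Example survey",
    "Originator": "Example agency",
    "Abstract": "Example abstract",
    "Ending Date": "1999-12-31",
    "Unmapped Element": "ignored",
}
print(to_dublin_core(sample))
```

Keeping the crosswalk as a plain lookup table means a new mapping decision is a one-line change rather than a code change.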
Dublin Core and the FGDC Biological Data Profile
Map It!
…and what we don’t map
DC Element   Bio Profile                       Domain                      Reason
Language     No equivalent                     ISO 19115 Language Set      No equivalent
Subject      No equivalent                     Core Subject Thesaurus      No equivalent
Audience     no mapping                        TBS audience type           No equivalent
Type         no mapping                        TBS type                    Does not include scientific
                                                                           data or like term.
Medium       could map to Non-digital form     Format Name (as required)   dc.format is better
             (or) Format Name
ECMeta
data entry
ECMeta
ECMeta is a web-based data entry tool that
integrates three metadata standards and several
controlled vocabularies
• hybrid Document Type Definition (DTD)
• Dublin Core, FGDC CSDGM, FGDC Biological Data
Profile
• Numerous authority files, most available as web services
• Core Subject Thesaurus
• Global Change Master Directory (GCMD)
• Integrated Taxonomic Information System (ITIS)
• Envirotel
• Generate XML
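As a rough sketch of the "Generate XML" step, the following serializes a few Dublin Core fields with Python's standard library. The namespace URI is the published Dublin Core element set namespace; the `metadata` root element, the function name, and the sample values are assumptions, since ECMeta's hybrid DTD is not shown here.

```python
import xml.etree.ElementTree as ET

DC_NS = "http://purl.org/dc/elements/1.1/"  # Dublin Core element set namespace
ET.register_namespace("dc", DC_NS)

def record_to_xml(fields):
    """Serialize a dict of Dublin Core element names and values to an XML string."""
    root = ET.Element("metadata")  # hypothetical root element, not ECMeta's DTD
    for name, value in fields.items():
        child = ET.SubElement(root, "{%s}%s" % (DC_NS, name))
        child.text = value
    return ET.tostring(root, encoding="unicode")

# Hypothetical sample values.
xml_text = record_to_xml({"title": "Example data set", "creator": "Example originator"})
print(xml_text)
```

Namespacing the elements keeps the Dublin Core fields distinguishable from CSDGM or Biological Data Profile elements if all three are carried in one hybrid record.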
ECMeta
Architecture
• Web services: CST, Geo Items, Envirotel, ITIS
• Web-based entry tool
• Hybrid model (hybrid DTD): Dublin Core, CSDGM, Biological Data Profile
• Publishing: XML
• Storage: binary files, binary objects & Java classes
• Download XML to local drives
Why Do it?
Integrated
Search
Searching
…should be interoperable
• Traditional searching is normally performed within
each specific resource
• time consuming, incomplete
• interoperability allows for searching across a
multitude of resources - at the same time
• made possible by use of metadata standards,
protocols, and XML
• allows for searching at the Discovery, Access, and Use levels
Searching used to be
Ad-hoc
Without Metadata
• The user must search each holding individually
• Holding 1 through Holding 5 each have their own SEARCH / GO box
Searching is now
Integrated
With Metadata
• One search to many holdings
• A single METADATA SEARCH / GO box queries distributed metadata
repositories (XML) that describe Holding 1 through Holding 5
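The contrast between ad-hoc and integrated searching can be sketched as one query run across every holding's metadata. The repository contents, record fields, and function name below are hypothetical; a real deployment would query distributed repositories over a network protocol, not an in-memory dictionary.

```python
# Hypothetical metadata repositories: one list of Dublin Core-style records per holding.
REPOSITORIES = {
    "Holding 1": [{"Title": "Lake sediment survey", "Description": "Sediment cores"}],
    "Holding 2": [{"Title": "Bird banding records", "Description": "Migratory bird counts"}],
    "Holding 3": [{"Title": "Sediment chemistry report", "Description": "Library report"}],
}

def metadata_search(term, repositories):
    """Return (holding, title) pairs whose Title or Description contains term."""
    needle = term.lower()
    hits = []
    for holding, records in repositories.items():
        for rec in records:
            text = (rec.get("Title", "") + " " + rec.get("Description", "")).lower()
            if needle in text:
                hits.append((holding, rec.get("Title", "")))
    return hits

print(metadata_search("sediment", REPOSITORIES))
```

The search only works across holdings because every record exposes the same element names, which is the role the metadata standard plays.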
Hierarchical Application of
Metadata Standards
Single object, Database or Collection
Audiences: Specialists, Practitioners, Public
• Discovery: Dublin Core
• Access: Full CSDGM and Biological Data Profile
• Use: Station/Point source
Other diagram labels: Content Management, Geospatial Cluster, Resources
Management, Temporal Resources, Geospatial Mapping, Web Services
Three Levels of
Metadata
A flexible strategy that matches effort to need
Using internationally recognized standards
Discovery
• Most EC information assets will be discovered at this simplest level.
This could be for a collection, database, or single object.
Access
• Using the full geospatial and/or biological profile, this level will
provide for the comprehensive description and disclosure of data.
Use
• This level will allow for the use of biological or geospatial metadata at
the station level for visualization and data extraction web services.
Searching for Discovery
(levels shown: Discovery)
Searching for Access
(levels shown: Discovery, Access)
Searching for Use
(levels shown: Discovery, Access, Use)
Next Steps
• prototype the Discovery, Access, and Use model
in EC
• integrate searching for
• web resources
• data (spatial, tabular, ...)
• books, reports (library)
• geospatial resources
• station level
• etc.
• work with NRCan and other 5NR departments to
explore Dublin Core model further