FGDC Biological Data Profile as it maps to Dublin Core
                       What is Interoperability?

• It is people, standards, and machines that
  understand each other
• It is all about breaking down the barriers and
  silos
• It is not complicated to achieve, but it does
  require commitment and perseverance
• Is it desirable?
  ... you tell us!
                                    Interoperability
                                 Interdependencies
People
   • the will to make interoperability a reality
Standards
   • data, metadata, web services
Nuts and Bolts
   • distributed network
   • Internet/Intranet
   • development, testing, and deployment
   • integrated search environment
   • enabling tools
                                International Standards
                                 Enable Interoperability
Metadata Standards
   • international standards complement the science-based
     nature of Environment Canada
      • FGDC - CSDGM
      • FGDC - Biological Data Profile
      • Station Level metadata
      • Dublin Core
   • permit interoperability across diverse information
     holdings
   • permit horizontal integration of business processes
Controlled vocabularies
  • data owners use the same language, which enables data
    alignment and exchange
  • searchers use the same language as data owners,
    which enables consistent discovery
                Locating the Repositories and
                     Searching the Metadata
FGDC CSDGM and the FGDC Biological Data Profile
• the metadata generated are held in detailed
  repositories
• users need to know where to go in order to
  search repositories separately, or
  simultaneously through a node (NBII node of
  FGDC Clearinghouse, Discovery portal, ...)
                 Searching the Metadata and
                   Locating the Repositories
For Dublin Core
• Government OnLine adopts Dublin Core as
  part of Common Look and Feel Standard
     • enables high level discovery of web
        resources
   … but what about all of the other
    types of information assets that
    are not web-based?
                   Searching the Metadata and
                     Locating the Repositories
For Dublin Core
• desirable to develop high level repositories
  of scientific information/data holdings
     • enable high level discovery of data
       sources
     • alternate point of entry to specialized
       repositories for more detailed searching

• need to map FGDC CSDGM and Biological
  Data Profile elements to Dublin Core to
  make this happen
    • approximately 18 Dublin Core elements
Dublin Core and the FGDC Biological Data Profile
                                        Map It!

 Is anyone else doing this?
  • NBII?
  • University of Louisiana - CORC (Cooperative Online
    Resource Catalog) project to enhance access to
    Federal Geographic Data Committee (FGDC) data
    sets
     • alternate clearinghouse model
     • convert existing metadata to more widely used standards for
       inclusion in other clearinghouses

    (http://www.dlib.org/dlib/january00/chandler/01chandler.html)
                       Interoperability Links
                Diverse Information Holdings
 [Diagram: three stacked layers of holdings, listed here from top to bottom]

 Apply Content Management for Business Processes
  • Business logic
  • Authority lists
  • Life cycle management

 Apply Metadata for consistency
  • Find
  • Use
  • Share

 Existing situation: discrete holdings with inconsistent metadata
  • Libraries, Research Reports, Maps, Web, Tabular Data, Spatial Data
Dublin Core and the FGDC Biological Data Profile
                                        Map It!
DC Element                        Bio Profile                          Domain
Title                    maps to  Title                                None
Date                     maps to  Single Date, or Ending Date, or      None
                                  the last date entered in a
                                  multiple date range
Date.Created             maps to  Metadata Date Created                None
Date.Modified            maps to  Metadata Review Date                 None
Temporal (DCMI Period)   maps to  Geologic Age Estimate                None
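
The Date row offers three alternative Bio Profile sources. A minimal sketch of that choice in Python follows. The function name, the parameter names, and the order in which the alternatives are tried are assumptions made for illustration; the table itself only says "or".

```python
from typing import Optional, Sequence


def dc_date(single_date: Optional[str] = None,
            ending_date: Optional[str] = None,
            multiple_dates: Sequence[str] = ()) -> Optional[str]:
    """Pick a value for DC Date from the three Bio Profile alternatives.

    The order of precedence below is an assumption; the mapping table
    lists the alternatives without ranking them.
    """
    if single_date:
        return single_date
    if ending_date:
        return ending_date
    if multiple_dates:
        # "Last date entered in a multiple date range"
        return multiple_dates[-1]
    return None


# Hypothetical values, in the YYYYMMDD form commonly used in CSDGM records.
print(dc_date(multiple_dates=["19990101", "20010601"]))
```

In practice only one of the three inputs would normally be populated for a given record, so the precedence rarely matters.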
Dublin Core and the FGDC Biological Data Profile
                                        Map It!
DC Element             Bio Profile                    Domain
Creator       maps to  Originator                     None
Description   maps to  Abstract                       None
Publisher     maps to  Primary Contact Organization   None
Contributor   maps to  Data Set Credit                None
Identifier    maps to  Online Linkage                 None
Source        maps to  Lineage                        None
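
The one-to-one rows in this table can be expressed as a simple lookup. The sketch below is illustrative only: the dictionary keys are the Bio Profile element names as written on the slide (not the short tag names used in real CSDGM files), and the sample record is hypothetical.

```python
# One-to-one rows from the table: Bio Profile element -> Dublin Core element.
CROSSWALK = {
    "Originator": "Creator",
    "Abstract": "Description",
    "Primary Contact Organization": "Publisher",
    "Data Set Credit": "Contributor",
    "Online Linkage": "Identifier",
    "Lineage": "Source",
}


def to_dublin_core(bio_record: dict) -> dict:
    """Copy mapped, non-empty Bio Profile values into a Dublin Core dict."""
    dc = {}
    for bio_name, value in bio_record.items():
        dc_name = CROSSWALK.get(bio_name)
        if dc_name is not None and value:
            dc[dc_name] = value
    return dc


# Hypothetical record; "Taxonomy" has no row in the table and is dropped.
sample = {
    "Originator": "Environment Canada",
    "Abstract": "Breeding bird survey results",
    "Taxonomy": "Aves",
}
print(to_dublin_core(sample))
```

Elements without a row in the table are left behind in the detailed repository, which is the point of the higher-level Dublin Core record.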
Dublin Core and the FGDC Biological Data Profile
                                        Map It!
DC Element                       Bio Profile                    Domain
Coverage
 DCMI Point, DCMI Box   maps to  Bounding Box
 Place Name             maps to  Place Keyword                  gcgeonames
Format                  maps to  Non-digital form, or           Format Name (as required)
                                 Format Name
Rights Access           maps to  Access Constraints             None
Rights Use              maps to  Use Constraints                None
Ec.type                 maps to  Geospatial Presentation Form   Geospatial Presentation Form,
                                                                plus additional type elements
                                                                to be determined
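
To make the Coverage row concrete, here is a minimal sketch that turns the four sides of a Bounding Box into a DCMI Box string. It assumes the DCMI Box encoding scheme's `northlimit`, `southlimit`, `westlimit`, and `eastlimit` components; the function name and the sample coordinates are hypothetical.

```python
def dcmi_box(west: float, east: float, north: float, south: float) -> str:
    """Format bounding coordinates (decimal degrees) as a DCMI Box value."""
    if north < south:
        raise ValueError("north limit must not be below south limit")
    return (f"northlimit={north}; southlimit={south}; "
            f"westlimit={west}; eastlimit={east}")


# Hypothetical bounding box, for illustration only.
print(dcmi_box(west=-141.0, east=-52.6, north=83.1, south=41.7))
```

A west value greater than the east value is not rejected here, because a box that crosses the 180° meridian legitimately has that shape.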
Dublin Core and the FGDC Biological Data Profile
                                        Map It!
…and what we don’t map
DC Element   Bio Profile                  Domain                      Reason
Language     No equivalent                ISO 19115 Language Set      No equivalent
Subject      No equivalent                Core Subject Thesaurus      No equivalent
Audience     no mapping                   TBS audience type           No equivalent
Type         no mapping                   TBS type                    Does not include scientific
                                                                      data or like term.
Medium       could map to Non-digital     Format Name (as required)   dc.format is better
             form, or Format Name
ECMeta data entry
                                                 ECMeta

ECMeta is a web-based data entry tool that
integrates three metadata standards and several
controlled vocabularies
    • hybrid Document Type Definition (DTD)
      • Dublin Core, FGDC CSDGM, FGDC Biological Data
        Profile
   • numerous authority files, most available as web services
      •   Core Subject Thesaurus
      •   Global Change Master Directory (GCMD)
      •   Integrated Taxonomic Information System (ITIS)
      •   Envirotel
   • Generate XML
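
As a rough illustration of the "Generate XML" step, the sketch below serializes a few Dublin Core fields with Python's standard library. It is not ECMeta's actual output: the `metadata` wrapper element, the function name, and the sample values are assumptions, and the hybrid DTD is not modelled. Only the Dublin Core element namespace URI is taken from the standard.

```python
import xml.etree.ElementTree as ET

# Dublin Core element set namespace.
DC_NS = "http://purl.org/dc/elements/1.1/"
ET.register_namespace("dc", DC_NS)


def build_dc_xml(fields: dict) -> str:
    """Serialize a dict of Dublin Core element names and values to XML."""
    root = ET.Element("metadata")  # hypothetical wrapper element
    for name, value in fields.items():
        child = ET.SubElement(root, f"{{{DC_NS}}}{name}")
        child.text = value
    return ET.tostring(root, encoding="unicode")


# Hypothetical record, for illustration only.
print(build_dc_xml({"title": "Breeding bird survey", "creator": "Environment Canada"}))
```

A real record would also carry the CSDGM and Biological Data Profile sections that the hybrid model combines with Dublin Core.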
                                                  ECMeta Architecture

 [Diagram; labels as they appear]
  • Authority sources reached through web services: CST, Geo Items, Envirotel, ITIS
  • Web-based entry tool
  • Hybrid model: Dublin Core, CSDGM, Biological Data Profile
  • Hybrid DTD
  • Publishing XML; download XML to local drives
  • Storage: binary files (binary objects and Java classes)
Why Do it?

Integrated
   Search
                                    Searching
                     …should be interoperable
• Traditional searching is normally performed within
  each specific resource
   • time consuming, incomplete
• interoperability allows for searching across a
  multitude of resources - at the same time
   • made possible by use of metadata standards,
      protocols, and XML
• allows for searching at the Discovery, Access, and Use levels
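
The contrast between per-resource and integrated searching can be sketched in a few lines. This is an in-memory toy under stated assumptions: the holdings, records, and function name are hypothetical, and a real deployment would query distributed repositories over a search protocol rather than Python lists.

```python
from typing import Dict, List, Tuple

Record = Dict[str, str]


def integrated_search(repositories: Dict[str, List[Record]],
                      term: str) -> List[Tuple[str, str]]:
    """Run one query over the metadata of every holding at the same time."""
    needle = term.lower()
    hits = []
    for holding, records in repositories.items():
        for record in records:
            text = " ".join((record.get("title", ""),
                             record.get("description", ""))).lower()
            if needle in text:
                hits.append((holding, record.get("title", "")))
    return hits


# Hypothetical holdings described with the same metadata fields.
holdings = {
    "Library": [{"title": "Wetland birds report", "description": "Survey summary"}],
    "Spatial Data": [{"title": "Wetland boundaries", "description": "Polygon layer"}],
    "Web": [{"title": "Air quality index", "description": "Daily readings"}],
}
print(integrated_search(holdings, "wetland"))
```

The single loop over all holdings works only because every record exposes the same fields, which is what a shared metadata standard provides.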
                                      Searching used to be
                                                  Ad-hoc
Without Metadata

The user must search each holding individually


[Diagram: five separate SEARCH / GO boxes, one for each of Holding 1 to Holding 5]
                                          Searching is now
                                               Integrated
With Metadata
One search to many holdings

[Diagram: a single METADATA SEARCH / GO box sits above distributed metadata XML
repositories, which in turn sit above Holding 1 to Holding 5]
                                    Hierarchical Application of
                                                     Metadata Standards

 [Diagram: three levels, each applicable to a single object, database or collection]

  Level        Metadata standard
  Discovery    Dublin Core
  Access       Full CSDGM and Biological Data Profile
  Use          Station/Point source

 Other labels in the diagram: Specialists, Practitioners, Public; Content
 Management; Cluster Management; Geospatial Resources; Temporal Resources;
 Geospatial Mapping; Web Services
                                       Three Levels of
                                            Metadata

A flexible strategy that matches effort to need, using internationally
recognized standards

Discovery
Most EC information assets will be discovered at this simplest level.
This could be for a collection, database or single object.

Access
Using the full geospatial and/or biological profile, this level will
provide for the comprehensive description and disclosure of data.

Use
This level will allow for the use of biological or geospatial metadata at
the station level for visualization and data extraction web services.
            Searching for Discovery
[Diagram: Discovery level]

            Searching for Access
[Diagram: Discovery and Access levels]

            Searching for Use
[Diagram: Discovery, Access, and Use levels]
                                        Next Steps

• prototype the Discovery, Access, and Use model
  in EC
   • integrate searching for
      • web resources
      • data (spatial, tabular, ...)
      • books, reports (library)
      • geospatial resources
      • station level
      • etc.
• work with NRCan and other 5NR departments to
  explore the Dublin Core model further