									           NCSU Libraries:
Digital Repository Activities



                         James Jackson Sanborn

                                   NCSU Libraries: Digital Repository Activities
    Digital Preservation in State Government– Best Practices Exchange 2006
                                                   James M Jackson Sanborn
                 Ongoing DR projects
• NCSU Technical Reports Repository

• NCSU Faculty Publications Repository

• North Carolina Geospatial Digital
  Archiving Project (NCGDAP)


   Shared Technical Backbone
• DSpace 1.3.2

• PostgreSQL 8.1

• Solaris 9 – Clustered environment



       DSpace in a Nutshell
[Diagram: DSpace at the center, linking Metadata and Bitstreams]




                 DSpace in a Nutshell
Local Modifications/peculiarities
• Remote Handle server patch
• Multiple instances
  – Handle subdomains
  – Differing levels of access
  – More control over look and feel
  – Different Asset Store systems
  – Share Tomcat and PostgreSQL

                 DSpace in a Nutshell
DSpace and Preservation
  – Bitstream preservation
  – QDC Metadata, including basic
    preservation metadata
  – Format Support
     • Supported
     • Partially supported
     • Unknown/not supported

                DSpace in a Nutshell
DSpace and Preservation
• Asset Store backup

• Database backup

• Non-DSpace recovery option


         DSpace out of the Nutshell
Metadata Table “bitstream”
Filename:            PRVcan006.pdf
Checksum:            7f697a91a6c4f339872586fa7cd1f907
Internal_id:         119711803630287942345034590264166439013

   11 is dir

         97 is dir

               11 is dir

                      119711803630287942345034590264166439013 is file
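
The directory breakdown above suggests how a bitstream can be located on disk without DSpace: the first three pairs of digits of the Internal_id name the nested directories, and the full Internal_id is the file name. A minimal Python sketch of that derivation follows. The function name asset_store_path and its parameters are hypothetical, and the two-digit, three-level layout is inferred from this single example, so it should be checked against the local asset store before being relied on.

```python
from pathlib import PurePosixPath


def asset_store_path(internal_id: str, levels: int = 3, digits: int = 2) -> PurePosixPath:
    """Derive a bitstream's path, relative to the asset store root, from its Internal_id.

    Assumption: each directory level is named by the next `digits` characters
    of the Internal_id, and the file itself is named by the full Internal_id.
    """
    if len(internal_id) < levels * digits:
        raise ValueError("internal_id is too short to derive a directory path")
    dirs = [internal_id[i * digits:(i + 1) * digits] for i in range(levels)]
    return PurePosixPath(*dirs, internal_id)


# The Internal_id from the "bitstream" table example above.
print(asset_store_path("119711803630287942345034590264166439013"))
# prints 11/97/11/119711803630287942345034590264166439013
```

Paired with the Filename and Checksum columns from a database backup, a path derived this way is what would make a non-DSpace recovery of the files possible.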

       NCSU Technical Reports
Technical Report Series
  – 23 targeted departments and institutes
  – ~1,000 reports available online
  – Digital continuations of print/mimeo series
  – Item level indexing
  – Not a replacement for usual means of
    distribution
  – Risk of content loss

        NCSU Technical Reports
DSpace
• Collection Structure:
  – University College or Office
     • Academic Department or Institute
        – Series
            » Individual reports

• Collection Workflow: series by series
  – Library harvested – Library described
  – Unit submitted – Library described
  – Unit submitted – Unit described

     NCSU Faculty Publications
                   Repository
• Previously existing faculty citation
  database

• Mix of citations and open access content
  – Open access journals
  – Journal repository archiving rights
  – Author negotiated rights

    NCSU Faculty Publications
                  Repository
• ~17,000 citations

• 5-year review of citations, 2000-2004
  – 370 publishers
     • 727 articles available for inclusion
     • 303 articles available for inclusion with embargo
     • More available for inclusion as post-refereed pre-print

    NCSU Faculty Publications
                  Repository
Technical Architecture
  – Oracle database
     • All citations
     • Advanced authority control
     • Easier maintenance (PHP, MS Access)


  – DSpace
     • Full-text content only
     • Stripped down interface

    NCSU Faculty Publications
                  Repository
Technical Architecture

[Diagram: search and retrieval go through the FPR interface (PHP), which draws on the citations database (Oracle); DSpace delivers the files]

                                                       NCGDAP
North Carolina Geospatial Digital
 Archiving Project
  – Joint project
     • NCSU Libraries and
     • North Carolina Center for Geographic
       Information and Analysis (CGIA)
  – National Digital Information Infrastructure
    and Preservation Program (NDIIPP)
     • Collaborative Partnership with the
       Library of Congress
                                                       NCGDAP
• Leverage NC OneMap data inventory
• Acquire at-risk geospatial data
  – static data (digital orthophotos)
  – time series data (land records & assessment data)
• Develop a digital repository architecture
  for geospatial data
• Enhance geospatial metadata
  – preservation metadata
  – METS
• Investigate automated identification and capture of data

                                                       NCGDAP
Local & State Gov’t Geospatial Data
• Raster
  – Orthophotography
  – Digital Elevation Models
• Vector
  – Land Parcels/Cadastral
  – Roads
  – Boundaries
  – etc.

                                                      NCGDAP
Geospatial Data Risks
• Distributed Creation
• Creator needs ≠ Future needs
• Format issues
  – Proprietary
  – Complex
• Metadata problems


                                                        NCGDAP
Architecture
• DSpace with PostgreSQL
  – Low cost
  – Not heavily customized
  – Not geospatially aware

• Storage
  – Two 15-terabyte ATABeast disk systems
  – Offsite mirroring = 12.6 TB usable

                                                         NCGDAP
Pre-ingest (hub with single spoke)
• Data cleaning (DMZ)
  – Automated data identification
  – File manifests and on-receipt migration
• Metadata Management (WMD)
  – Pre-existing FGDC metadata
  – Normalized FGDC metadata
  – QDC mapping
  – METS
• Ingest-object creation
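
The QDC mapping step above can be illustrated with a small crosswalk from FGDC elements to qualified Dublin Core fields. The sketch below is an assumption for illustration only, not the project's actual mapping: the dictionary FGDC_TO_QDC, the function fgdc_to_qdc, and the sample record are all hypothetical. The element paths use the FGDC CSDGM short names.

```python
import xml.etree.ElementTree as ET

# Hypothetical crosswalk: FGDC CSDGM element path -> qualified Dublin Core field.
FGDC_TO_QDC = {
    "idinfo/citation/citeinfo/title": "dc.title",
    "idinfo/citation/citeinfo/origin": "dc.creator",
    "idinfo/citation/citeinfo/pubdate": "dc.date.issued",
    "idinfo/descript/abstract": "dc.description.abstract",
    "idinfo/keywords/theme/themekey": "dc.subject",
}


def fgdc_to_qdc(fgdc_xml: str) -> dict[str, list[str]]:
    """Map an FGDC record (root element <metadata>) to QDC field/value lists."""
    root = ET.fromstring(fgdc_xml)
    record: dict[str, list[str]] = {}
    for path, field in FGDC_TO_QDC.items():
        # Collect every non-empty occurrence; repeatable elements yield several values.
        values = [el.text.strip() for el in root.findall(path) if el.text and el.text.strip()]
        if values:
            record[field] = values
    return record


# Invented, simplified sample record (not real project data).
SAMPLE = """<metadata><idinfo>
  <citation><citeinfo>
    <origin>Example County GIS</origin>
    <pubdate>2005</pubdate>
    <title>Example County Parcels</title>
  </citeinfo></citation>
  <descript><abstract>Parcel boundaries.</abstract></descript>
  <keywords><theme>
    <themekey>cadastral</themekey>
    <themekey>parcels</themekey>
  </theme></keywords>
</idinfo></metadata>"""

for field, values in fgdc_to_qdc(SAMPLE).items():
    print(field, values)
```

Keeping the crosswalk in a plain dictionary means the mapping can be reviewed and revised without touching the parsing code.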
                                                       NCGDAP
Ingest and Storage
• Batch ingest into DSpace
  – No direct submission
  – Original and migrated data, original and normalized
    metadata, and METS record included
  – Collection structure based on data producer and
    data type
• Data verification
• No direct access to repository
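
One way to approach the data verification step is to compare files against a manifest of checksums recorded when the data was received. The sketch below is a hedged illustration, not the project's tooling: the function names md5_of, build_manifest, and verify_manifest are hypothetical, and MD5 is assumed only because the bitstream table example shows a 32-character hexadecimal checksum.

```python
import hashlib
from pathlib import Path


def md5_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the MD5 hex digest of a file, read in chunks to bound memory use."""
    digest = hashlib.md5()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(root: Path) -> dict[str, str]:
    """Map each file under root (as a relative POSIX path) to its checksum."""
    return {
        p.relative_to(root).as_posix(): md5_of(p)
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }


def verify_manifest(root: Path, manifest: dict[str, str]) -> list[str]:
    """Return a list of problems; an empty list means every file matched."""
    problems = []
    for rel, expected in manifest.items():
        target = root / rel
        if not target.is_file():
            problems.append(f"missing: {rel}")
        elif md5_of(target) != expected:
            problems.append(f"checksum mismatch: {rel}")
    return problems
```

In use, build_manifest would run once on receipt and verify_manifest after each later copy or ingest, with any reported problem holding the batch for review.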
                                                         NCGDAP
Export (still in planning)
• DSpace export object
  – Data
  – Metadata
     • FGDC
     • Normalized FGDC
     • METS
     • DSpace QDC
• Direct extraction
            Contact & Further info
• James_sanborn@ncsu.edu

• http://www.lib.ncsu.edu/ncgdap



