Docstoc

NIEeS SRB Introduction

Document Sample
NIEeS SRB Introduction Powered By Docstoc
					                     An Introduction to the SDSC
                      Storage Resource Broker

                          NIEeS SRB Workshop
                              January 2006
                             Peter Berrisford


NIEeS SRB Workshop                                        Peter Berrisford
January 2006                                       CCLRC e-Science Centre
                                        What is SRB?

  • Data Grid middleware developed at the
    San Diego Supercomputer Center (SDSC)
  • A distributed file system, based on a client-server
    architecture.
  • Allows users to access files seamlessly across a
    distributed environment, based upon their attributes
    rather than just their names or physical locations.
  • It replicates, syncs and archives data, connecting
    heterogeneous resources in a logical and abstracted
    manner.
NIEeS SRB Workshop                                    Peter Berrisford
January 2006                                   CCLRC e-Science Centre
                                   Why use SRB?
• Users need to know their data is:
   – Collected and stored safely
   – Easily accessible, from anywhere
      • By the right people
      • Reasonable response times
      • Metadata to assist searching
      • Available 24/7, this year and next and next…
• Don’t need to know how – need to be freed from IT
  concerns to focus on their research
• Available for collaboration
NIEeS SRB Workshop                                  Peter Berrisford
January 2006                                 CCLRC e-Science Centre
                Data Virtualization with SRB
                                      User Application




                            Archive        Database        File System
                            at RAL         At UCL          at Daresbury
NIEeS SRB Workshop                                           Peter Berrisford
January 2006                                          CCLRC e-Science Centre
                 Data Virtualization with SRB
Logical name space
 Location independent identifier             User Application
 Persistent identifier

Collection owned data
 Access controls             Common naming convention and set of
 Audit trails                attributes for describing digital entities
 Checksums
 Descriptive metadata

Inter-realm authentication         Archive        Database        File System
 Single sign-on system             at RAL         At UCL          at Daresbury
 NIEeS SRB Workshop                                                 Peter Berrisford
 January 2006                                                CCLRC e-Science Centre
                     The Application of SRB




NIEeS SRB Workshop                                   Peter Berrisford
January 2006                                  CCLRC e-Science Centre
                                            Archival Services
            Archival Services across single or distributed sites
                                    Archival Services operate on economy
   Institute Sites                  of scale and require expert staff to
                                    operate them, thus central services for
                                    larger Grids make financial sense.

                                    CCLRC operate these services both
                                    „on site‟ and for external partners.
   Archival
   Service                          This example is for a customer with
                                    about 16 sites across the UK. They
                                    operate on their own network, with
                                    only their main site connected to
                                    Janet. Scheduled archives and restores
        Central Site                are handled via this central site.
NIEeS SRB Workshop                                                 Peter Berrisford
January 2006                                                CCLRC e-Science Centre
                                         Archival Process Data Transfer

Remote Institute Site
                                                                 “Request Tracker” database used to
                  Local
                                                                    control data transfer process
                 Storage
      Filer


                                         Central ‘Cache’ Site
                                                                                              RAL
                                                                               Firewall
                                  Site                                 JANET
                                  WAN                  Central          WAN
                                                        SRB                                      Tape Traffic
 Local                Local
                                                       Server
machines               SRB Sphymove in
                      Server to container                                              SRB-ADS
                                                                                                                         ADS Tape
                                                disk             Sreplcont              Server
                                                                                                                         Resource
              disk                                                                        ADS SRB
                                                       Central                  disk        Disk
                                                       “cache”
              Local                                     Vault
                                                                                           Cache
              Vault                                                                       Resource
                                           Firewall
                       Firewall                         Note firewalls


NIEeS SRB Workshop                                                                                                     Peter Berrisford
January 2006                                                                                                    CCLRC e-Science Centre
                                                           Request Tracker
  • Tracks the movement of “archive packages” using a
    database application
  • Monitors archive, restore and extract utilities
  • Provides error handling, reporting and request
    priority intelligence
  • Interacts with front-end application, server scripts
    and CCLRC Atlas Data Store
  • Takes advantage of Oracle features
  • Scripts call stored procedures
      $ Spcommand "rt_set_archive_ingestion_start /archive/gdb/ MyArchive MySite 5 10296"
      0
      1000532
      $ Spcommand "rt_get_archive_packagename_by_accessionid 1000532"
      0
      /archive/gdb/remote-archive/archive/gdb/1000532
NIEeS SRB Workshop                                                                Peter Berrisford
January 2006                                                               CCLRC e-Science Centre
                                                 DIAMOND Architecture
       Beam line          …      ADSC                   ADSC               ADSC
         control
       system and
        Detector              RAID 2TB               RAID 2TB           RAID 2TB


       20TB disc store,       20TB disc store,       20TB disc store,   20TB disc store,
                                                                                                SRB
        some fraction     …    some fraction          some fraction      some fraction          space
         allocated as           allocated as           allocated as       allocated as
          SRB Vault              SRB Vault              SRB Vault          SRB Vault



                                                            160TB
   Background transfer of data to ADS                      SRB Vault

Logical view of data can                             Replicas maintained across resources
    be maintained
                                           ADS
                                          Resource
NIEeS SRB Workshop                                                                         Peter Berrisford
January 2006                                                                        CCLRC e-Science Centre
                                                    ISIS Facility
                     Experimental results distribution




                                   SRB                   Hole
                                  Server                 opened in
                                (no vault)
                                                         firewall



                                ADS-SRB
                                 Server

                               Atlas Drivers


                                                     ADS
NIEeS SRB Workshop                                              Peter Berrisford
January 2006                                             CCLRC e-Science Centre
                     DLS Zone




                     ISIS Zone

NIEeS SRB Workshop                      Peter Berrisford
January 2006                     CCLRC e-Science Centre
                     SRB Implementation




NIEeS SRB Workshop                               Peter Berrisford
January 2006                              CCLRC e-Science Centre
                     How is SRB implemented?

 • The logical file system structure is maintained
   in a database:
    – MCAT: Metadata Catalogue
 • All metadata (system and application) stored
   in the MCAT
 • Files stored in distributed physical resources
   (“vaults”)
 • Logical resources can be created, mapping to
   multiple physical resources

NIEeS SRB Workshop                            Peter Berrisford
January 2006                           CCLRC e-Science Centre
                  SRB – The Key Components
                                      MCAT
                                      Database            • The Metadata
                          c             d                   Catalogue (MCAT)
                                      MCAT
                                      Server              • The MCAT-Enabled
                  b
                              e
                                                            SRB Server
                                  f
                                                          • The SRB Storage
         SRB A
         Server
                                            g
                                                 SRB B
                                                 Server     Server
                                                          • The SRB Client
                      a
                                        SRB
                                        Client




NIEeS SRB Workshop                                                             Peter Berrisford
January 2006                                                            CCLRC e-Science Centre
                     The MCAT Database
                     • The MCAT database is a metadata
                       repository that provides a
                       mechanism for storing information
                       used by the SRB system.
                     • Includes both
                        – Internal system data required for
                          running the system
                        – Application (user) metadata
                          regarding data sets being
                          brokered by SRB.
NIEeS SRB Workshop                                       Peter Berrisford
January 2006                                      CCLRC e-Science Centre
                     The MCAT Server
                     • At least one SRB Server must be
                       installed on the node that can
                       access the MCAT database. This
                       is known as the MCAT-Enabled
                       Server (MES)
                     • The MES works directly against
                       the MCAT database to provide
                       SRB Services
                     • All other SRB Servers interact
                       through the MCAT Server

NIEeS SRB Workshop                                  Peter Berrisford
January 2006                                 CCLRC e-Science Centre
                     The SRB Server

                     • The SRB Server is a middleware
                       application that accepts requests
                       from clients and
                       obtains/queries/manages the
                       necessary data sets.
                     • It queries the MCAT SRB Server
                       to gather information on datasets
                       and supplies this back to the
                       SRB client.
NIEeS SRB Workshop                                   Peter Berrisford
January 2006                                  CCLRC e-Science Centre
                     SRB Client Tools

                      • Provide a user interface to send
                        requests to the SRB server.
                      • 4 main interfaces:
                         – Command line (S-Commands)
                         – MS Windows (InQ)
                         – Web based (MySRB).
                         – Java (JARGON)
                      • Web Services (MATRIX)
NIEeS SRB Workshop                                   Peter Berrisford
January 2006                                  CCLRC e-Science Centre
                                      Logical Resources

                                 SRB Server
      “instant replication”

         “fast archival”

            “resource pooling”




     “My-Logical-Resource”
                                 SRB Server




NIEeS SRB Workshop                                   Peter Berrisford
January 2006                                  CCLRC e-Science Centre
                     SRB Containers - Archival of Small files

• Performance issues with storing / retrieving a large number of
  small files to / from tape
• Container design
      – Physical grouping of small files
      – Implemented with a Logical Resource
         • A pool of Cache Resource for the front-end resource
          • An Archival Resource for the backend resource
      – Read/Write I/O always done on Cache Resource and sync
        to the Archival Resource
          • The entire container is moved between the cache and
            the archival resource and written to tape
          • Bulk operation with container - faster
NIEeS SRB Workshop                                            Peter Berrisford
January 2006                                           CCLRC e-Science Centre
                                            Effective use of SRB
                                              The ADS Interface

                           ADS-logical-resource




                     ADS-cache                      ADS-tape
                      resource                      resource
                                   Ssyncont

                                 Copies container
                                   from cache
                                     to tape



                                                        Logical Resource
                          Sput
                           –c <container-name>           for Containers
                           <data-file>

NIEeS SRB Workshop                                                    Peter Berrisford
January 2006                                                   CCLRC e-Science Centre
           SRB MCAT                        Atlas Data
           Database                        Store               SRB ADS Server
                                                        Port
                                                        8600     SRB-ISIS
                                                                   server
                                                                 instance
        SRB MCAT                                        Port
                                        SRB             8610     SRB-BADC
        Server                          ADS
                                        Server
                                                                    server
                                                                  instance
                                                        Port
                                                        8620     SRB-BBSRC
                      SRB Disk                                     server
                      Server                                      instance
                      (Local Server)




                            SRB
                            Client     SRB-ADS architecture
NIEeS SRB Workshop                                                     Peter Berrisford
January 2006                                                    CCLRC e-Science Centre
                                      SRB Services Architecture
                                           App                  App               App                     App                   App




                          ADS-SRB
                                                          SRB                        SRB
                                                                                       SRB                       SRB
                                                                                                                   SRB                 SRB
                                                                                                                                         SRB
             ADS           Multiple                         SRB
                                                           SRB                        SRB                         SRB                   SRB
                           Servers                       Server                     Server
                                                                                     Storage                    Server
                                                                                                                 Storage              Server
                                                                                                                                       Storage
                                                          Storage
                                                          Server                     Server                      Server                Server
                                                           Servers                    Servers                     Servers              Servers




                                      MCAT
                                      Server              MES                         MES                        MES                    MES
             Web Server



                                      Oracle Client




                                                                      DB-Instance-1                       DB-Instance-2                 Oracle RAC
                                                                                                                                         Database
                                                                                MCAT Database                                             Server


                                                      Schema1             Schema2               Schema3               Schema4




NIEeS SRB Workshop                                                                                                                             Peter Berrisford
January 2006                                                                                                                            CCLRC e-Science Centre
                      SRB APIs
• Shell commands (Scommands)
     – Provide access to all SRB functions
• mySRB web browser
     – Provides hierarchical collection view
• inQ Windows browser
     – Provides Windows style directory view
• C library calls
     – Provide access to all SRB functions
• Jargon Java API
     – Similar to java.io. API
• Matrix WSDL/SOAP Interface
     – Aggregate SRB requests into a SOAP request. Has a Java
         API and GUI
• Python, Perl, C++, OAI, Windows DLL, Mac DLL, Linux I/O
    redirection, GridFTP
 NIEeS SRB Workshop                                    Peter Berrisford
January 2006                                    CCLRC e-Science Centre
                     Terminology

  • Location: A physical node running an SRB
    Server
  • Physical Resource: A storage area
    managed by an SRB Server
  • Logical Resource: One or more Physical
    Resources – can be distributed
  • Collection – Data abstraction of resources
NIEeS SRB Workshop                           Peter Berrisford
January 2006                          CCLRC e-Science Centre

				
DOCUMENT INFO
Shared By:
Categories:
Stats:
views:7
posted:3/5/2010
language:
pages:26