The Grid and the Future of Business

Reviews
The Grid and the Future of Business Ian Foster Mathematics and Computer Science Division Argonne National Laboratory and Department of Computer Science The University of Chicago http://www.mcs.anl.gov/~foster Grid Computing Technical Universe, Circa 1890 • Ubiquitous communication infrastructure – The telegraph • Local power generation – Electric power generators serve at most city blocks Technical Universe, Circa 2000 • Ubiquitous comms infrastructure – Internet, email, Web • Ubiquitous power distribution – (Reliable?) standard access – Tremendous variety of devices • Local computing – Most computing and storage on internal enterprise computers Exponentials (and Coefficients) • Network vs. computer performance – Computer speed doubles every 18 months – Network speed doubles every 9 months – Difference = order of magnitude per 5 years • 1986 to 2000 – Computers: x 500 – Networks: x 340,000 • 2001 to 2010 – Computers: x 60 – Networks: x 4000 Scientific American (Jan-2001) Therefore: A Computing Grid • On-demand, ubiquitous access to computing, data, and services • New capabilities constructed dynamically and transparently from distributed services “When the network is as fast as the computer's internal links, the machine disintegrates across the net into a set of special purpose appliances” (George Gilder) My Presentation • The emergence of the Grid concept – Origins in eScience, and the Globus Toolkit • Grids and e-business – Opportunities & requirements • Technology convergence – Open Grid Services Architecture • Summary My Presentation • The emergence of the Grid concept – Origins in eScience, and the Globus Toolkit • Grids and e-business – Opportunities & requirements • Technology convergence – Open Grid Services Architecture • Summary E-Science: The Original Grid Driver • Pre-electronic science – Theorize &/or experiment, in small teams • Post-electronic science – Construct and mine very large databases – Develop computer simulations & analyses – Access specialized devices remotely – Exchange information within distributed multidisciplinary teams  Need to manage dynamic, distributed infrastructures, services, and applications And Thus: The Grid “Resource sharing & coordinated problem solving in dynamic, multiinstitutional virtual organizations” • Early 90s The Grid: A Brief History –Gigabit testbeds, metacomputing • Mid to late 90s –Early experiments (e.g., I-WAY), academic software projects (e.g., Globus), applications • 2002 –Dozens of application communities & projects –Significant technology base (Globus ToolkitTM) –Global Grid Forum: ~500 people, 20+ countries Sloan Digital Sky Survey Analysis Sloan Digital Sky Survey Analysis Size distribution of galaxy clusters? 100000 Galaxy cluster size distribution Chimera Virtual Data System + iVDGL Data Grid (many CPUs) 10000 1000 100 10 1 1 10 Number of Galaxies 100 A Large Virtual Organization: CERN’s Large Hadron Collider 1800 Physicists, 150 Institutes, 32 Countries 100 PB of data by 2010; 50,000 CPUs? Data Grids for High Energy Physics ~PBytes/sec Online System There is a “bunch crossing” every 25 nsecs. There are 100 “triggers” per second Each triggered event is ~1 MByte in size ~100 MBytes/sec 1 TIPS is approximately 25,000 SpecInt95 equivalents Offline Processor Farm ~20 TIPS ~100 MBytes/sec ~622 Mbits/sec or Air Freight (deprecated) Tier 0 Italy Regional Centre CERN Computer Centre Tier 1 France Regional Centre Germany Regional Centre FermiLab ~4 TIPS ~622 Mbits/sec Tier 2 ~622 Mbits/sec Institute Institute Institute ~0.25TIPS Physics data cache ~1 MBytes/sec Caltech ~1 TIPS Tier2 Centre Centre Tier2 Tier2 Centre Tier2 Centre ~1 TIPS ~1 TIPS ~1 TIPS ~1 TIPS Institute Physicists work on analysis “channels”. Each institute will have ~10 physicists working on one or more channels; data for these channels should be cached by the institute server Tier 4 Physicist workstations Grids at NASA: Aviation Safety Wing Models •Lift Capabilities •Drag Capabilities •Responsiveness Stabilizer Models Airframe Models •Deflection capabilities •Responsiveness Crew Capabilities - accuracy - perception - stamina - re-action times - SOPs Engine Models Human Models •Braking performance •Steering capabilities •Traction •Dampening capabilities Landing Gear Models •Thrust performance •Reverse Thrust performance •Responsiveness •Fuel Consumption Life Sciences: Telemicroscopy DATA ACQUISITION PROCESSING, ANALYSIS ADVANCED VISUALIZATION NETWORK IMAGING INSTRUMENTS COMPUTATIONAL RESOURCES LARGE DATABASES Underlying Technical Requirements • Dynamic formation and management of virtual organizations • Online negotiation of access to services: who, what, why, when, how • Configuration of applications and systems able to deliver multiple qualities of service • Autonomic management of distributed infrastructures, services, and applications State of the Art: Globus ToolkitTM (since 1996) • Small, standards-based set of protocols – Authentication, delegation; resource discovery; reliable invocation; etc. • Information-centric design – Data models; publication, discovery protocols • Open source implementation & community – With commercial support • Enabler of services and applications Grid Projects in eScience My Presentation • The emergence of the Grid concept – Origins in eScience, and the Globus Toolkit • Grids and e-business – Opportunities & requirements • Technology convergence – Open Grid Services Architecture • Summary And What About Business? • Fragmentation of enterprise infrastructure – Specialized platforms -> commodity servers – “Intelligence” embedded in networks • The rise of the “eUtility” (IBM, HP, …) – Outsourcing, economies of scale • Business-to-business computing – Especially complex virtual organizations • Ever more challenging QoS requirements – “Green-screen” -> “ubiquitious web presence” Today’s Enterprise Computing Environment The Business Opportunity • On-demand computing, storage, services – Significant savings due to reduced buildout, economies of scale, reduced admin costs – Greater flexibility => greater productivity • Entirely new applications and services – Based on high-speed resource integration • Solution to enterprise computing crisis – Render distributed infrastructures manageable Grids and Industry: Early Examples Entropia: Distributed computing (BMS, Novartis, …) Butterfly.net: Grid for multi-player games Realizing the Promise Requires Significant Innovation • Automation of infrastructure operation to achieve economies of scale • Management and component models for distributed service provisioning • New applications and tools powered by distributed services and resources • Business and service models to support specialization of function My Presentation • The emergence of the Grid concept – Origins in eScience, and the Globus Toolkit • Grids and e-business – Opportunities & requirements • Technology convergence – Open Grid Services Architecture • Summary Grid Evolution: Open Grid Services Architecture • Refactor Globus protocol suite to enable common base and expose key capabilities • Service orientation to virtualize resources and unify resources/services/information • Embrace key Web services technologies: standard IDL, leverage commercial efforts • Result: standard interfaces & behaviors for distributed system management Transient Service Instances • “Web services” address discovery & invocation of persistent services – Interface to persistent state of entire enterprise • In Grids, must also support transient service instances, created/destroyed dynamically – Interfaces to the states of distributed activities – E.g. workflow, video conf., dist. data analysis • Significant implications for how services are managed, named, discovered, and used Open Grid Services Architecture • Defines fundamental (WSDL) interfaces and behaviors that define a Grid Service – Required + optional interfaces = WS “profile” • Defines WSDL extensibility elements – E.g., serviceType (a group of portTypes) • Open source Globus Toolkit 3.0 – Leverage GT experience, code, community • And also commercial implementations The Grid Service = Interfaces/Behaviors + Service Data Service data access Explicit destruction Soft-state lifetime Binding properties: - Reliable invocation - Authentication GridService (required) … other interfaces … (optional) Service data element Service data element Service data element Implementation Hosting environment/runtime (“C”, J2EE, .NET, …) Standard: - Notification - Authorization - Service creation - Service registry - Manageability - Concurrency + applicationspecific interfaces Example: Database Service • DBaccess Grid service supports at least Grid two portTypes Service – GridService – DBaccess DBaccess Name, lifetime, etc. DB info • Each has service data – GridService: basic introspection, lifetime, … – DBaccess: database type, current load, …, … • Maybe other portTypes as well – E.g., NotificationSource (SDE = subscribers) Data Mining for Bioinformatics Community Registry Mining Factory Database Service BioDB 1 User Application Compute Service Provider . . . Database Factory . . . Database Service BioDB n Storage Service Provider “I want to create a personal database containing data on e.coli metabolism” Data Mining for Bioinformatics “Find me a data Community mining service, and Registry somewhere to store data” Mining Factory Database Service BioDB 1 User Application Compute Service Provider . . . Database Factory . . . Database Service BioDB n Storage Service Provider Data Mining for Bioinformatics Community Registry Handles for Mining and Database factories User Application Mining Factory Database Service BioDB 1 Compute Service Provider . . . Database Factory . . . Database Service BioDB n Storage Service Provider Data Mining for Bioinformatics Community Registry “Create a data mining service with initial lifetime 10” User Application Mining Factory Database Service BioDB 1 “Create a database with initial lifetime 1000” Compute Service Provider . . . Database Factory . . . Database Service BioDB n Storage Service Provider Data Mining for Bioinformatics Community Registry “Create a data mining service with initial lifetime 10” User Application Mining Factory Database Service BioDB 1 . . . Database Service BioDB n Miner Compute Service Provider . . . Database Factory “Create a database with initial lifetime 1000” Database Storage Service Provider Data Mining for Bioinformatics Community Registry Mining Factory Query Database Service BioDB 1 . . . Database Service BioDB n Miner User Application Compute Service Provider . . Query . Database Factory Database Storage Service Provider Data Mining for Bioinformatics Community Registry Mining Factory Query Database Service BioDB 1 . . . Database Service BioDB n Keepalive User Application Miner Compute Service Provider . . Query . Keepalive Database Factory Database Storage Service Provider Data Mining for Bioinformatics Community Registry Mining Factory Database Service BioDB 1 Results . . . Keepalive User Application Miner Compute Service Provider . . . Keepalive Database Factory Database Service Results BioDB n Database Storage Service Provider Data Mining for Bioinformatics Community Registry Mining Factory Database Service BioDB 1 . . . Database Service BioDB n Database Storage Service Provider Miner User Application Compute Service Provider . . . Keepalive Database Factory Data Mining for Bioinformatics Community Registry Mining Factory Database Service BioDB 1 User Application Compute Service Provider . . . Keepalive Database Factory . . . Database Service BioDB n Database Storage Service Provider GT3: OGSA-Based Globus Toolkit • GT3 Core – Grid service interfaces – Reference impln of evolving standard – Multiple hosting envs: Java/J2EE, C, C#/.NET? GT3 Data Services Other Grid Services • GT3 Base Services – Globus capabilities GT3 Base Services GT3 Core • Many other services OGSA: Current Status • Grid service specification & other documents moving forward in GGF – www.gridforum.org/ogsi-wg • Globus Project on track for open source OGSA-based GT3 release end of 2002 – www.globus.org/ogsa • IBM committed to various OGSA-compliant software releases (e.g., WebSphere) • Other industrial efforts underway My Presentation • The emergence of the Grid concept – Origins in eScience, and the Globus Toolkit • Grids and e-business – Opportunities & requirements • Technology convergence – Open Grid Services Architecture • Summary Summary: The Grid • Resource sharing & coordinated problem solving in dynamic, multi-institutional virtual organizations – On-demand, ubiquitous access to computing, data, and services – New capabilities constructed dynamically and transparently from distributed services • Evolved to be dominant eScience, now transitioning to industry (think Web in 1994) Open Grid Services Architecture • Open Grid Services Architecture represents next step in Grid evolution • Service orientation enables unified treatment of resources, data, and services • Standard interfaces and behaviors (the Grid service) for managing distributed state • Open source Globus Toolkit implementation (and numerous commercial value adds) For More Information • Grid Book – www.mkp.com/grids • Survey articles – www.mcs.anl.gov/~foster • The Globus Project™ – www.globus.org • Global Grid Forum – www.gridforum.org – Edinburgh, July 22-24 – Chicago, Oct 15-17

Related docs
about grid
Views: 1  |  Downloads: 0
Future of Grid and Globus�
Views: 23  |  Downloads: 5
The Grid
Views: 10  |  Downloads: 2
The Grid and the Future of Business
Views: 111  |  Downloads: 3
Grid Computing The Future of Your Business
Views: 43  |  Downloads: 7
The Car, the Grid, the Future
Views: 6  |  Downloads: 0
1What is a Grid
Views: 108  |  Downloads: 9
What is the Grid
Views: 33  |  Downloads: 2
Single Sign-on to the Grid
Views: 0  |  Downloads: 0
THE FUTURE OF THE GRID
Views: 37  |  Downloads: 2
a grid service management model to e business
Views: 3  |  Downloads: 0
EGEE and the future of Grid Infrastructures
Views: 21  |  Downloads: 5
premium docs
Other docs by mirit35