Open DMIX
High Performance Web Services for
Data Mining, Data Integration, and
Data Exploration
Robert L. Grossman, Steve Eick, Yunhong Gu,
David Hanley & Xinwei Hong
National Center for Data Mining
Goal: Integrate & Explore
Distributed Data
Middleware   # Sites   Critical Path     Avg. Arch.   Avg. Data Set   Avg. Select
Data Grid    1000      cycles            PB           TB              GB
Data Web     1M        access & integ.   TB           GB              MB
High Performance Web Services
                                Bandwidth Challenge
Discovery          UDDI
Description        WSDL
Packaging          XML          DSTP
Transport          SOAP/HTTP    SOAP+
Network Protocol   TCP          SABUL/UDT
*Open DMIX is an open source collection of web services
for data mining, data integration and data exploration
SABUL/UDT Protocol Overview
Uses both rate control (RC) and window-based
flow control (FC)
– Constant RC interval to remove RTT bias
– Employs bandwidth estimation
Selective acknowledgement (ACK)
– Reduces control traffic & results in faster recovery
Uses packet delay as well as packet loss to
indicate congestion
Slow start
– controlled by FC
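
The interplay of the mechanisms listed above can be illustrated with a toy model. This is only a sketch under stated assumptions, not the SABUL/UDT implementation: the class ToySender, the helper loss_ranges, the 0.01 s interval, and the adjustment factors are all hypothetical names and values chosen for illustration. It shows a sender that transmits only when both the rate-control pacing and the flow-control window allow it, adjusts its rate on a fixed timer rather than once per round trip, treats rising delay like loss as a congestion signal, and reports losses as compact ranges in the spirit of selective acknowledgement.

```python
from dataclasses import dataclass

# Assumed constant rate-control interval (seconds). Because it is fixed,
# rate updates do not depend on a flow's RTT.
RC_INTERVAL = 0.01


@dataclass
class ToySender:
    """Toy sender combining rate control (RC) and window-based flow control (FC)."""
    send_period: float = 0.001        # seconds between packets, tuned by RC
    window: int = 16                  # max unacknowledged packets, the FC limit
    in_flight: int = 0                # packets sent but not yet acknowledged
    last_send: float = float("-inf")  # time of the most recent send

    def can_send(self, now: float) -> bool:
        paced = (now - self.last_send) >= self.send_period  # RC check
        has_room = self.in_flight < self.window             # FC check
        return paced and has_room

    def on_send(self, now: float) -> None:
        self.last_send = now
        self.in_flight += 1

    def on_ack(self, acked: int) -> None:
        self.in_flight = max(0, self.in_flight - acked)

    def on_rc_tick(self, loss_seen: bool, delay_rising: bool) -> None:
        """Call once per RC_INTERVAL; loss or rising delay both mean congestion."""
        if loss_seen or delay_rising:
            self.send_period *= 1.125  # slow down (illustrative factor)
        else:
            self.send_period *= 0.95   # speed up (illustrative factor)


def loss_ranges(received, highest):
    """Compress missing sequence numbers in 0..highest into (first, last) ranges."""
    got = set(received)
    ranges, start = [], None
    for seq in range(highest + 1):
        if seq not in got:
            if start is None:
                start = seq
        elif start is not None:
            ranges.append((start, seq - 1))
            start = None
    if start is not None:
        ranges.append((start, highest))
    return ranges


if __name__ == "__main__":
    sender = ToySender(window=2)
    print(sender.can_send(0.0))               # first packet is allowed
    print(loss_ranges([0, 1, 4, 5, 8], 8))    # missing 2-3 and 6-7
```

Reporting losses as ranges rather than one message per packet is one way a selective scheme can keep control traffic small; the real protocol's packet formats and update rules are not given here and are not reproduced by this sketch.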
UDT Fast, Fair & Friendly,
Easy & Efficient
Friendly to TCP flows
Fair to other high-performance flows
Fast
Efficient
New Trans-Atlantic Milestone for
FFFEE Data Transport
For More Information
Robert Grossman
grossman at uic dot edu
www.ncdm.uic.edu
www.dataspaceweb.net
www.sourceforge.net/projects/dataspace
Please join our open source project on SourceForge,
which develops network protocols, data protocols,
and web services.