|
|
"High Speed Data Gathering, Distribution and Analysis for Physics Discoveries at the Large Hadron Collider" - Bandwidth ChallengeSupercomputing 2006 - Tampa, FloridaA collaboration between: Caltech, CERN, UMichigan, UFlorida, and many others Participants:
(The above list shows those present in Tampa: many others not listed assisted from their home institutions) Partners:
The Powerpoint Animated Logos display is HERE. Extended Abstract:CERN's Large Hadron Collider is due to begin operating in 2007, and at that time several large experiments will begin to accumulate particle collision data at continuous rates from several hundred MBytes per second, up to over a GByte per second. The data will be distributed over international networks to collaborating institutes and universities, where it will be processed and analysed by physicists and engineers engaged in the experiments. The experiment collaborations each number several thousand scientists, who are spread among hundreds of institutes situated in all World regions. The architectures of the global computing systems that have been designed to handle these unusual computing demands comprise state-of-the-art high speed optical networks that interconnect a Grid of powerful processor and data storage systems, and a rich suite of applications and systems software to support the scientific missions of the experiments. The global systems are already in the final stages of deployment, and are being scaled to meet the required multi-Petabyte annual capability for 2007, and beyond. In this context, we will demonstrate high speed transfers of event datasets between particle physics laboratories and universities collaborating on the ATLAS and CMS experiments at the LHC, and the BaBar experiment at SLAC's Linear Collider. Using state of the art WAN infrastructure and Grid-based Web Services, our demonstration will involve real-time particle event analysis that requires optimized transfers of Terabyte-scale datasets. Individual 10Gbit/sec network links (lambdas) will be used to full capacity during the Challenge, and we will utilize servers and disk systems featuring the latest generation of processors, PCI-Express-based NICs, and the latest firmware (with the help of strategic partnerships with several vendors). We intend to saturate all available lambdas arriving at the Tampa show floor, in full duplex mode. During the Challenge, we will monitor the WAN performance using Caltech's MonALISA agent-based system. We will deploy analysis software that demonstrates the use of a suite of Grid-enabled Analysis tools developed at Caltech and University of Florida, and we will make use of EGEE, OSG, ATLAS and CMS data management software such as SRM, dCache, FTS, and PhEdEx. Prototypes of the latest version of parallel NFS will demonstrate high utilization of a 10Gb connection during real-time event analysis. The use of the network will be in a realistic mixture of streams: those due to the organized transfer of multi-TeraByte event datasets, and those due to many smaller flows of physics data, absorbing the remaining capacity, in both directions on each wave. The intention is to prove the capability of the global HEP networks to support distributed physics analysis at the LHC, which will begin operating in 2007, as well as to emulate the real demands on the WAN during LHC operations. We intend to beat our SC2005 Bandwidth Challenge winning entry in which we aggregated over 150 Gbits/sec of physics dataset traffic. The physics datasets will contain real and simulated events produced at Fermilab, Caltech, SLAC, BNL, CERN and other partner Grid Service sites including Michigan, Florida, Manchester, Rio de Janeiro (UERJ) and Sao Paulo (UNESP) in Brazil, Korea (KNU), and Japan (KEK). Our network partners include LHCNet, Starlight, NLR, Internet2’s Abilene and HOPI, ESnet, UltraScience Net, MiLR, FLR, CENIC, Pacific Wave, UKLight, TeraGrid, Gloriad, AMPATH, RNP, ANSP and CANARIE. Results Summary: |
|
|