GENIUS: a Web Portal for DataGRID
Roberto Barbera(*)
Dipartimento di Fisica dell’Università di Catania and INFN Catania - Italy
ALICE Collaboration
(*) work in collaboration with A. Falzone and A. Rodolico
Prague, 12.12.2002
CHEP 2000, 10.02.2000
Outline
The Grid Vision: the concept of Collaboratory
Grid activities in Europe: the DataGrid Project
The main “actors” of the DataGrid Project
“Easy” and “ubiquitous” access to the Grid: the GENIUS web portal
The Future Grid Projects
Conclusions and perspectives
A new way to Co(l)laborate…
A Co(l)laboratory is a place where scientists and
researchers work together to solve complex interdisciplinary problems, despite geographic and
organizational boundaries.
Co(l)laboratories provide “uniform” and “democratic”
access to computational resources, services and/or
applications.
Co(l)laboratories expand the resources available to
researchers, enable multidisciplinary collaborations and
problem solving, increase the efficiency of research, and
accelerate the dissemination of knowledge.
Computational grids can be the natural solution for the
computing problems of a Co(l)laboratory.
The Grid metaphor
(courtesy of F. Gagliardi)
[Figure: Mobile Access, Workstation, and Visualising connect through GRID MIDDLEWARE to Supercomputer, PC-Cluster; Data-storage, Sensors, Experiments; Internet, networks]
[Figure: Mobile Access, Workstation, and Visualising reach CPU, storage, memory, and I/O resources through GRID MIDDLEWARE]
[Figure: Mobile Access, Workstation, and Visualising see the Grid as virtual services]
Present Grid investments in EU and USA
[Bar chart, values in millions, axis from 0 to 200. Bars: French ACI GRID; Italian Funding (MIUR+CNR+INFN); EU IST Funding; UK Government’s Office of Science and Technology; Distributed Terascale Facility (USA). Bar values in extraction order: 6; 38; 51; 196,1; 60,3]
Future figures:
US Cyber Infrastructure: 1020 M$
Japan (A-P) Grid: ~500 M$
GRID Projects – EU IST (~37 M€)
An integrated approach
[Figure: EU IST Grid projects arranged by layer (Applications; Middleware & Tools; Underlying Infrastructures) and by target (Industry / business; Science). Projects shown: EGSO, CROSSGRID, GRIP, EUROGRID, DAMIEN, GRIDSTART, GRIA, GRIDLAB, DATAGRID, DATATAG, iVDGL]
DataGRID main contractors
CERN – International (Switzerland/France)
CNRS - France
ESA/ESRIN – International (Italy)
INFN - Italy
NIKHEF – The Netherlands
PPARC - UK
Other DataGRID partners
Industrial Partners
•Datamat (Italy)
•IBM-UK (UK)
•CS-SI (France)
Research and Academic Institutes
•CESNET (Czech Republic)
•Commissariat à l'énergie atomique (CEA) – France
•Computer and Automation Research Institute,
Hungarian Academy of Sciences (MTA SZTAKI)
•Consiglio Nazionale delle Ricerche (Italy)
•Helsinki Institute of Physics – Finland
•Institut de Fisica d'Altes Energies (IFAE) - Spain
•Istituto Trentino di Cultura (IRST) – Italy
•Konrad-Zuse-Zentrum für Informationstechnik Berlin - Germany
•Royal Netherlands Meteorological Institute (KNMI)
•Ruprecht-Karls-Universität Heidelberg - Germany
•Stichting Academisch Rekencentrum Amsterdam (SARA) – Netherlands
•Swedish Research Council - Sweden
EDG overview : work packages
EDG Project is structured in 12 Work Packages:
Middleware (WP1-WP5):
WP1: Work Load Management System
WP2: Data Management
WP3: Grid Monitoring / Grid Information Systems
WP4: Fabric Management
WP5: Mass Storage Management
WP6: Testbed and demonstrators
WP7: Network Monitoring
Applications (WP8-WP10):
WP8: High Energy Physics Applications
WP9: Earth Observation
WP10: Biology
WP11: Dissemination
WP12: Management
Computational biology
Medical Diagnostic Imaging
Earth Observation Community
GRID interactive scenario
Common access to EO missions catalogues
Acquisition plan, order, delivery
On demand high level products generation
Parametric data fusion and models integration
Collaborative publishing of results
Earth Observation
ENVISAT (launched 01.03.2002 !)
• 3500 MEuro programme cost
• 10 instruments on board
• 200 Mbps data rate to ground
• 400 Tbytes data archived/year
• ~100 “standard” products
• 10+ dedicated facilities in Europe
• ~700 approved science user projects
High Energy Physics
[Figure: the CERN site with the LHC and SPS rings, scale label ~9 km; http://www.cern.ch]
High Energy Physics
ATLAS
CMS
~6-8 PetaBytes / year
~O(10^8) events/year
~O(10^3) batch and interactive users
LHCb
The Little Bang
The Big Bang
The Quark Gluon Plasma
Pb+Pb @ LHC (5.5 A TeV)
Total weight: 10.000 t
External diameter: 16,00 m
Total length: 25 m
Magnetic Field: 0.2-0.5 Tesla
ALICE On-line System
multi-level trigger to filter
background and reduce
the amount of data
The ALICE Collaboration already includes 1223 people from 85 institutes in 27 countries
1/100 of a Pb+Pb @ LHC !
Simulation and reconstruction of a “full” (central) Pb+Pb collision at LHC (about 84000 primary tracks!) takes about 24 hours on a top-end PC and produces more than 2 GB of output.
6-8 Petabytes ≈ 10.000.000 CD-ROM
≈ 1500 m, 5 times the Eiffel Tower
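The CD-ROM count quoted above can be checked with a line of arithmetic. The sketch below assumes decimal units (1 PB = 10^15 bytes) and a 700 MB disc; neither figure appears on the slide, and the function name is purely illustrative.

```python
# Assumptions (not from the slide): 1 PB = 10**15 bytes, one CD-ROM holds 700 MB.
BYTES_PER_PETABYTE = 10**15
BYTES_PER_CD = 700 * 10**6


def cds_needed(petabytes):
    """Number of CD-ROMs needed to hold the given volume, rounded up."""
    total_bytes = petabytes * BYTES_PER_PETABYTE
    return -(-total_bytes // BYTES_PER_CD)  # ceiling division on integers


for volume in (6, 8):
    print(f"{volume} PB -> about {cds_needed(volume) / 1e6:.1f} million CD-ROMs")
```

Under these assumptions the 6-8 PB range gives roughly 8.6 to 11.4 million discs, which brackets the order of magnitude of ten million quoted on the slide.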
ALICE “grid” sites
OSU/OSC
LBL/NERSC
Birmingham
Nantes
Dubna
NIKHEF
Saclay
GSI
CERN
Merida
Lyon
Torino
Padova
Bologna
IRB
Bari
~O(10^8) events/year and ~O(10^3) batch and interactive users in 2006
Cagliari
Yerevan
Catania
Kolkata, India
Capetown, ZA
The LHC “web” in the world
Europe: 267 institutes, 4603 users
Elsewhere: 208 institutes, 1632 users
Job Submission work-flow
[Figure: job submission work-flow. Components: UI, Resource Broker, Information Service, Replica Catalogue, Job Submission Service, Compute Element, Storage Element, Logging & Book-keeping, Authorization & Authentication. Labelled flows: JDL, Expanded JDL, Globus RSL, Input “sandbox”, Output “sandbox”, DataSets info, Publish, Job Submit Event, Job Query, Job Status]
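To make the first step of the work-flow concrete, here is a minimal sketch of how a job description in JDL could be generated before it is handed from the UI to the Resource Broker. The attribute names (Executable, Arguments, StdOutput, StdError, InputSandbox, OutputSandbox) are those commonly documented for the ClassAd-style EDG JDL, but they are assumptions here because the slide does not list them; the file names and the `to_jdl` helper are hypothetical.

```python
def to_jdl(attributes):
    """Render a dict as a ClassAd-style job description (one attribute per line)."""

    def render(value):
        # Lists become brace-delimited sets of quoted strings.
        if isinstance(value, (list, tuple)):
            return "{" + ", ".join(render(item) for item in value) + "}"
        return '"' + str(value) + '"'

    return "\n".join(f"{key} = {render(value)};" for key, value in attributes.items())


# Hypothetical job: a script shipped in the input sandbox, logs returned in the output sandbox.
job = {
    "Executable": "simulate.sh",
    "Arguments": "--events 1",
    "StdOutput": "sim.out",
    "StdError": "sim.err",
    "InputSandbox": ["simulate.sh", "config.C"],
    "OutputSandbox": ["sim.out", "sim.err"],
}

jdl_text = to_jdl(job)
print(jdl_text)
```

The input and output “sandbox” entries correspond to the files that travel with the job and the files that come back once it completes.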
EDG middleware has been released but…
EDG software (Globus, UI, JDL, WP2, WP3, etc.) contains tens of commands/switches, each with its own logical sequence (“B” after “A”, “C” before “D” and so on).
Browsing Grid VO “directories” (users, RC’s, DB’s, etc.) requires “speaking” LDAP, and tomorrow could require “speaking” SQL.
“User gridification” is a tough task for a “rookie”: this does not fit with the claim that “grids” are for everybody and that grid computing will be as easy as surfing the Internet.
Furthermore, all this holds for DataGrid. What will happen when other grids’ software (especially UI’s) comes along (PPDG, iVDGL, etc.)? Will users have to learn tens of “grid dialects”?
Today “grid computing” is a rather complicated experience, only possible at selected machines (UI’s): this does not fit with the claim that one could do “grid computing” even from a PDA.
Is there any way to set up a “user-friendly” grid?
A web portal: why and how?
It can be accessed from everywhere and by “everything” (desktop, laptop, PDA, WAP phone).
It can keep the same user interface to several back-ends (grid “dialects”, i.e. command-line UI’s).
It must be redundantly “secure” at all levels: 1) secure for web transactions, 2) secure for user credentials, 3) secure for user authentication, 4) secure at VO level.
All available grid services must be incorporated in a logical way, just “one mouse click away”.
Its layout must be easily understandable and user friendly.
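The idea of one user interface in front of several grid “dialects” can be sketched as a small adapter layer. This is only an illustration of the design principle, not the GENIUS implementation: the class names and the command names (`dialect-a-submit`, `dialect-b`, and so on) are hypothetical, and the sketch returns command lines instead of executing them.

```python
from abc import ABC, abstractmethod


class GridBackend(ABC):
    """One grid "dialect": phrases each portal action as a command line."""

    @abstractmethod
    def submit_command(self, job_file):
        ...

    @abstractmethod
    def status_command(self, job_id):
        ...


class DialectA(GridBackend):
    # Hypothetical command names, for illustration only.
    def submit_command(self, job_file):
        return ["dialect-a-submit", job_file]

    def status_command(self, job_id):
        return ["dialect-a-status", job_id]


class DialectB(GridBackend):
    # A second hypothetical dialect with a different command structure.
    def submit_command(self, job_file):
        return ["dialect-b", "run", "--file", job_file]

    def status_command(self, job_id):
        return ["dialect-b", "query", "--id", job_id]


class Portal:
    """Single user-facing interface; the back-end is selected per request."""

    def __init__(self, backends):
        self.backends = backends

    def submit(self, backend_name, job_file):
        return self.backends[backend_name].submit_command(job_file)

    def status(self, backend_name, job_id):
        return self.backends[backend_name].status_command(job_id)


portal = Portal({"a": DialectA(), "b": DialectB()})
print(portal.submit("a", "job.jdl"))
print(portal.submit("b", "job.jdl"))
```

The user always calls `submit` and `status`; supporting a new grid only means adding one more back-end class.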
GENIUS®
(Grid Enabled web eNvironment for
site Independent User job Submission)
[https://genius.ct.infn.it]
[Figure: layered architecture, top to bottom: GENIUS web portal; applications’ specific layer (ALICE, ATLAS, CMS, LHCb, other apps); DataGRID architecture (high level GRID middleware); GLOBUS toolkit (basic services); OS & Net services]
GENIUS: how it works
[Figure: 3-tier model. A WEB Browser on the local WS talks via https+java/xml+rfb to GENIUS (Apache, EnginFrame) on the EDG UI, which reaches the Grid via EDG+GSI]
GENIUS show: the main page
GENIUS show: the authentication
GENIUS show: file services
GENIUS show: the authorization
GENIUS show: job submission
GENIUS show: job queue
GENIUS show: job output
GENIUS show: job data
GENIUS show: personal spooler
GENIUS show: interactive analysis
Present status and perspectives
Current implementation of GENIUS already includes:
• secure web transactions, user authentication and authorization;
• remote interaction with the user’s file system;
• interfaces for job submission/control, to VO servers (users’ and catalogues), and to monitoring systems;
• persistent (user’s) book-keeping and spooler system;
• interactive analysis!
• rpm available!
Todo:
• multi-jobs (parallel and sequential);
• interface to data management and other grid services;
• more application-specific customizations;
• web-guided creation of a work flow system.
GENIUS vs. other grid portals
It is not a toolkit. It is a complete production-ready
environment which combines the concepts of “user portal”
and “science portal”.
No client software needs to be installed apart from the
web browser. GENIUS can be accessed from everywhere.
No security delegation (“à la MyProxy”) is needed. Access
passwords are securely “streamed” only when needed.
Interactive analysis (via VNC) and web access to personal
spooling areas are possible.
User file system is not limited to input and output files.
EnginFrame modularity makes different customizations easy to implement. Already available for EDG middleware, GLOBUS, and LSF. Under definition for CONDOR (Hungarian grids).
It is compatible with the Tomcat open source Java servlet container available from SUN.
The LHC Computing Grid
Tier-1’s for LHC Experiments
The LHC Computing Grid Project Structure
[Figure: organisation chart of the LHC Computing Grid Project. Boxes: LHCC (Reviews, Reports); Common Computing RRB (Resource Matters); Project Overview Board; Software and Computing Committee (SC2) (Requirements, Monitoring); RTAG; Project Manager; Project Execution Board; GDB; implementation teams. External links: Other Computing Grid Projects, Other HEP Grid Projects, EU DataGrid Project, Other Labs]
LCG Applications Area Projects
Software Process and Infrastructure (operating)
• Librarian, QA, testing, developer tools, documentation, training, …
Persistency Framework (operating)
• POOL hybrid ROOT/relational data store
Mathematical libraries (operating)
• Math and statistics libraries; GSL etc. as NAGC replacement
Core Tools and Services (just launched)
• Foundation and utility libraries, basic framework services, system services, object dictionary and whiteboard, grid enabled services
Physics Interfaces (being initiated)
• Interfaces and tools by which physicists directly use the software. Interactive (distributed) analysis, visualization, grid portals
Simulation (coming soon)
• Geant4, FLUKA, virtual simulation, geometry description & model, …
Generator Services (coming soon)
• Generator librarian, support, tool development
LCG Grid Deployment DRAFT
~April 03 - LCG-1 Global Grid Service
• deploy a sustained 24 X 7 service
• based on “converged” toolkit emerging from GLUE and delivered by Datagrid and iVDGL
• ~10 sites – including sites in Europe, Asia, North America
• > 2 times CERN prototype capacity (= ~1,500 processors, ~1,500 disks)
• permanent service for all experiments
~Oct 03 –
• reliability & performance targets
• wider deployment
This Grid service evolves slowly through 2005
• new middleware – functionality, performance
• grows with more sites and users
• provides continuous service for the LHC experiments
EoI for EU FP6 2002: EGEE
Enabling Grids for E-Science and industry in Europe
[Figure: the EGEE Integrated Project (“Enabling Grids E-Science Europe”) at the centre, surrounded by its activity areas: creation and support of E-Science centres; European Infrastructure; Modulable Testbeds; R&D Agenda (Semantic GRID, Database, Security); Deployment with IT Industry; S/W Hardening (GLOBUS, EuroGrid, Gridlab etc.); National eScience Centres; Science Outreach; Industry Applications; Industry Outreach; Consulting, Prototyping, Deployment; Consulting, Training Courses, Dissemination Forum; SMEs developing Grid-enabled Applications; Tools and Service Development; Applications in Other Sciences; EIROforum]
EGEE key objectives
Integrate Grid technological developments and expertise from
across Europe
Establish a production quality Europe-wide Grid infrastructure
Enable the creation of e-Science applications from across the
science and industry spectrum
Ensure the timely delivery of its programme of work guided by the needs of its users
See http://www.cern.ch/egee for more
Participants from Czech Republic:
• Faculty of Informatics of Masaryk University, Brno (Contact: Ludek Matyska)
• CESNET z.s.p.o., Prague (Contact: Jan Gruntorad)
Important news from EU
Date: Wed, 30 Oct 2002 12:01:49 +0100
From: [email protected]
Subject: RE: Request of information
To: [email protected]
Dear Domenico,
I know, it is even for us in the Commission hard to follow the story...
The current status is the following:
• There will be an invitation for work on Grids in the context of the Research Infrastructures (RI) 1st Call (expected to be announced on 17 December this year, budget 50 MEuro).
• Important in this case: the RI instruments have differences compared to the IST instruments since the first are targeted to support the building of RI.
• In IST: work on Grids will be invited under two mainly action-lines, the IST-RI and the Complex problem solving one.
• Both above action lines are expected to open in mid June 2003.
• I do not have safe information on the budget in the case of IST.
Kyriakos
Indicative roadmap of calls
1. Budget from Structuring the ERA Programme (€200m)
Years 2003-2006; call budgets in slide order: €50m, €100m, €50m
2. Budget from IST (€100m)
Years 2003-2006; call budgets in slide order: €?m, €?m
Conclusions
Computational grids could represent the “natural”
environment for next generation high energy physics
experiments, computational bio-medicine, Earth
observation and many other inter-disciplinary
applications.
“Grid” could be the Internet “new age” where users can
seamlessly and ubiquitously access not only
information but also huge computing resources and
mass storage systems distributed worldwide with their
own applications.
However, in order to turn dreams into reality, grid
access must be easy and intuitive especially for the
vast majority of non-expert users and this is the
mandate of the GENIUS team.