...

Modernization of Statistical f Information systems initiatives

by user

on
Category: Documents
14

views

Report

Comments

Transcript

Modernization of Statistical f Information systems initiatives
Modernization of Statistical
Information
f
systems
Global initiatives
Eric Hermouet, Statistics Division, ESCAP
1
Presentation




Why modernization?
What is modernization?
How: Global initiatives and collaboration mechanisms
Main outputs of those mechanisms
1
Modernization
 Modernization of statistical information systems
 Industrialization of statistics
 Modernization of statistical production and services
The data deluge
The internet had
1800 exabytes of
data in 2011
Exabyte=1018
Or 1 million terabytes
y
2
The data deluge
50 000 exabytes by 2020
Even if only 0.1% of
these data is useful, that
leave millions of
terabytes for possible
statistical purposes
New data providers
 Google
– Real-time price indices
– Public data explorer
– First point of reference for the “data generation”
 Facebook
 Telecom companies
phones in Asia and the Pacific
– 4 billion mobile p
 How can statistical offices access and use those data sources?
3
Changing user expectations
 Expectation of data available at a faster rate
– Real-time
 Ability to customize datasets
– Linking coherently datasets across domains
– With a high degree of detail
 Data presentation addressing different target groups
– Governments
– General public
IT progress
 IT developments offers new solutions
– In processing huge amounts of data
– In accessing new sources of data
 In a more efficient way
– In terms of speed
– In terms of financial costs
4
So what is modernization?




Common generic processes
Common tools
Common methodologies
Recognizing that all statistics are produced in a similar way
– No special domains
y to adapt
 Increased flexibility
– To access new data sources
– To generate new statistical products
5
A shared production environment
From
To
Custom design
Use of information bank and
modular design
Built solutions
Assembled solutions
Design for a collection mode
Source independent design
“designed-in” quality
Working with existing data and
varying level of quality
Survey cycles with clear start and
end points
Continuous approach of ongoing collection, processing,
and release
A shared production environment
From
To
Direct data collection
supplemented with data from
administrative source
Tapping into existing data,
data
using direct data collection to
link sources and bridge gaps
Individually crafted data
structure
Management of data
Use of agreed standard
approaches
Management of data,
metadata, and paradata
Smaller field workforce with
specialized interviewing skills
A large field workforce
6
A shared production environment
From
To
Spending effort and resources
on collection and processing
More emphasis on specifying
needs, design, and analysis
Understanding user needs and Understanding data
designing collection
characteristics and negotiating
instruments
solutions that bridges gaps
between existing data and
user requirements
What is modernization and why is it
needed?
Questions so far ?
14
7
Global initiatives & collaboration mechanisms
15
Global initiatives & collaboration mechanisms
 High-Level Group for the Modernization of Statistical
Production and Services (HLG)
 Formerly HLG-BAS
 Objectives
– To promote common standards, models, tools and
methods to support the modernization of official statistics;
– To drive new developments in the production,
organization and
d products
d
off official
ff
l statistics, ensuring
effective coordination and information sharing within
official statistics, and with relevant external bodies;
8
HLG










Gosse van der Veen (Netherlands) - Chairman
Brian Pink (Australia)
Eduardo Sojo Garza-Aldape (Mexico)
Enrico Giovannini (Italy)
Mr Park Hyungsoo (Republic of Korea)
Irena Križman (Slovenia)
Katherine Wallman ((United States))
Walter Radermacher (Eurostat)
Martine Durand (OECD)
Lidia Bratanova (UNECE)
HLG reference documents
 Strategic vision
– http://www1.unece.org/stat/platform/display/hlgbas/S
http://www1 unece org/stat/platform/display/hlgbas/S
trategic+Vision
 Strategy to implement the vision of the HLG
– http://www1.unece.org/stat/platform/display/hlgbas/
HLG+Strategy
9
Technical groups - MSIS
 Management of Statistical Information Systems (MSIS)
– Objectives:
j
• To provide, through regular meetings and other means such as the
MSIS Wiki, a forum for exchange of experiences and good practices
among information systems managers from national and
international statistical organizations.
• To contribute to the coordination of activities of different national
and international organizations in the area of statistical information
y
systems.
• To facilitate and encourage implementation of international
standards and recommendations in the field of statistical
computing among national and international statistical
organizations.
Technical groups - MSIS
 Annual meetings jointly organized by UNECE, Eurostat,
OECD. And ESCAP for this 2013 MSIS
– MSIS meetings consider issues related to information
technology governance and management, system
architecture, accessibility and usability.
– First meeting in 2000
– Secretariat: UNECE
 Open
O
to all
ll UN countries and
d internationall organizations
 Reports to the Conference of European Statisticians
10
Technical Group - METIS
 Steering Group on Statistical Metadata – METIS
– Objectives:
• Promote
P
t th
the iimplementation
l
t ti off metadata
t d t systems
t
b
by d
developing
l i
advocacy targeting the senior management level and subjectmatter staff of NSOs;
• Oversee the maintenance of the Common Metadata Framework
(CMF) directing it towards a practical guide serving national
statistical offices;
• Facilitate collection, discussion and dissemination of best practices
in the field of statistical metadata
– Established in 1990
– Work sessions every 2-3 years
– Workshops in-between
– Organized with Eurostat / OECD
– Open to all UN member countries and international organizations
Technical Groups - SAB
 Sharing Advisory Board
– Objectives
• To promote harmonization of business and information
systems architectures;
• To support collaboration for the development of
statistical software
• To provide guidelines and tools to assess new statistical
software
f
tools
l and
d components
• To assist in the improvement of the technical statistical
infrastructure of countries both within and outside the
UNECE region as required.
11
Technical groups - others
– Statistical Data editing
– Statistical Data collection
 Not directly overseen by HLG
– Statistical Network
– SDMX Expert Group
– Statistical Open Standards Group
gg
group
p on q
quality
y in statistics
– Working
– ….
Main outputs
 Emerging “statistical industry” standards
– GSBPM
– GSIM
– SDMX
– DDI
12
GSBPM
 Generic Statistical Business Process Model
– To define and describe statistical processes in a coherent
way
– To standardize process terminology
– To compare and benchmark processes within and between
organisations
– To identify synergies between processes
– To inform decisions on systems architectures and
organisation of resources
GSBPM
 Applicability
– All activities undertaken by producers of official statistics
which result in data outputs
– National and international statistical organizations
– Independent of data source, can be used for:
• Surveys / censuses
• Administrative sources / register-based
g
statistics
• Mixed sources
13
GSBPM
 Not a linear model
 Sub-processes do not have to be followed in a strict order
 It is a matrix, through which there are many possible paths,
including iterative loops within and between phases
 Some iterations of a regular process may skip certain subprocesses
14
GSBPM
 Examples of use
– Harmonizing statistical computing systems
– Facilitating sharing of statistical software
– Framework for process quality management
– Structure for storage of documents
– Measuring operational costs
15
GSIM
 GSIM contains objects which
specify information about the
real world – 'information objects‘
– Examples include data and
metadata (such as
classifications) as well as the
rules and parameters needed
for production processes to
run (for example, data
editing rules).
rules)
 GSIM identifies around 150
information objects, which are
grouped into four top-level
groups
GSIM
 Generic Statistical Information Model
 To describe data and metadata objects and flows within the
statistical business process
 Implementation of GSIM, in combination with GSBPM, allow
– Creating an environment to prepare for reuse and sharing
of methods, components and processes;
– Implementing rule based process control, thus minimizing
h
human
intervention in the
h production
d
process;
– Economies of scale through development of common tools
by the community of statistical organizations.
16
SDMX and DDi
 Standard Data and Metadata eXchange
– UNSD/DFID project in Cambodia
Cambodia, Laos
Laos, Thailand
Thailand, Viet
Nam
 Data Documentation Initiatives
– World Bank Microdata Management Toolkit
SDMX and DDI
17
What’s next?
 Launch of “Plug and Play” project – February 2013
– To create a common statistical production architecture
– To create a standardized architecture for statistical
production solutions, including processes, information and
systems, coherent with the Generic Statistical Business
Process Model (GSBPM) and the Generic Statistical
Information Model (GSIM),
– To enable and advance the sharing of production processes
or components,
t th
thus reducing
d i costs.
t
– To provide the basis for a central inventory or repository
with life cycle management of sharable production
processes and components.
18
Fly UP