Docsity
Docsity

Prepare for your exams
Prepare for your exams

Study with the several resources on Docsity


Earn points to download
Earn points to download

Earn points by helping other students or get them with a premium plan


Guidelines and tips
Guidelines and tips

Project proposal for Data Science Lab, Slides of Statistics

A virtual space to collaborate on data science projects where the parties can share efforts to develop solutions, training sets, provide ...

Typology: Slides

2021/2022

Uploaded on 08/05/2022

dirk88
dirk88 🇧🇪

4.5

(206)

3.2K documents

1 / 3

Toggle sidebar

Related documents


Partial preview of the text

Download Project proposal for Data Science Lab and more Slides Statistics in PDF only on Docsity! UNECE High-level Group for the Modernisation of Official Statistics Project Proposal: Data Science Lab Prepared by the Blue Skies Thinking Network / Andrew O’Sullivan, Barteld Braaksma, Juan Muñoz 1 Purpose To create an expert group and a virtual collaboration community to facilitate the analysis of data, the support for emergent sources of information, and the introduction of new technologies to modernise the statistical processes and the products and services derived. 2 Project description The main goal of the project is to create an expert working group supported by a virtual collaborative Internet platform to develop, integrate, and share knowledge about emergent sources of information and new technologies to improve the production and outputs (products and services) of official statistics. The project outputs will be: 1. An experts group to share knowledge, practices and efforts to promote the development of methodologies, techniques and technologies to improve the statistical processes, products, and services based on the out-comings from the data science fields and the integration of traditional and non- traditional statistical and geographical data sources. 2. A systematized library of research publications, documented experiences, data resources, video tutorials and algorithms developed in collaboration by the statistical community. 3. A virtual space to collaborate on data science projects where the parties can share efforts to develop solutions, training sets, provide feedback, etc. 4. A virtual community. Its initial configuration can be implemented as a special section of the WikiStats using the collaboration tools already in use by the HLG-MOS. In a more evolved model, this community might grow to become a federated network sharing hardware and software resources from participating institutions and from other efforts made by other international organisations like UNSD and Eurostat. 3 Alternatives considered 1. UNSD Global Platform. The project of the Global Platform coordinated by UNSD has similar goals and when developed might have connections for the exchange and to establish collaborative projects on a broader way. 2. Doing nothing. If the project does not go on there will still be opportunities to collaborate but these efforts will be still being something disperse and information will still being difficult to find. 4 Expected Benefits ☒ Reduced costs ☒ Increased efficiency ☐ Reduced risks ☒ New capabilities to meet user needs 5 Which key priorities in the HLG-MOS Strategic Framework does the proposed project relate to? ☒ Take cost out of our organizations to reinvest in more value-added areas ☒ Explore new areas collectively and leverage each other’s' research investments in specific areas ☐ Provide whole of government data ecosystems based on international standards, for better estimates in key policy areas ☐ Renew our governance and operating processes Justification: Statistical Offices have already made some investments to develop knowledge, methodologies and technologies in the field of data science to improve its process and the products and services they offer. Under the HLG-MOS umbrella, the statistical community have been working in some related projects to explore the Use of Big Data for the production of statistics, the integration of different traditional and non-traditional sources of statistics, the big data Sandbox, and more recently, the machine learning development project. Although those projects have already delivered some value, the real potential of them has not been reached. The need still exists and there are many more applications to apply that may drive to modernise the official statistics. 6 How does the proposed project relate to other activities under the HLG-MOS? This project will consolidate and provide continuity to efforts that the HLG-MOS have been developing in the following areas: • Use of Big Data for the production of statistics • Integration of different sources of statistics and data • The Sandbox • The machine learning development project (HLG-MOS ML Project) As some of the products of the project will have some software as output, it might be considered as related to the efforts made by the Sharing Tools Group. However, as the scope is broader, the project is considered as the foundation of an evolution of this group with the participation not only of ICT people but also of statisticians and Data Scientist on formation in the different statistical offices. The project intention is not to replace other efforts like the one made by UNSD neither Eurostat’s one but to develop the field to improve the modernisation of official statistics while creating links to conform a federated experts network and share results potentializing the value of the outputs from different projects. 7 Proposed timetable The first stage of the project will be to constitute the group and to establish the structure of the community, this task may take the first quarter of 2020. During the rest of the year, the working group will be in charge of developing a more in-depth working program, organising the collaborative working space, and compiling the existing information to constitute the data-science library.
Docsity logo



Copyright © 2024 Ladybird Srl - Via Leonardo da Vinci 16, 10126, Torino, Italy - VAT 10816460017 - All rights reserved