Jump to ratings and reviews
Rate this book

Training Students to Extract Value from Big Data: Summary of a Workshop

Rate this book
As the availability of high-throughput data-collection technologies, such as information-sensing mobile devices, remote sensing, internet log records, and wireless sensor networks has grown, science, engineering, and business have rapidly transitioned from striving to develop information from scant data to a situation in which the challenge is now that the amount of information exceeds a human's ability to examine, let alone absorb, it. Data sets are increasingly complex, and this potentially increases the problems associated with such concerns as missing information and other quality concerns, data heterogeneity, and differing data formats. The nation's ability to make use of data depends heavily on the availability of a workforce that is properly trained and ready to tackle high-need areas. Training students to be capable in exploiting big data requires experience with statistical analysis, machine learning, and computational infrastructure that permits the real problems associated with massive data to be revealed and, ultimately, addressed. Analysis of big data requires cross-disciplinary skills, including the ability to make modeling decisions while balancing trade-offs between optimization and approximation, all while being attentive to useful metrics and system robustness. To develop those skills in students, it is important to identify whom to teach, that is, the educational background, experience, and characteristics of a prospective data-science student; what to teach, that is, the technical and practical content that should be taught to the student; and how to teach, that is, the structure and organization of a data-science program. Training Students to Extract Value from Big Data summarizes a workshop convened in April 2014 by the National Research Council's Committee on Applied and Theoretical Statistics to explore how best to train students to use big data. The workshop explored the need for training and curricula and coursework that should be included. One impetus for the workshop was the current fragmented view of what is meant by analysis of big data, data analytics, or data science. New graduate programs are introduced regularly, and they have their own notions of what is meant by those terms and, most important, of what students need to know to be proficient in data-intensive work. This report provides a variety of perspectives about those elements and about their integration into courses and curricula.

66 pages, Paperback

First published December 31, 2014

1 person want to read

About the author

National Research Council

6,246 books43 followers
The National Research Council (NRC) functions under the auspices of the National Academy of Sciences (NAS), the National Academy of Engineering (NAE), and the Institute of Medicine (IOM). The NAS, NAE, IOM, and NRC are part of a private, nonprofit institution that provides science, technology and health policy advice under a congressional charter signed by President Abraham Lincoln that was originally granted to the NAS in 1863. Under this charter, the NRC was established in 1916, the NAE in 1964, and the IOM in 1970. The four organizations are collectively referred to as the National Academies.

The mission of the NRC is to improve government decision making and public policy, increase public education and understanding, and promote the acquisition and dissemination of knowledge in matters involving science, engineering, technology, and health. The institution takes this charge seriously and works to inform policies and actions that have the power to improve the lives of people in the U.S. and around the world.

The NRC is committed to providing elected leaders, policy makers, and the public with expert advice based on sound scientific evidence. The NRC does not receive direct federal appropriations for its work. Individual projects are funded by federal agencies, foundations, other governmental and private sources, and the institution’s endowment. The work is made possible by 6,000 of the world’s top scientists, engineers, and other professionals who volunteer their time without compensation to serve on committees and participate in activities. The NRC is administered jointly by the NAS, NAE, and the IOM through the NRC Governing Board.

The core services involve collecting, analyzing, and sharing information and knowledge. The independence of the institution, combined with its unique ability to convene experts, allows it to be responsive to a host of requests.

The portfolio of activities includes:

* Consensus Studies: These comprehensive reports focus on major policy issues and provide recommendations for solving complex problems.
* Expert Meetings and Workshops: By convening symposia, workshops, meetings, and roundtables, the NRC connects professionals as well as the interested public and stimulates dialogue on diverse matters.
* Program and Research Management: At the request of state and federal agencies, the NRC manages and evaluates research programs, conducts program assessments, and reviews proposals.
* Fellowships: The NRC administers several postdoctoral fellowship programs.

Free Scientific Information: Publishing more than 200 reports and related publications each year, the institution is one of the largest providers of free scientific and technical information in the world. Most of it is now on the Web at www.nap.edu.

Ratings & Reviews

What do you think?
Rate this book

Friends & Following

Create a free account to discover what your friends think of this book!

Community Reviews

5 stars
0 (0%)
4 stars
0 (0%)
3 stars
0 (0%)
2 stars
0 (0%)
1 star
0 (0%)
No one has reviewed this book yet.

Can't find what you're looking for?

Get help and learn more about the design.