
Big Data (BD SSG)
Big Data is a term applied to data sets whose size is beyond the ability of commonly used software tools to capture, manage, and process the data within a tolerable elapsed time. Big data sizes are a constantly moving target currently ranging from a few dozen terabytes to many petabytes of data in a single data set. – Wikipedia, May 2011
The Big Data Senior Steering Group (BD SSG) has been formed to identify current big data research and development activities across the Federal government, offer opportunities for coordination, and begin to identify what the goal of a national initiative in this area would look like. As data volumes grow exponentially, so does the concern over data preservation, access, dissemination, and usability. Research into areas such as automated analysis techniques, data mining, machine learning, privacy, and database interoperability are underway at many agencies and will help identify how big data can enable science in new ways and at new levels. The science of data includes the processes of turning data into knowledge, data mining and visualization, interoperability, search and discovery, and semantics.
Scope
BD SSG was formed to identify programs across the Federal government and bring together experts to help define a potential national initiative in this area. BD SSG has been asked to identify current technology projects as well as educational offerings, competitions, and funding mechanisms that take advantage of innovation in the private sector.
Functions
Current functions and activities include:
- Collecting information on current activities across the Federal Government.
- Creating a high-level vision of the goals of a potential national initiative.
- Developing the appropriate documents and descriptions to aid discussion within the government, and where appropriate, the private sector.
- Developing implementation strategies that leverage current investments and resources.














