The Berkeley Data Cloud (BDC) is a data management system developed by nuclear security and computer science researchers at LBNL in support of the DNN R&D Venture, Multi-Informatics for Nuclear Operations Scenarios (MINOS). In addition to MINOS, the BDC software now manages unclassified data for a variety of R&D efforts, hosting over 1 PB of data, most of which was generated through research activities supported by DNN R&D. End users can download data directly from the BDC service either through the web interface or by making application programming interface (API) calls to its RESTful and/or Python APIs. In recent years, LBNL has added the capability for users to pull data directly into their cloud environments through services such as Globus or Amazon Web Services and to use Jupyter notebooks hosted on the BDC server. Access to data on the BDC is managed on a project-by-project basis according to the preferences of the data owners, who can view statistics on BDC queries and retrievals of their data. All relevant data for this project will be curated on the BDC to enable future use by the community.
Researchers may use the BDC (bdc.lbl.gov) to access data shared by other researchers or to host their own data.
