
BD2K

DIBBs Whole Tale

Scholarly publications today remain largely disconnected from the underlying data and code used to produce the published results and findings, despite growing recognition of the need to share all aspects of the research process. As data become more open and transportable, a second layer of research output has emerged: research publications linked to the associated data, possibly along with its provenance. This trend is now being followed by a third layer: communicating the process of inquiry itself by sharing a complete computational narrative that links method descriptions with executable code and data, thereby ushering in a new era of reproducible science and accelerated knowledge discovery. In the Whole Tale (WT) project, all of these components are linked and accessible from scholarly publications. The third layer is broad, encompassing numerous research communities through science pathways (e.g., in astronomy, the life and earth sciences, materials science, and social science), and deep, building on interconnected cyberinfrastructure pathways and shared technologies. The goal of this project is to strengthen the second layer of research output and to build a robust third layer that integrates all parts of the story, conveying the holistic experience of reproducible scientific inquiry by (1) exposing existing cyberinfrastructure through popular frontends, e.g., digital notebooks (IPython/Jupyter), traditional scripting environments, and workflow systems; (2) developing the necessary 'software glue' for seamless access to different backend capabilities, including those from DataNet federations and Data Infrastructure Building Blocks (DIBBs) projects; and (3) enhancing the complete data-to-publication lifecycle by empowering scientists to create computational narratives in their usual programming environments, enhanced with new capabilities from the underlying cyberinfrastructure (e.g., identity management, advanced data access and provenance APIs, and Digital Object Identifier-based data publication). The technologies and interfaces will be developed and stress-tested using a diverse set of data types, technical frameworks, and early adopters across a range of science domains.

Materials Data Facility Resources

The Materials Data Facility (MDF) is a collaboration between Globus at the University of Chicago, the National Center for Supercomputing Applications (NCSA) at the University of Illinois at Urbana-Champaign, and the Center for Hierarchical Materials Design (CHiMaD), a NIST-funded center of excellence.

MDF is developing key data services for materials researchers with the goal of promoting open data sharing, simplifying data publication and curation workflows, encouraging data reuse, and providing powerful data discovery interfaces for data of all sizes and sources. Specifically, MDF services will allow individual researchers and institutions to: 1) publish large research datasets under flexible policies; 2) publish data directly from local storage, institutional data stores, or cloud storage, without relying on third-party publishers; 3) build extensible domain-specific metadata schemas and automated metadata-ingestion scripts for key data types; 4) develop publication workflows; 5) register a variety of resources for broader community discovery; and 6) use a discovery model to search, interrogate, and build upon existing published data.
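As a concrete illustration of the discovery model in point 6, the following is a minimal sketch of a programmatic search, assuming the MDF Forge Python client (the mdf_forge package); the query string, dataset name, and record fields used here are illustrative assumptions, not guaranteed MDF content.

    # Minimal sketch of programmatic discovery against MDF, assuming the
    # mdf_forge client (pip install mdf_forge); the first call prompts for
    # a Globus login. Query terms and the dataset name are illustrative.
    from mdf_forge.forge import Forge

    mdf = Forge()

    # Free-text search across published MDF records, capped for the example.
    results = mdf.search("band gap", limit=5)

    # Field-restricted query: only records from one (hypothetical) dataset.
    records = mdf.match_field("mdf.source_name", "example_dataset").search(limit=5)

    for record in records:
        # Each MDF record carries a core "mdf" metadata block.
        print(record["mdf"].get("title", "(untitled)"))

In this client, additional match_field calls can be chained before a single search, which is what makes the discovery model composable across metadata fields.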

Midwest Big Data Hub

Plants in Silico

TERRA
