The Provenance group will address the issue of protecting the reliability of data as it moves through the cyberinfrastructure as well as its provenance: this is critical for both data providers (who need credit for the work they’ve done and the academic contribution of collecting, curating and annotating data) and the data users (who need to know where the data has come from so they can form an opinion of how much credence to give it and how to give proper credit to the originator of the data). Furthermore, as one person’s analysis is encoded in annotation it becomes the next person’s data, so the provenance and reliability mechanisms need to scale to multiple layers of annotation over one original data set.