Welcome – Viv Hutchison, Cassandra Ladino
Data Management Theme: Process and Analyze
Presentation: Capturing your processing and analysis workflow in R – Alison Appling
Abstract: The R language is a powerful tool for data analysis, modeling, and statistics, and its use is continuing to grow in the USGS and the broader scientific community. In the last few years, several new R packages have made R even more suitable for ushering data all the way from collection through to publication. In this talk we will describe and demonstrate some of the most useful new tools for capturing your R workflow in code and diagrams, scripting your data transfers to and from Amazon S3 or Google Drive, scaling your analyses from a few sites or models to hundreds or thousands, and preparing data and metadata for publication. We'll also briefly describe our current efforts to take these tools one step further by integrating them with one another and with high-throughput computing (HTC) to create reproducible, collaborative, powerful, and manageable systems for data processing and analysis.
News from the field:
Data management-related updates, challenges, questions, announcements, ideas, etc. – Open discussion for input from all participants… (ALL)