At our last virtual monthly meeting on August 8, 2018, we heard about the upcoming Community for Data Integration Request for Proposals, opportunities for group learning about Git, and recent data-related activities at the USGS Office of Enterprise Information.
CDI sponsor Kevin T. Gallagher thanked the current CDI Project Teams that shared their progress at the Summer Earth Science Information Partners (ESIP) meeting.
Participants at the CDI Session at the ESIP Summer Meeting in July 2018: Supporting integrated and predictive science: Community for Data Integration focus on risk assessment.
Kevin also reminded us that the next CDI Request for Proposals will be happening soon, hopefully September! Like last year, Kevin and Tim Quinn will select a theme or themes for us to work on together as a community. Kevin stressed that the CDI is aiming to help develop the capacity of the entire USGS for data integration and management through this proposals process, and therefore we should always keep an eye out for project outcomes may be relevant to our own work. Selecting projects with wide applicability is a priority. Stay tuned for more info.
I announced that in the next month, I will complete the DataCamp Git for Data Science module, and I invite anyone interested to join me. Announcing this goal to the entire CDI is the best shot I have at actually doing it. This is a new CDI experiment in group learning. Read more at this wiki page and sign up to get weekly reminders and updates on my progress - this way we will complete the 4 hour module together.
Tim Quinn (Chief, USGS Office of Enterprise Information) and Nancy Sternberg (Senior Advisor for Strategic Planning, USGS OEI) presented us their vision of Component Architecture for Integrated Science and some recent workshops that have helped them to identify next steps for implementation. The OEI oversees a tremendous amount of data and hardware in the USGS, and is working hard to modernize the entire system. In the past year, they have held workshops on High Performance Computing and High Throughput Computing, Data Storage, and Sensor Networks, to guide their plans. They also gave us insight into the data storage strategy for the next 5 years and beyond. Tim, Nancy, and Paul Exter welcomed feedback from the community on their presentation.
Insights into a strategy for storing our increasing data volumes.
Meeting recording and slides are posted on the August 2018 Meeting page.
In July, the Metadata Reviewers Community took a look at the newly proposed metadata requirements for the old "legacy" data sets that have been traditionally released on USGS web sites. The USGS Web Re-engineering Team ("WRET") is planning on using these metadata to display legacy data. Lisa Zolly introduced the topic and led the discussion.
July’s theme was Metadata! Ben Wheeler gave a brief update on Science Data Catalog (SDC), an important part of the USGS Public Access Plan.
Colin Talbert provided an overview and demo of the Metadata Wizard (version 2.0!), a tool for creating robust metadata.
JC Nelson led a discussion on data release issues, and recent developments in guidelines for data sharing agreements and software release policies and issues. Pete Ruhl is currently compiling examples of eDNA (environmental DNA)-related data releases. He can be contacted at firstname.lastname@example.org.
Visit the eDNA wiki page.
The group began discussions on some upcoming federal activities related to crowdsourcing, citizen science, and prizes & challenge competitions as well as upcoming meetings in DC related to crowdsourcing and citizen science. The group will discuss the same topic at next month's meeting on Wednesday, August 15 with more updated materials.
A report to Congress about federal crowdsourcing and citizen science activities is due at the end of 2018 this winter. This report is required by the American Innovation and Competitiveness Act (of which the Citizen Science and Crowdsourcing Act is a component).
A draft of the Form for Collecting CCS Projects from the White House Office of Science and Technology Policy (OSTP) and the Science and Technology Policy Institute (STPI).
For more information, contact Sophia Liu, email@example.com.
Carl Schroedl from the USGS Water Mission Area gave us an in-depth look at the Git Fork and Feature Branch Workflow. This method works well for his group, which incorporates code reviews in their workflow.
Read more at this blog post on using the Fork-and-Branch Git Workflow
There are infinite possible workflows in Git, adjust it, make it work for you, Atlassian is a good resource to learn about different workflows https://www.atlassian.com/git/tutorials/comparing-workflows
If this workflow seemed too complicated for your purposes, you could simplify by not doing branches under a fork, or possibly not doing forks, but this takes away from the motivation and the benefits when getting code reviews.
Q: What about branch naming conventions? A: You can use an issue code/identifier. This can help you trace back to details on the issue or motivation for the code changes.
At the Community for Data Integration July 11, 2018 monthly meeting, we heard about two programs in the USGS, the STEP-UP program and the Cloud Hosting Solutions program.
Chris Hammond told us about the STEP-UP (Secondary Transition to Employment Program - USGS Partnership) program. STEP-UP provides employment training to young adults (ages 18-22) with cognitive and other disabilities. Despite these disabilities, they may be highly competent at certain tasks, for example data preparation tasks. The overview will explained how the program works and described several success stories. The CDI has recently heard from a number of research groups that are looking for solutions to migrating legacy data or websites, we introduce the STEP-UP program as a possible solution to investigate further. To learn more, get in touch with Chris at firstname.lastname@example.org.
We also heard an update on the latest services provided by USGS Cloud Hosting Solutions (CHS) from Jennifer Erxleben and Harry House. Cloud Hosting Solutions (CHS) is the required, supported, secure Cloud offering for USGS Science Centers and mission programs. Jennifer and Harry told us about CHS managed services, CHS custom services, and the sandbox environment. They also went over some example projects and costs. After the presentation, we had an active 30-minute long Q&A session, which is documented on the meeting page.
The best way to get started with CHS is to email email@example.com.
Things learned in the writing of this blog post: What is SNP, how do you pronounce SNP, and what USGS research involves SNPs? What is cloud.gov? Where can I find a up-to-date list of USGS science centers that are used in ScienceBase and the USGS Science Data Catalog?
Ray Obuch provided an overview of the new Department of the Interior Metadata Implementation Guide, available at https://doi.org/10.3133/tm16A1.
The group also examined the proposed FAIR metrics, mentioned at a recent CDI Monthly Meeting, which have a lot to do with metadata. (FAIR stands for findable, accessible, interoperable, reusable.)
Peter Schweitzer shared this link about a similar but different way of thinking about the problem, "5 Star Open Data," https://5stardata.info/en/.
At the DevOps Project Management Sync meeting, topics included
An update from USGS Cloud Hosting Solutions (CHS)
An update on the USGS Software Management website, which is under development (Cassandra Ladino, USGS)
A world wind tour of cloud.gov and the default DevOps pipeline to deploy applications to it (Andrew Burnes, 18F). Cloud.gov is a secure, fully compliant Platform as a Service (PaaS), built specifically for government work. Find out more at What is cloud.gov?
At the DevOps SysAd/Dev Sync, Dan Pilone of Element84 presented on Supporting NASA’s Earth Observing System Data and Information System (EOSDIS). “NASA's Earth Observing System Data and Information System (EOSDIS) is working towards a vision of a cloud-based, highly-flexible, ingest, archive, management, and distribution system for its ever-growing and evolving data holdings. This effort is emerging from its prototype stages and is poised to make a huge impact on how NASA manages and disseminates its nearly 30PBs Earth science data as that grows to over 300PBs in the coming years. This talk outlines the motivation for this work, presents the achievements and hurdles of the past 18 months and charts a course for the future expansion of NASA’s cloud based EOSDIS.”
Drew Ignizio presented on the data source list that is used in ScienceBase and the USGS Science Data Catalog. One way this data source list is used is to attribute official USGS data releases to their related USGS science center (the data source). As USGS science centers merge or otherwise change name, having an up-to-date authoritative list is important, not just for ScienceBase and Science Data Catalog, but for linking many other systems in the USGS. The list that Drew previewed during the talk can be accessed at https://www.sciencebase.gov/directory/organizations?displayHints=SDC_List.
Ray Obuch (USGS) provided an overview of the new Department of the Interior Metadata Implementation Guide, available at https://doi.org/10.3133/tm16A1.
Tim Crone of Lamont-Doherty Earth Observatory presented on Analysis of Massive Underwater Video Data in the Cloud using Pangeo.
Summary: An open-source environment for parallel analysis of massive (100TB) image data in the Cloud is now available via the Pangeo environment, which allows you to apply the power of the Python ecosystem from your browser. Technologies include JupyterHub, Kubernetes, Docker, and Dask distributed. Learn more about Pangeo at https://pangeo-data.github.io.
The Bioinformatics Group conducted a survey and had a discussion to gauge interest in various genome analysis topics and outlets for technical exchange. There was interest in analyzing SNP datasets and sharing knowledge about current SNP projects. See meeting notes here.
Perhaps you are wondering, “What is an SNP, and how does USGS use SNPs?”
SNP stands for single nucleotide polymorphism, and is pronounced “snips.”
An SNP is a variation in a single nucleotide that occurs at a specific position in the genome, where each variation is present to some appreciable degree within a population (e.g. > 1%). (https://en.wikipedia.org/wiki/Single-nucleotide_polymorphism).
SNPs can generate biological variation between two members of a species. Those differences can in turn influence a variety of traits such as appearance, disease susceptibility or response to drugs. (https://www.23andme.com/gen101/snps/)
USGS studies SNPs of many species, a quick Pubs Warehouse search returns recent studies on steelhead trout, wolves, prairie falcon, fungus, salmonid, and the Florida panther. See more at https://pubs.er.usgs.gov/search?q=SNPs
SNPs explanation from https://www.23andme.com/gen101/snps/
The USGS has a large and active community using Geographic Information Systems tools, including Esri ArcGIS, QGIS, Python, R, gdal, and many others. Recently, the CDI has become more involved in co-hosting presentations and activities to discuss GIS technology, enterprise solutions, and challenges.
Shane Wright, Roland Viger, and Andy Lamotte are a few of the USGS folks that are helping to coordinate this community. (Thanks!)
After gauging community interest last winter, the CDI recently helped to host a two-part series on ArcGIS Pro.
James Sill from Esri demonstrated capabilities of the new ArcGIS Pro in two presentations.
The CDI is also happy to help promote the meetings of the Alaska GIS and Data Science Webinar (contact: Evan Thoms).
You can access further information, presentation recordings, and the GIS Forum at the CDI GIS Focus Group wiki page.
June’s Python for Data Management Training Series reached over 380 participants, addressing the topics of Working with Local Files, Batch Creating and Updating Metadata, and Automation with PySB (python tools for working with ScienceBase).
Drew Ignizio and Madison Langseth of Core Science Analytics, Synthesis, and Library, led the three 1.5 hour sessions for participants of varying levels of Python experience. The training made use of the Jupyter notebook and Python bundle that ships with the USGS Metadata Wizard 2.0. Especially helpful were the example Jupyter notebooks and data files that were supplied as course resources, allowing participants to execute code in real time, and have a copy of the code for future modification and use.
I attended the first two sessions and am getting ready to watch the recording of the third session, and I know I am not alone in telling Drew and Madison: Thanks! Great job! Very helpful! Good work! YOU WERE AMAZING! (Because those are all direct quotes from the feedback form.)
After these short sessions, I feel that I have the knowledge I need to get started with Jupyter notebooks for my own purposes.
If you missed them, you can download the course resources and watch the recordings on the course wiki site: Python for Data Management.
Behind the scenes with our excellent instructors for the Python for Data Management Training Series:
At the June 13, 2018 Monthly Meeting, CDI had a data visualization extravaganza.
Jordan Read, the chief of the Water Mission Area’s Data Science Branch, spoke on “Amplifying USGS science with timely and digestible data visualizations.” Data show that web visualizations can reach a far larger audience than formal reports. Jordan gave us a view into the process of creating time-sensitive visualizations, such as those that address incoming hurricanes. He also noted that methods that allow reproducibility are key for efficiency, and that there are benefits of communicating more frequently between different mission areas in the USGS about data visualization techniques and projects.
A team from the USGS Western Geographic Science Center presented on “Data visualization for science: comparing 3 dashboard building software packages.” Despite an ill-timed power outage at the Menlo office, Kevin Henry and Jason Sherba (and Jeff Peters in spirit) told us about their experiences with Tableau, ArcGIS Online, and PowerBI in visualizing data for hazard exposure analysis. They told us about the pros and cons, summarized in a slide that may “live on in infamy” (shown below). They stressed that the best platform will depend on your specific case, and encouraged further sharing of people’s experiences with data visualization platforms.
Other news from the monthly meeting:
To help the process of formalizing USGS data sharing agreement guidelines, JC Nelson asked for your examples of when you needed to sign data sharing agreements with another agency. Contribute here.
We polled you on tracks and topics for the 2019 CDI Workshop (June 4-7, 2019 in Boulder, CO). Top three tracks (voted by the CDI distribution list): Data visualization, data management, and data science! See more results at the monthly meeting page.
See the full meeting notes, slides, and recording at the monthly meeting page.
Disclaimer: Any use of trade, product, or firm names in this publication is for descriptive purposes only and does not imply endorsement by the U.S. Government.
USGS DevOps 5/1/2018
If you are interested in updates on Git and different task management systems like Remedy and JIRA, this is the group to watch. Eric Martinez from USGS kept us up to date with Git and two presenters from Tasktop (a provider of software integration solutions) presented at the May 1st meeting about integrating USGS IT and System Development teams by sharing information from their ticketing systems. DevOps wiki space for more information.
Metadata Reviewers Community of Practice 5/7/2018
The group discussed the Genetics Guide to Data Release and Associated Data Dictionary, which was compiled by Barbara Pierson at the Alaska Science Center. Metadata Reviewers Meetings page for more information.
Semantic Web Working Group 5/10/2018
The group shared news and ideas, such as using the new Quality Management System (QMS) for USGS as a good example for which to start developing a data dictionary database, a blog post about the Semantic Web by Ken Bagstad, and a competency framework for professional development in the use of linked data at http://explore.dublincore.net/. Semantic Web working group meetings page.
Tech Stack Working Group 5/10/2018
What’s new with the Network Common Data Form - Climate and Forecast?
"NetCDF-CF Advances - Simple Geometries, Swaths, and Groups" was the May topic for the Tech Stack group. Speakers were Dave Blodgett (USGS), Tim Whiteaker (UT Austin), Aleksander Jelanek (HDF Group) and Daniel Lee (EUMETSAT). Simple geometry (points, lines, and polygons) has now been accepted as part of the Open Geospatial Consortium’s NetCDF-CF specification. This a major enhancement to a widely used standard whose utility has previously been limited to time-series of point or (raster) coverage data only. Advances on Groups and Swaths will also be presented. Exciting! Tech Stack and ESIP Tech Dive meeting page.
Data Management Working Group 5/14/2018
Data management to support integrative, FAIR, multidisciplinary modeling: Lessons from the last decade and paths forward - Ken Bagstad, USGS.
Since 2007, the Artificial Intelligence for Ecosystem Services (ARIES) project has been developing an open-source software package, modeling language, and data repository to enable integrated, multidisciplinary environmental and Earth systems modeling (more details here http://www.integratedmodelling.org/). Data Management Working Group May Meeting page.
Software Development Cluster 5/31/2018
Software Licensing Aspects, Leon Foks, USGS
There are many ways to license software as open source. Using his own example of software developed for USGS, Leon Foks walked us through what he learned about placing proper licenses on USGS-produced software that may have some special considerations. It is much more interesting than just “Use CC0”. USGS software products must be in the public domain, meaning that copyright is waived. However it is best practice to apply an approved Open Source license to the software. Usually it is advised that we apply the CC0 license. However, we learned that in some cases, it is necessary and possible to release code with a dual license, with different licenses (e.g., CC0, MIT, BSD, LGPL, GPL) applying to different parts of the code. (You would explain the intricacies of the licensing in the README.md). Maybe I’m weird, but this presentation blew my mind. See the recording on the Software Development Cluster meetings page, and those with access to the USGS GitLab instance can view an .ipynb at https://code.usgs.gov/nfoks1/Software_Licensing.
At the May 9, 2018 CDI Monthly Meeting, we heard from three FY17 CDI Funded Projects:
(Brian Reichert, Fort Collins Science Center and Becca Scully, PNAMP). NABat database (North American Bat database), NABat web portal and MonitoringResources.org are now linked with two way APIs, which will help collaborators to coordinate their sampling efforts.
(Mark Wiltermuth, Northern Prairie Wildlife Research Center). The ScienceCache application has been extended to use a more flexible data model, allowing more types of mobile app data collection. Testing phase will come later this year, contact Mark Wiltermuth (firstname.lastname@example.org) if interested.
(John Young, Leetown Science Center). A project on deriving vegetation metrics from lidar data provides 10m and 25m products for use by stakeholders such as staff at Shenandoah National Park.
A little bit more information about USGS lidar in the cloud from Jason Stoker:
There was a question from someone about the status of lidar in the cloud. We are in the process of replicating all lidar point cloud data that we serve on FTP in S3 as well. Our plan is to have all ~10 Trillion lidar points (110+ TB) that we have in archive on S3 by the end of the year. I believe we have a little more than half out there now, and growing.
Due to the large volumes of data (and small budgets) we only provide lidar point cloud data as a 'requester pays' option in S3. It is still free to access via FTP
A web page with instructions can be found here:
We also announced to save the date for the next CDI Workshop: June 4-7, 2019 in Boulder CO!
See the meeting recording, slides, and more details at the May Monthly Meeting Page!
Highlights from the last month of CDI Collaboration Area activity:
The group took a look at the Genetics Guide to Data Release and Associated Data Dictionary, which was spearheaded by Bobbi Pierson, Alaska Science Center Geneticist, and the Genetics Metadata Working Group. They found it to be a great resource for those that need to author genetics metadata under USGS guidelines. More meeting information.
In April, the DevOps Project Manager Sync had several topics:
Update from USGS Cloud Hosting Solutions
Software management website update (Cassandra Ladino)
Zero Trust Networking: What is it? (internal link) (Tom Van Dreser)
Overview of Cloud activities at Cal Poly (Paul Jurasin)
The Zero Trust Model: Should we be taking the information security advice from Congressmen? Drawbridge Network, November 2016
Lance Everette and Tara Bell presented in the theme of “Preserve”: Taking action against USGS legacy data challenges. See recording and slides at the meeting page.
Alan Allwardt demonstrated the creation of a new set of persistent identifiers using the PURL system (https://archive.org/services/purl/). He used the example of the Data Categories of Marine Planning vocabulary, and described other use cases. (Notes)
Jeremy Fischer from Indiana University presented on "Jetstream: A free national science and engineering cloud environment on XSEDE." (video)
The GIS Community of Practice hosted a webinar on ArcGISDesktop to ArcGIS Pro Transition. James Sill and Stephen Zahniser of Esri gave an overview of the user interface and architecture and a demo. We received over 50 questions and comments during the presentation via sli.do and chat and we’re working on getting the Q&A up on our wiki. Recording is available at the meeting page.
At the CDI monthly meetings, our goal is to bring you tools and information to help you do your daily work.
On April 11, 2018, we started with a review of the Reproducible Notebook Series, started in October 2017. The series has been showcasing different examples of reproducible and executable online notebooks. These notebooks are cast as the successor to the traditional scientific paper in a recent Atlantic article that has been making the rounds: The Scientific Paper is Obsolete.
April’s reproducible notebook installment: OBIS (Ocean Biogeographic Information System) and R - Filipe Fernandes, SECORRA/IOOS (Southeast Coastal Ocean Observing Regional Association/Integrated Ocean Observing System). Filipe’s presentation used the jupyter nbviewer, creating a presentation directly from the notebook! He showed how to connect sea turtle observation points to create possible migration paths in the Atlantic Ocean.
Screenshot from Filipe's notebook, plotting and connecting sea turtle observations.
Taxa Taxi: An automated process for using citizen science data to facilitate biodiversity monitoring (Erin Boydston and Toni Lyn Morelli)
iNaturalist citizen science observations are helping researchers understand biodiversity monitoring (after some automated data processing). iNaturalist got a thumbs up from a meeting participant as a neat mobile app to take on your hikes.
USGS Data at Risk: Expanding Legacy Data Inventory and Preservation Strategies (Lance Everette and Tara Bell)
Rescuing legacy data at the USGS remains a Herculean effort. The Legacy Data Inventory Reporting System (LDIRS) and its evaluation criteria can help the USGS address this need.
Web Mapping Application for a Historical Geologic Field Photo Collection (Sarah Nagorsen and Jason Sherba)
Need guidance for proper documentation and publication of geolocated photo collections? See the CDI-funded project on a web mapping application for photo collections.
Some highlights from March 2018 CDI collaboration area activity:
The group discussed a CDI proposal to create specifications for USGS data products so that ISO standard metadata records can be created in tools like the ADIwg metadata toolkit (mdEditor, mdTools). (Update: funded). The group also got a sneak peak at the new Data Dictionary page on the USGS Data Management Website (Update: published).
Brian Fox shared a cloud training resources wiki page.
Ross Wickman gave an update from Cloud Hosting Solutions (CHS).
Eric Martinez gave a presentation entitled Software Inventory, What it is, how it's made, and how you can make it better (internal link). More info: https://sourcecode.cio.gov/
Zarr: A simple, open, scalable solution for big NetCDF/HDF data on the Cloud": Alistair Miles, University of Oxford. The motivation, current status and future plans for Zarr were discussed, along with a demo of basic functionality, and, an analogy between virtual machines and cows. (link to video)
Capturing your processing and analysis workflow in R - Alison Appling. Alison introduced tools in R for dealing with reproducibility of analysis, size and complexity of analysis, collaboration on analysis, and dissemination. (Just a sampling of tools: remake, drake, googledrive, sbtools, whisker). (slides)
R tools for modern data analyses
The group discussed potential activities for future conference calls. The group also maintains links to eDNA talks being hosted outside of the CDI on their wiki page.
Chris Johnson presented on USGS EDGE (Equipment Development Grade Evaluation): What is it, how does it apply to you, and why you may be interested in participating. You can access the recording on their meetings page if you are logged in.
We continued learning about the FAIR Data Principles - I and R stand for Interoperable and Reusable. Awareness of these principles is growing within the CDI.
Cheryl Morris gave the opening announcements, displaying the 18 CDI Proposals that moved to Phase 2, shown around the CDI Science Support Framework. For teams that did not advance to Phase 2, the we always welcome further discussion about how to better frame projects with CDI principles (and we’re not just saying that). She also reminded us that the FY19 Request for Proposals is not too far away, and encourages groups to start the discussion.
FY18 proposals advancing to full proposal stage, around the CDI Science Support Framework.
Group Announcements - A USGS Software Management Website is being planned and Cassandra Ladino (email@example.com) is looking for volunteers to help with the design - this includes everyone from the individual scientist developing software to large development teams. See all announcements.
Kristin Ludwig briefed us “Science for a Risky World: A USGS Plan for Risk Research & Applications” giving us more information about efforts around the USGS that we can join.
There were four CDI funded project presentations from last year, sharing their findings regarding making data more accessible, high throughput computing and docker containers, benefits and limitations of using Tableau for USGS data, and new technologies that allow us to “do science” in the cloud.
We circulated a very brief survey for feedback on the community voting phase of this year’s CDI Request for Proposals. Please take it if you haven’t already!
We’re trying out a new “Highlights” section on the Monthly Meeting pages that will list major links and resources presented at the meeting, these will be posted well before my blog posts!
Tsunami Evacuation Tableau app: https://geography.wr.usgs.gov/science/vulnerability/oahuEvacDashboard.html
USGS Data Life Cycle in the Cloud: https://github.com/USGS-CMG/data-life-cycle-cloud
The frequency of exciting CDI collaboration area meetings is far greater than the frequency of my writing about them. Here are some highlights from the past two months:
The DevOps group has started a Cloud Training Resources page. (Example: Amazon Web Services "What is Cloud Computing?") If you find other training opportunities, please let Brian Fox (firstname.lastname@example.org) know, so that he can add them to the list.
Online platforms for data analysis have arrived, as illustrated by the recent Tech Stack/Tech Dive presentations. The webinar page has links and recordings for The Pangeo Project (an open-source big data science platform), and the National Data Service Labs Workbench (a scalable platform for research data access, education, and training).
The Data Management Working Group has covered several topics including: Publishing metadata to the Science Data Catalog and Data Management Challenges (Jan 2018); Tidy data, Biological Analysis Packages, and Volunteered Geographic Information (Feb 2018).
Bonus: Read the original Tidy Data paper (Wickham, Journal of Statistical Software, 2014). Tidy datasets are easy to manipulate, model and visualize, and have a specific structure: each variable is a column, each observation is a row, and each type of observational unit is a table.
In February, the Semantic Web Working Group tuned in to a National Park Service presentation on the use of linked data to protect cultural heritage resources in the national parks from climate change, using the Digital Index of North American Archaeology.
In February, the Bioinformatics group covered All things microbiome. Many different groups within the USGS have some element of microbiome research - check out the USGS Fact Sheet on microbiome research for more information.
Open Source Coffee Talks decided to combine forces with the Software Development Cluster as of February 2018. The Software Development cluster discussed the topics of the USGS HPC/HTC Workshop, the USGS EDGE (Equipment Development Grade Evaluation) program (more info), and 508 Compliance (IT Accessibility) for websites and web applications. In March, Chris Johnson will give a presentation with further information on the EDGE Program.
The Citizen-Centered Innovation group held its inaugural call on February 21, 2018. Anyone interested in crowdsourcing, citizen science, civic hacking, and challenge & prize competitions are encouraged to join. Contact Sophia B Liu at email@example.com for more information.
The eDNA Community of Practice held its inaugural call on January 16, 2018. They will be held every other month at the same time slot as the Bioinformatics group (3rd Tuesday from 2-3p ET). Contact Pete Ruhl (firstname.lastname@example.org) for more information.
The USGS GIS community followed up on their inaugural call with a message about next steps. You can reply to this short form to log your interest in future talks and topics, including ArcGIS Pro, Serving GIS Data with ESRI, Open-source GIS topics, GIS on the cloud, and Global mapper. You can also suggest a new topic!
Whew. My next goal: Update the blog with collaboration area news in less than two months!
Kyle Enns and Cristiana Falvo from the USGS gave a presentation on "Using Python to Bring Geophysical Data to the Surface", showing the CDI another example of a way to officially share python scripts for reproducibility. Kyle and Cristiana also shared the documents they use for Pre-review quality control, Releasing accessible python code, and their Technical peer review checklist (log in at the meeting page to view).
The feature presentation was "Semantic web for scientific information: streamlining how we write, find, link, and reuse data and models" by Ferdinando Villa of the Basque Centre for Climate Change. After describing the challenge of data and model integration and reuse, and a project he is working on to address the problem (The Integrated Modelling Partnership, www.integratedmodelling.org), he invited us all to come join in the adventure of working together in partnership to build an integrated information landscape! You can contact him at email@example.com.
Ferdinando's presentation was followed by a panel discussion on the semantic web and the USGS with Ken Bagstad, Dalia Varanka, Julia Moriarty, and hosted by myself. It was clear that we need more time to learn from each other about the challenges and opportunities of the semantic web!
You can view the recording and slides on the February 2018 monthly meeting page.