RDM and COVID-19 (Part 3): Institutional Collaborations

This is part 3 of a 4 part series of posts on Research Data Management and COVID-19. Click here for part 1 and part 2.

Collaborative efforts between institutions have yielded numerous open-access datasets, visualization tools, and resources designed to accelerate COVID-19 research by making datasets, tools, and computational resources  more discoverable and accessible. Examples include:

  • MIDAS – A “global network of scientists and practitioners from academia, industry, government, and non-governmental agencies” has created a COVID-19 portal which collates publicly available datasets and code to enable researchers to contribute towards COVID-19 modeling research.
    Midas Network
  • Terra – Terra provides cloud-based work spaces to support bioinformatics research by integrating data pipelines with analytical tools, such as Jupyter Notebook. They have set up specific work spaces to accelerate COVID-19 research. Access to Terra is free but requires an account setup. You can use your google account.Terra
  • VODAN – The Virus Outbreak Data Network (VODAN) is a collaborative effort between the Committee on Data Science and Technology (CODATA), the Research Data Alliance, World Data Systems, and GO FAIR; organizations which promote Open Science efforts in research data. In an effort to encourage best practices and standards for data repositories while still hastening access to COVID-19 data, these organization have created the FAIR Data Points repository which can be granted access to the local data of participating institutions, such as hospitals and research centers. VODAN can then assist institutions with data stewardship, best practices, and managed access to the data in their repositories.
    Virus Outbreak Data Network (VODAN) - GO FAIR
  • High Performance Computing Consortium – This consortium of academic, industry, government organizations pools available computing resources to provide accelerator services for participating members around COVID-19 research. Access to the computing resources are free but require submission and approval of a research proposal. Researchers utilizing the consortium resources are expected to maintain public updates of their progress and eventually publish their findings.
    IBM & DOE COVID-19 High Performance Computing Consortium

Likewise, there are numerous institution-specific sources of COVID-19 research data out there. No doubt many people are already aware of the Johns Hopkins Coronavirus Research Center whose maps, visualizations, dashboards, and data connections have been used to display tracking by many news organizations, such as the NY Times’ global outbreak map. Another example is NYU’s list of ongoing data projects mobilized to combat COVID-19. Many of the on-going institution specific data projects are also being indexed by the above-mentioned government and generalist repositories. As a members of the Data Discovery Collaboration, the MSK Library has been participating in discussions about how institutions and researchers can best make their COVID-19 research more visible and useful to other researchers.

Interactive map from Johns Hopkins shows coronavirus

In the final post in this series, we’ll showcase some of the Data Repository and Publisher responses to the COVID-19 pandemic.

Curious About the Total Number of Publications by MSK Authors in 2019?

The 2019 Synapse Publications Report is now available to help answer this question! Compiled by librarians, this report provides an analysis and documents the publications produced by MSK researchers, clinicians, nurses, and other healthcare professionals for the year in review. The final bibliography includes 5,130 works, comprised of references to research and conference papers, reviews, meeting abstracts, books and book sections. This represents an increase of over 11% from publications identified in our 2018 report.

Synapse is a public facing resource and the authoritative bibliographic database developed and maintained by the Library. Our ultimate goal is to track and showcase the wealth of knowledge attributed to Memorial Sloan Kettering.

I hope that by browsing the 2019 Synapse Publications Report, readers will gain a sense of the contributions made by MSK authors to the ever-growing body of scholarly research.

Feel free to send me an email and let me know what you think about this year’s report.

Donna Gibson
Director of Library Services

RDM and COVID-19 (Part 2): NIH/NLM Sponsored Resources, Open Datasets, and Tools

This is part 2 of a 4 part series of posts on Research Data Management and COVID-19. Click here for part 1.

The National Institutes of Health and the National Library of Medicine have put together a broad spectrum of open-access resources to provide researchers and the public specific types of data, such as:

They have also added COVID-specific features to many of their computing resources such as BLAST. These are just a few examples and they are continuing to add more datasets and research tools all the time. To find out more about their initiatives and get direct links to the individual data sources, they have compiled them into list at the Open-Access Data and Computational Resources to Address COVID-19.

NIH Office of Data Science StrategyAnother very useful government resource for COVID-19 data is the result of a collaboration between The National Center for Data to Health (CD2H), the Clinical and Translational Science Awards (CTSA), and other partner institutions. They have worked together to introduce the National COVID Cohort Collaboration (N3C). Open to membership for individual researchers as well as institutions, they provide a “pathway to rapidly share collaborative results and get attribution for your contributions.” Among the products of this group is a searchable list of COVID-19 data sources which include datasets, visualizations, code, and informational websites from a variety of national and international institutions.

In the next post in this series, we’ll highlight some of the institutional collaborations supporting open science, visualization, and computational resources in response to the COVID-19 pandemic.