Storage and Computing
The core of our Research Data Management (RDM) services is a dedicated team offering guidance and information on effectively managing research data. Key pillars include cloud solutions for secure, accessible data storage and collaboration; RDM and cloud software platforms for data analysis and sharing; high-performance computing environments for complex research projects; and resources for publishing and archiving research data for long-term accessibility. These components are designed to support and enhance research endeavors by providing a comprehensive range of tools and resources for collaboration, data security, and computing needs.
-
Julia Cluster RZ Wü: The Julia Cluster at the Information Technology Centre is a high-performance computing (HPC) environment specifically configured to support research projects using the Julia programming language. It provides access to scalable computing resources for computationally intensive tasks.
-
de.NBI Cloud is a cloud computing infrastructure provided by the German Network for Bioinformatics Infrastructure (de.NBI). It offers computing resources and services tailored to the needs of bioinformatics research, supporting tasks such as sequence analysis, molecular modeling, and data integration.
-
Europäische Open Science Cloud (EOSC): In addition to its role in data storage and sharing, EOSC also provides high-performance computing resources to support computationally intensive research projects across Europe.
Storage and sharing of research data
-
bwSync&Share: Originally developed at Karlsruhe Institute of Technology (KIT), bwSync&Share is a cloud storage solution tailored for academic and research collaboration. Provided by the Information Technology Centre, it offers secure storage and easy data management.
-
Europäische Open Science Cloud (EOSC) is a large-scale initiative aimed at providing Europe with a common framework for managing and sharing research data across borders and disciplines. It offers a wide range of services and resources to support open science practices.
-
Individual consortia solutions such as the TR240 and DECIDE cloud: These are specialized cloud solutions developed by specific research consortia or projects, such as TR240 and DECIDE, tailored to the needs and requirements of their respective research communities. They often offer unique features and integrations to support specific research workflows.
-
Bavarian Cloud is planned as a cloud computing infrastructure provided by the state of Bavaria in Germany. It will offer storage and computing resources for research and innovation projects, supporting collaborative work and data sharing among researchers in the region.
-
VRE Charité - Virtual Research Environment (GDPR compliant) is a Virtual Research Environment designed for the Charité - Universitätsmedizin Berlin, complying with the General Data Protection Regulation (GDPR). It provides tools and services for collaborative research while ensuring data privacy and security, especially licensed to deal with sensitive patient data.
-
Aruna Cloud - sophisticated metadata management according to FAIR is a cloud-based platform that focuses on sophisticated metadata management following the FAIR principles (Findable, Accessible, Interoperable, and Reusable). It helps researchers organize and annotate their data effectively, making it more discoverable and usable. Aruna Cloud is the core of the VirJenDB hosted in Jena, which is a use case of the NFDI microbiota.
-
Neo4j AuraDB is a robust cloud-hosted graph database service offered by Neo4j, renowned for its efficiency in storing and querying highly connected data. Tailored for applications like social networks, recommendation engines, and network analysis in research, AuraDB stands out for its powerful and advantageous graph-based data management capabilities.
-
Nextcloud is an open-source, self-hosted file sync and sharing platform that offers similar functionalities to commercial cloud storage solutions but with greater control over data privacy and security. It is highly customizable and extensible, making it suitable for research collaboration.
-
OMERO - Imaging Research Environment is an open-source platform for FAIR managing and analyzing microscopy and imaging data in research. It provides tools for organizing, visualizing, and sharing large volumes of image data, facilitating collaborative research in the life sciences. OMERO is integrated within the RDM plans and systems of several universities, such as the WWU Münster.
-
SODAR - Sequencing Research Environment is a cloud-based platform designed specifically for FAIR managing and analyzing sequencing data in research. It offers tools for processing, annotating, and visualizing sequencing data, supporting various types of sequencing experiments and workflows, thus allowing the integration of multiomics data. SODAR is part of the RDM at the Charité in Berlin.
Publishing and archiving research data
-
WueData research data repository is a research data repository provided by the University of Würzburg, Germany. It allows researchers to publish and share their data following the FAIR principles (Findable, Accessible, Interoperable, and Reusable), ensuring long-term accessibility and usability.
-
Archive server for long-term data storage is a dedicated server or storage infrastructure used for archiving research data that is not actively used in ongoing research projects. It ensures the long-term preservation of valuable data for future reference and analysis.
-
PDB: The Protein Data Bank (PDB) is a specialized repository for 3D structural data of large biological molecules, such as proteins and nucleic acids. It provides access to atomic coordinates and related experimental data, supporting research in structural biology and drug discovery.
-
GEO: The Gene Expression Omnibus (GEO) is a public repository for high-throughput gene expression data and other types of molecular abundance data. It serves as a valuable resource for researchers studying gene regulation, biomarker discovery, and disease mechanisms.
-
ProSite: ProSite offers a database of protein families, domains, and functional sites, providing valuable information for understanding protein structure and function. It helps researchers identify conserved motifs and patterns associated with specific biological functions or activities.
-
IDR: The Image Data Repository (IDR) focuses on image data, particularly microscopy data, serving as a resource for storing, integrating, and analyzing images in the scientific and biomedical research fields. It supports collaborative research and data sharing among the imaging community.
-
Zenodo: Zenodo is a general-purpose open-access repository developed under the European OpenAIRE program. It allows researchers to share and preserve any research outputs, including datasets, software, papers, and presentations, in any size, format, and discipline.
-
GitHub: GitHub is a widely used platform for hosting and sharing code repositories, supporting version control and collaboration using Git. It is particularly popular for open-source projects but also used for private and commercial development.
-
GitLab: GitLab is similar to GitHub but offers additional features such as built-in CI/CD pipelines, issue tracking, and wiki pages, providing a comprehensive platform for project management and collaboration.
-
Bitbucket: Bitbucket is a Git repository hosting service provided by Atlassian, offering free accounts with unlimited private repositories for small teams. It is a good option for individuals and organizations needing private or small-scale code hosting.
-
SourceForge: SourceForge is a web-based service that provides centralized hosting for open-source software projects. It offers features such as version control, bug tracking, and project management tools, supporting collaborative development and distribution of software.