The Working Group "Scientific Data Management"
Research Project Lifecycle
Source: Dr. Knodel, Oliver
For state-of-the-art research data management, it is recommended to consider the data life cycle first. Our long-term goal is to document the entire data life cycle in accordance with the Findable, Accessible, Interoperable and Reusable (FAIR) principles. Similar principles also apply to software.
In particular, the steps from the creation of the research data to the publication of data and results should be included in the final publication. We developed a concept for the entire research project lifecycle that contains all essential components and focuses on (meta)data exchange. Descriptions of the individual components provided by our working group can be found in our Top-Level Service Strategy.
Mission
We are developing a concept of a data management lifecycle for our scientists at HZDR. This involves integrating existing data sources, documenting experiments, gathering metadata, managing and integrating the analysis of primary data, and establishing complete data provenance with integrated workflows.
Support
We provide support for existing or planned projects concerning the topics presented above. Electronic documentation is often the first point of contact with this subject. Automated data acquisition and interfaces with analysis programs are also common topics. We support the setup and initialization of a working version of the data acquisition. The goal is always that the scientists involved are able to continue and optimize the projects by themselves afterwards. Our group remains available for further questions.
Overall Services of the Group
Get in touch with us for questions or assistance. Our services can be found in the Research Data Services and the HZDR IT service catalog. The main topics of our group are:
- Everything related to "Research Data Management",
- Support in optimizing HPC applications and workloads,
- Infrastructure for managing a project life cycle with our HZDR infrastructure,
- Documentation of experiments (Lab Documentation System),
- Automated inbound data transfers into our systems from multiple data sources,
- Establishment of workflows related to the FAIR principles,
- Support in archiving of research data, workflows and the scientific publication itself.
Services for the Topic "Data Management & Analysis"
"Data Management & Analysis" is a new research topic in the research program "Matter & Technology" of the Helmholtz research field "Matter". The Computational Science department hosts a small group that supports the domain scientists in the Institute of Radiation Physics working on the same topic. The group is responsible for maintaining mission-critical software components and for interfacing them with the software solutions of HZDR and of the research field.
- Software co-design for high-performance, platform-independent components
- Performance analysis and support with the optimization of existing applications
- Trainings and workshops on all our work topics, in particular supporting the "Highly parallel programming of GPUs" class at TU Dresden
The Team
Name | Bld./Office | Phone (+49 351 260 ...) | Email |
---|---|---|---|
Gruber Dr., Thomas | 270/216 | 3846 | t.gruber@hzdr.de |
Knodel Dr., Oliver | 270/218 | 3845 | o.knodel@hzdr.de |
Müller Dr., Stefan | 270/216 | 3847 | stefan.mueller@hzdr.de |
Pape, David | 270/216 | 3808 | d.pape@hzdr.de |
Useful Links
- IT Service catalogue
- Publication of research data and research software at HZDR
- High Performance Computing at HZDR
- Systems:
- HELIPORT: heliport.hzdr.de
- Proposal management system GATE: gate.hzdr.de
- Data Management Plan (RDMO): rdmo.hzdr.de
- Grafana: grafana.hzdr.de
- Lab documentation systems:
- MediaWiki: wiki.hzdr.de
- MediaWiki (FWK): athene.fz-rossendorf.de
- Metadata Catalogue: SciCat (evaluation system)
- Version control system GitLab: gitlab.hzdr.de
- Data publication platform RODARE: rodare.hzdr.de
Data Management Ecosystem
Image: Dr. Knodel, Oliver
Current and Past Projects
Post-processing for TELBE Experiment
Description: | After each measurement, the user would like to have the post-processed file available as soon as possible. The experiments are controlled with LabVIEW, and the post-processing should be initiated automatically on the cluster. |
Expected Results (Goals): | After each measurement, the post-processing "workflow" is initiated and runs on the cluster, and the resulting file is available on bigdata. |
Owner: | Thomas Gruber |
Customer: | TELBE (FWKP) |
Project Page: | https://gitlab.hzdr.de/fwcc/data-management/user-project-documentation/post-processing-for-telbe-experiment |
Organize Documentation for FMR experiments
Description: | Kilian Lenz would like to set up a documentation platform for all FMR experiments in his group. |
Expected Results (Goals): |
|
Owner: | Thomas Gruber |
Customer: | FMR experiments (FWIN-D) |
Project Page: | https://gitlab.hzdr.de/fwcc/data-management/user-project-documentation/documentation-of-fmr-experiments |
Provide an ETL Workflow for Turbulence Fluid Dynamic Simulations
Description: | Create an ETL workflow based on Celery with PostgreSQL and optional Elasticsearch integration. |
Expected Results (Goals): |
|
Owner: | Oliver Knodel |
Customer: | (FWDC, Thomas Ziegenhein) |
Automated GitLab CI-Job for the bitstream creation on Hemera
Description: | Create a CI job that automates the FPGA bitstream creation after every commit carrying a special tag, in order to provide a valid bitstream and to bring the GitLab project to the next level. |
Expected Results (Goals): |
|
Owner: | Oliver Knodel |
Customer: | FWDF (André Bieberle) |
Project Page: | https://gitlab.hzdr.de/fwdf/measurementscience/projects/ufxct/fpga-dev |
Manage DRESDYN Simulations
Description: | Document every simulation and store selected values/parameters from results. |
Expected Results (Goals): |
|
Owner: | Thomas Gruber |
Customer: | DRESDYN Simulation (FWDH) |
Project Page: | https://gitlab.hzdr.de/fwcc/data-management/user-project-documentation/manage-dresdyn-simulations |
Organize Documentation in Blitzlab
Description: | Documentation of every experiment of the Helmholtz Innovation Lab Blitzlab in openBIS |
Expected Results (Goals): |
|
Owner: | Thomas Gruber |
Customer: | Blitzlab (FWIM) |
Project Page: | https://gitlab.hzdr.de/fwcc/data-management/user-project-documentation/experiment-documentation-in-blitzlab |
Provide a Toolflow for FPGA-DAQ Development using High-Level-Synthesis
Description: | Create a service that generates FPGA designs from OpenCL code using the High-Level Synthesis (HLS) tools from Xilinx on Hemera, and implement the first data acquisition cores. |
Expected Results (Goals): |
|
Owner: | Oliver Knodel |
Customer: | ELBE Experiment (FWKK, Andreas Wagner) |
Provide project IDs with and without proposals
Description: | Create a service that validates proposal IDs or provides an "HZDR-ID" for non-proposal projects. |
Expected Results (Goals): |
|
Owner: | Oliver Knodel |
Customer: |
|
Project Page: |
https://gitlab.hzdr.de/fwcc/data-management/user-project-documentation/project-id and https://gitlab.hzdr.de/fwcc/data-management/dms-guidance-system |
Provide access to environmental data of ELBE laboratories
Description: | Find a secure tool to retrieve selected data from the building control system (Gebäudeleittechnik, GLT) without compromising its control functionality. |
Expected Results (Goals): | Provide data on parameters that influence the laser characteristics/quality. |
Owner: | Stefan Müller |
Customer: | Experimental groups that use lasers, e.g. FWKT, FWKP |
Project Page: | https://gitlab.hzdr.de/fwcc/data-management/user-project-documentation/glt-data |
Test OPC-UA Client
Description: | Interact with ELBE data via OPC-UA and make it available to users in different ways. |
Expected Results (Goals): |
|
Owner: | Stefan Müller |
Customer: |
|
Project Page: | https://gitlab.hzdr.de/muelle94/opcua_test_elbe |
Prepare data for upload to HEPData repository
Description: | Make data sets available in a consistent and useful way. |
Expected Results (Goals): | Prepare the data sets KLOE05, KLOE08, KLOE10 and KLOE12 and the updated KLOE17 sets for upload to the HEPData repository using the hepdata_lib Python library. |
Owner: | Stefan Müller |
Customer: | http://www.strong-2020.eu/ |
Project Page: | https://redmine.hzdr.de/projects/fluka/repository/show/MISC/KLOE/HEPdata |
Organize Documentation for flotation model experiments
Description: | Document flotation model experiments and subsequent analysis steps and results. |
Expected Results (Goals): |
|
Owner: | Thomas Gruber |
Customer: | Flotation model experiments (FWDT) |
Project Page: | https://gitlab.hzdr.de/fwcc/data-management/user-project-documentation/documentation-of-flotation-model-exp. |