New functionality in the latest DataHub software version

10 January 2019

We are happy to announce several major improvements in the latest release of the DataHub software. In this release (version 2.3.0), our primary focus has been on improving the overall user experience. On a more technical level, we have improved the stability of several processes. As a result, the DataHub infrastructure is more scalable and better prepared for the future. More details below.

User experience

Based on feedback from our users, we have brought more functionality into the so-called self-service portal on the DataHub website. Retrieving the data uploaded by you or your colleagues has never been this easy. In this release, we introduce the Project and collection browser, which enables users to browse project and collection (meta)data directly from within the DataHub portal (see Figures 1 and 2).

 

Figure 1: The projects overview lists all projects the current user has access to.
Principal investigators can also see and edit financial information regarding the project.


Figure 2: Clicking on a project lists all data collections that have been ingested.


Data files inside a collection can now be downloaded from the Collection browser page, which opens after clicking a collection (see Figure 3).
To download large files or entire directory trees, we recommend using the Direct download options from the menu.

Figure 3: On the collection browser page, the collection can be traversed and individual files can be downloaded.

The semantic search platform DISQOVER has been launched for end users to search through publicly available data sets and variables used in the Maastricht Study. In this release, we have added functionality to search for and retrieve data sets that were ingested into the DataHub system. DISQOVER will eventually replace the old research data warehouse (see Figures 4 and 5).

Figure 4: The DISQOVER system now contains project and collection metadata and can be used to search for data sets ingested into the DataHub system.

 

Figure 5: The old research data warehouse will be replaced by DISQOVER.


Furthermore, there is now a tighter integration between DISQOVER, the persistent identifier landing page (Figure 6) and the collection browser.
A complete overview of all data ingest and retrieval methods is shown in Figure 7.

Figure 6: Persistent identifier landing page

Figure 7: Overview of data ingest and retrieval methods

 

Other technical details

Features

  • New menu structure in the web portal.
  • The storage capacity has been expanded by attaching new hardware resources to the iRODS system.
  • Increased performance for data ingest operations.
  • Various improvements in logging.
  • Upgrade to iRODS 4.1.12.
  • New microservice to create ePIC persistent identifiers.
  • New queuing mechanism for various workflows (RabbitMQ).

Bugfixes

  • Fixed creation of ePIC persistent identifiers after the certificate update at SURFsara.
  • Prevented unexpected reboots of Windows servers.
  • Checksum operations no longer perform an extra write.
  • Fixed a case where drop zones without a creator were not handled properly.

 

*Special thanks to Dr. Dennie Hebels (cBITE group, MERLN) for allowing us to use the above-mentioned screenshots.