We provide illustrative code that demonstrate how the data for the 2021 QSR Data Challenge Competition can be accessed in R.
The data for each layer are provided in the HDF5 format, and can be accessed at the following link:
To access the variables in the dataset in R, we utilize the rhdf5 library, which is part of the Bioconductor suite of R libraries.
install.packages("BiocManager")
BiocManager::install("rhdf5")
library(rhdf5)
Once the libraries are installed and loading, we read the data into R by means of the h5read function. Note that to execute this function, you must provide the name in addition to the filename of the file. For the illustrative data, the name is "OpenData".
sample_dataset = h5read(name="OpenData", file="DATA_PORTION_layer352.hdf5")
head(sample_dataset)
In this data matrix, the columns correspond to (from left to right)
We provide a scatterplot visualization of the X, Y, and SignalInGaAs variables.
install.packages("scatterplot3d")
library(scatterplot3d)
scatterplot3d(sample_dataset[,1],sample_dataset[,2],sample_dataset[,7],
xlab="X", ylab="Y", zlab="SignalInGaAs",
pch=".")