Advanced Data Import (Templates)

Data Import Templates are currently in Beta. Exact features and layout are subject to frequent change until this feature is released.

Data Import Templates allow you to define how the data in a file is uploaded. A template can be saved for re-use or applied once to a single data file. This lets you upload files and create new Datasets immediately, without having to define the upload rules each time.

Data Import Templates are made using the following steps.

While in Beta, this feature can be accessed from the Dataset Library using the “Advanced Import” button. This location may change in the future.

1. Upload File

To upload one or more files, click “Upload File” in the “Add New Source” section.

Data Import Template Initial

2. Add Files and Tabs to a Workspace

Each file and tab (if using .xlsx) will be shown in the left menu. You can add each file to the workspace by clicking “add”. Inside a file are ‘data regions’, which are the areas where your data is located. The image below shows four uploaded files (left) and a data region (right).

Data Import Workspace

If your file has multiple data regions, such as those described in Data and File Layouts, you can add the file from your workspace any number of additional times to capture the additional data regions.

Extract Data

3. Define File Layouts

Each data region is detected automatically and shown on the left, with a preview of the final Dataset shown on the right. The image below shows a tabular layout as described in Data and File Layouts.

Data Region Tabular

If you have transposed data, you can select the “Unpivot” option. Unpivot can operate on one or many rows and columns in the data region. For a transposed table, use the “Selection Bounds” to select everything except the first column, because the first column’s values will become the new table’s column names.

Data Region Unpivot
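To make the unpivot step concrete, here is a minimal sketch in plain Python of what unpivoting a transposed table amounts to. It illustrates the concept only and is not how the import tool is implemented; the `region` data and all variable names are hypothetical.

```python
# Hypothetical transposed data region: the first column holds variable names,
# and every other column holds one sample's values.
region = [
    ["Temperature", 37.0, 25.0],
    ["pH", 7.2, 6.8],
]

# The first column's values become the new table's column names.
column_names = [row[0] for row in region]

# Everything except the first column is the selection to unpivot
# (the role played by the "Selection Bounds" in this sketch).
selection = [row[1:] for row in region]

# Transposing the selection turns each original column into one output row.
unpivoted = [dict(zip(column_names, values)) for values in zip(*selection)]

print(unpivoted)
```

Each dictionary in `unpivoted` is one row of the resulting table, keyed by the names taken from the first column.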

Merge Data

After you have defined your data files and data regions, you can define how they are merged together to create a single Dataset. Data regions are merged by matching the values in a selected column of one data region to the values in a selected column of another. Use this kind of merge when different variables exist across data files and need to be combined, which results in a wider table (an outer join).

Data Merge
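The column-matching merge described above behaves like an outer join. The following is a small sketch in plain Python, not the tool's implementation; the two regions, the `Sample` key column, and the `outer_join` helper are hypothetical, and the sketch assumes each key value appears at most once per region.

```python
# Hypothetical data regions from two files, matched on a shared "Sample" column.
region_a = [
    {"Sample": "S1", "Temperature": 37.0},
    {"Sample": "S2", "Temperature": 25.0},
]
region_b = [
    {"Sample": "S2", "pH": 6.8},
    {"Sample": "S3", "pH": 7.4},
]


def outer_join(left, right, key):
    """Merge rows whose key values match; keep unmatched rows from both sides."""
    # Collect every column name, in first-seen order.
    columns = []
    for row in left + right:
        for name in row:
            if name not in columns:
                columns.append(name)

    merged = {}
    for row in left + right:
        # Start each key with every column empty (None), then fill in known values.
        target = merged.setdefault(row[key], dict.fromkeys(columns))
        target.update(row)
    return list(merged.values())


wide = outer_join(region_a, region_b, "Sample")
for row in wide:
    print(row)
```

Sample `S2` appears in both regions, so its row carries both variables. `S1` and `S3` appear in only one region, so the variable they lack is left empty (`None`), which is what makes the result wider without dropping rows.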

You can alternatively add a data region to your final Dataset as rows. This can be used when each data region contains the same kind of data, resulting in a longer table (concatenation). Once you are done merging all data regions into one, you can select “Finalize Dataset”.
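Adding a data region as rows corresponds to concatenation. As a rough sketch in plain Python (the regions and the `append_rows` helper are hypothetical, and the column check is an assumption made for this illustration rather than documented tool behavior):

```python
# Hypothetical data regions that contain the same kind of data.
region_a = [
    {"Sample": "S1", "pH": 7.2},
    {"Sample": "S2", "pH": 6.8},
]
region_b = [
    {"Sample": "S3", "pH": 7.4},
]


def append_rows(dataset, region):
    """Append a region's rows, assuming it has the same columns as the dataset."""
    if dataset and region and set(dataset[0]) != set(region[0]):
        raise ValueError("regions must share the same columns to be added as rows")
    return dataset + region


long_table = append_rows(region_a, region_b)
print(len(long_table))
```

The result keeps the same columns and gains one row per appended record, giving a longer table rather than a wider one.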

Add Details and Finalize

Once in the finalize step, you can enter a name and click “Create Dataset”. After creating the Dataset, you can enter the Dataset Name, Dataset Description, Link to ELN entry, and Tags. Completing this step will take you to the Dataset details page.

We can now use this Dataset to create/run an analysis.