HT Parsers – High Throughput Experiment Data Parser

HT Parsers is a Python package developed at Institut Néel to parse and structure data from high-throughput experiments.

The package supports multiple characterization techniques and stores data using the MaMMoS ontology, enabling consistent, machine-readable datasets that can be exported to HDF5 (NeXus-inspired format).

Supported techniques include:

EDX – elemental composition
MOKE – magnetic measurements
XRD – structural characterization
Profilometry (DEKTAK) – film thickness
SEM – microstructure imaging

Ontology used in this project: https://github.com/MaMMoS-project/MagneticMaterialsOntology

Installation

Clone the repository and install in editable mode:

git clone https://github.com/MaMMoS-project/ht-data-parser.git
cd ht-data-parser
python -m venv .venv
source .venv/bin/activate
pip install -e .

Dependencies are managed through pyproject.toml, please note that you need to use Python 3.11 or higher.

Basic Usage

A Jupyter Notebook DataParser.ipynb is providing a full detail on how to use the ht-data-parser, a basic usage has also been written down below:

Each measurement is represented by a Meas class containing:

metadata – instrument metadata
data – raw measurement data
results – processed quantities

Example with EDX:

import pathlib
from src.measurements.edxmeas import EdxMeas

path = pathlib.Path("Spectrum_(9,9).spx")

edx_spectrum = EdxMeas(path)
fig = edx_spectrum.plot()
fig.show()

Quantities are stored as ontology-aware entities:

energy = edx_spectrum.data["Energy"]

energy.value
energy.unit
energy.ontology

High Throughput Scans

For wafer-scale experiments (~250 positions), the package provides Scan classes that parse entire folders of measurements.

Example:

from src.scans.edxscan import EdxScan

edx_scan = EdxScan("EDX_folder")
edx_scan.heatmap("results.Nd.AtomPercent")

edx_scan.list_scalar_quantities() # List all values that can be plotted

Supported scan classes:

EdxScan
MokeScan
SmartlabScan
EsrfScan
ProfilScan
SemScan

Data Export

Measurements and scans can be exported to HDF5:

edx_scan.to_hdf5("dataset.hdf5")

The resulting structure follows conventions inspired by the NeXus scientific data format:

https://www.nexusformat.org/

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
src		src
.gitignore		.gitignore
DataParser.ipynb		DataParser.ipynb
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

HT Parsers – High Throughput Experiment Data Parser

Installation

Basic Usage

High Throughput Scans

Data Export

About

Uh oh!

Releases 5

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

HT Parsers – High Throughput Experiment Data Parser

Installation

Basic Usage

High Throughput Scans

Data Export

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 5

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages