This repository contains tutorials for fine-tuning and applying MIST (Molecular Insight SMILES Transformer) foundation models to chemical problems.
Model checkpoints for MIST models are available on HuggingFace and on Zenodo.
The full code, including pre-training, model development, and full-scale application demos, can be found in the mist repository.
Complete fine-tuning workflow for MIST encoder models:
- Fine-tuning with LoRA (Low-Rank Adaptation) for parameter-efficient training
- Hyperparameter optimization for the task network
- Training on the QM9 dataset for molecular property prediction
- Model evaluation
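The LoRA step above trains only small low-rank adapter matrices while the pretrained encoder weights stay frozen. The sketch below illustrates the core idea in PyTorch; the `LoRALinear` wrapper, rank, and scaling are illustrative assumptions, not the tutorial's actual implementation.

```python
import torch
from torch import nn

class LoRALinear(nn.Module):
    """Wrap a frozen linear layer with a trainable low-rank update.

    The effective weight is W + (alpha / r) * B @ A, where A and B are
    small (rank-r) matrices. Only A and B receive gradients.
    """

    def __init__(self, base: nn.Linear, r: int = 8, alpha: int = 16):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # freeze the pretrained weight and bias
        # A gets a small random init; B starts at zero so the wrapped layer
        # initially behaves exactly like the pretrained one.
        self.A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, r))
        self.scale = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.base(x) + (x @ self.A.T @ self.B.T) * self.scale
```

Because `B` is initialized to zero, fine-tuning starts from the pretrained model's behavior, and only a tiny fraction of the encoder's parameters are updated.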
Inference demonstrations using fine-tuned MIST models:
- Loading pretrained MIST checkpoints from HuggingFace
- Predicting boiling point, flash point, and melting point
- Analyzing property trends for alkenes and alcohols
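For property prediction, a common pattern is to attach a small regression head to the pooled encoder embedding, with the encoder itself loaded from the HuggingFace checkpoint. The sketch below shows only a hypothetical task head; `embed_dim` and the layer sizes are assumptions for illustration, not values from the MIST config.

```python
import torch
from torch import nn

class PropertyHead(nn.Module):
    """Map a pooled molecular embedding to a single scalar property
    (e.g. boiling point). Dimensions here are illustrative."""

    def __init__(self, embed_dim: int = 512, hidden: int = 128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(embed_dim, hidden),
            nn.GELU(),
            nn.Linear(hidden, 1),  # one scalar prediction per molecule
        )

    def forward(self, pooled: torch.Tensor) -> torch.Tensor:
        # pooled: (batch, embed_dim) -> (batch,)
        return self.net(pooled).squeeze(-1)
```

Separate heads (or a multi-output final layer) can be trained for each target property while sharing the same frozen or LoRA-adapted encoder.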
- Clone the repository:

  ```bash
  git clone <repository-url>
  cd mist-demo
  ```

- Create a virtual environment and install dependencies using uv:

  ```bash
  uv sync
  source .venv/bin/activate  # On Windows: .venv\Scripts\activate
  ```

- Launch Jupyter and open any notebook in `mist-demo/tutorials`:

  ```bash
  jupyter notebook
  ```

If you use the MIST models in your work, please cite:
```bibtex
@online{MIST,
  title = {Foundation Models for Discovery and Exploration in Chemical Space},
  author = {Wadell, Alexius and Bhutani, Anoushka and Azumah, Victor and Ellis-Mohr, Austin R. and Kelly, Celia and Zhao, Hancheng and Nayak, Anuj K. and Hegazy, Kareem and Brace, Alexander and Lin, Hongyi and Emani, Murali and Vishwanath, Venkatram and Gering, Kevin and Alkan, Melisa and Gibbs, Tom and Wells, Jack and Varshney, Lav R. and Ramsundar, Bharath and Duraisamy, Karthik and Mahoney, Michael W. and Ramanathan, Arvind and Viswanathan, Venkatasubramanian},
  date = {2025-10-20},
  eprint = {2510.18900},
  eprinttype = {arXiv},
  eprintclass = {physics},
  doi = {10.48550/arXiv.2510.18900},
  url = {http://arxiv.org/abs/2510.18900},
}
```