Process State Evaluation

Research codebase for evaluating process-state-based short-term simulation, supporting two papers:

  1. ICPM-2025 — Three-flavour comparison (process state vs warm-up approaches)
  2. Uncertainty/clustering extension — Confidence estimation with clustering models

Installation

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
pip install -r requirements.txt

Directory Structure

process_state/
  main.py                              -- CLI entry point
  src/                                 -- core library (process state + simulation)
  evaluation/                          -- paper evaluation pipelines
    evaluation.py                      -- shared evaluation metrics
    helper.py                          -- data I/O and window utilities
    rtd.py                             -- remaining time distribution metric
    short_term_simulation/
      evaluate_with_existing_alog.py   -- three-flavour comparison (ICPM-2025)
    clustering/
      clustered_short_term_simulation.py -- uncertainty/clustering orchestration
      features.py                      -- feature engineering at cut timestamps
      models.py                        -- clustering model training & evaluation
      check_ci_calibration.py          -- CI calibration validation
      check_clusters.py                -- visualization/analysis
  tests/                               -- automated test suite (pytest)
  tools/
    fix_timestamps.py                  -- timestamp format fixer
  samples/
    icpm-2025/                         -- data for ICPM-2025 paper
    extension-uncertainty/             -- data for clustering/uncertainty extension

Running Evaluation

ICPM-2025 pipeline

python -m evaluation.short_term_simulation.evaluate_with_existing_alog DATASET_NAME --runs 10 --cut-strategy fixed

Arguments:

  • DATASET_NAME: name of a dataset (e.g. BPIC_2012, LOAN_STABLE) or group (ALL, SYNTHETIC, REAL-LIFE).
  • --runs: number of Monte Carlo repetitions per cut-off (default: 10).
  • --cut-strategy: method to choose cut-off timestamps.
    • fixed — single timestamp from dataset config.
    • wip3 — three WiP percentiles (10%, 50%, 90%).
    • segment10 — ten random points in equal time segments.
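The segment10 strategy above can be sketched as follows. This is a minimal illustration, not the pipeline's actual implementation; the function name and seed handling are assumptions.

```python
# Hypothetical sketch of the "segment10" cut-off strategy: split the log's
# time span into n equal segments and pick one random timestamp in each.
import random
from datetime import datetime, timedelta

def segment10_cuts(log_start: datetime, log_end: datetime,
                   n: int = 10, seed: int = 42) -> list[datetime]:
    """Return one random cut-off timestamp inside each of n equal segments."""
    rng = random.Random(seed)
    span = (log_end - log_start) / n  # width of one segment
    cuts = []
    for i in range(n):
        seg_start = log_start + i * span
        offset = timedelta(seconds=rng.uniform(0, span.total_seconds()))
        cuts.append(seg_start + offset)
    return cuts
```

Because each cut falls inside its own segment, the returned timestamps are already in chronological order.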

Datasets

Real-life: BPIC_2012, BPIC_2017, WORK_ORDERS

Synthetic: LOAN_STABLE, LOAN_CIRCADIAN, LOAN_UNSTABLE, P2P_STABLE, P2P_CIRCADIAN, P2P_UNSTABLE

Output

Results are saved in outputs/<DATASET>/<run_id>/, including:

  • Reference subsets (A_event_filter.csv, A_ongoing.csv, A_complete.csv)
  • Simulation logs and statistics for each repetition
  • final_results.json with per-cut and overall aggregated metrics
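A small helper for reading a run's aggregated metrics might look like this. The key name "overall" inside final_results.json is an assumption about the schema, not the pipeline's documented format.

```python
# Hypothetical reader for a run's final_results.json; the "overall" key
# is an assumed schema detail, not confirmed by the repository docs.
import json
from pathlib import Path

def load_overall_metrics(run_dir: str) -> dict:
    """Return the aggregated metrics stored in <run_dir>/final_results.json."""
    results = json.loads(Path(run_dir, "final_results.json").read_text())
    return results.get("overall", {})
```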

Running Tests

pytest tests/ -v

Samples

  • samples/icpm-2025/ — Real-life and synthetic event logs for the ICPM-2025 paper
  • samples/extension-uncertainty/ — Pre-split train/test logs for the clustering extension
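A sample log can be loaded with the standard library, roughly as below. The column names ("case_id", "activity", "end_time") are assumptions about the CSV schema, not the repository's confirmed format.

```python
# Minimal sketch of loading a sample event log; column names are assumed.
import csv
from datetime import datetime

def read_event_log(path: str) -> list[dict]:
    """Read a CSV event log and parse ISO-format end timestamps in place."""
    with open(path, newline="") as f:
        rows = list(csv.DictReader(f))
    for row in rows:
        row["end_time"] = datetime.fromisoformat(row["end_time"])
    return rows
```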

About

Technique to compute the state of an ongoing process for Business Process Simulation
