-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy path_pkgdown.yml
More file actions
320 lines (287 loc) · 13.2 KB
/
_pkgdown.yml
File metadata and controls
320 lines (287 loc) · 13.2 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
url: https://ntiGideon.github.io/ReproStat
# ---------------------------------------------------------------------------
# Open Graph / social card metadata
# ---------------------------------------------------------------------------
opengraph:
twitter:
card: summary_large_image
# ---------------------------------------------------------------------------
# Site template — Bootstrap 5 + bslib theming
# ---------------------------------------------------------------------------
template:
bootstrap: 5
bootswatch: flatly
css: pkgdown/extra.css
bslib:
# Brand colours
primary: "#2C7FB8" # steel blue — buttons, links, active nav
secondary: "#1B9E77" # forest green — accents
success: "#41AE76" # muted green
warning: "#E07B39" # amber
danger: "#C0392B" # red
info: "#31B0D5" # sky blue
# Backgrounds & surface colours
body-bg: "#FFFFFF"
body-color: "#2d3748"
light: "#F4F7FA"
dark: "#1a202c"
# Typography
base_font: {google: "Inter"}
heading_font: {google: "Inter"}
code_font: {google: "Fira Code"}
font-size-base: "0.935rem"
headings-color: "#1a202c"
headings-font-weight: "600"
# Links
link-color: "#2C7FB8"
link-hover-color: "#1a5a8a"
link-decoration: "none"
link-hover-decoration: "underline"
# Code
code-color: "#C0392B"
code-bg: "#F7F4F4"
# Cards & borders
border-radius: "0.5rem"
border-radius-lg: "0.75rem"
card-border-width: "0"
box-shadow: "0 2px 8px rgba(0,0,0,0.07)"
# Spacing
spacer: "1.1rem"
includes:
in_header: |
<meta name="description"
content="ReproStat is an R package for reproducibility diagnostics
in statistical modeling. Quantify coefficient, p-value, selection,
and prediction stability under bootstrap, subsampling, and noise
perturbations, and summarise results with a 0–100 Reproducibility
Index." />
<meta name="keywords"
content="reproducibility, statistical modeling, R package, stability
diagnostics, bootstrap, perturbation, reproducibility index,
coefficient stability, model comparison" />
<meta property="og:type" content="website" />
<meta property="og:title" content="ReproStat — Reproducibility Diagnostics for R" />
<meta property="og:description"
content="Bootstrap, subsample, and noise-based stability diagnostics
for statistical models in R. Includes a composite Reproducibility
Index, CI estimation, repeated CV ranking stability, and four
modelling backends." />
<meta name="twitter:card" content="summary" />
<meta name="twitter:title" content="ReproStat — Reproducibility Diagnostics for R" />
# ---------------------------------------------------------------------------
# Home page
# ---------------------------------------------------------------------------
home:
title: "ReproStat — Reproducibility Diagnostics for Statistical Modeling"
description: >
ReproStat answers a question standard model summaries leave open:
*if the data changed a little, would the conclusion still hold?*
It perturbs a dataset B times, refits the same model each time, and
aggregates how much coefficient estimates, significance decisions,
variable-selection patterns, and predictions move — producing a
0–100 Reproducibility Index with bootstrap uncertainty intervals.
Four modelling backends (lm, glm, rlm, glmnet) and repeated
cross-validation ranking stability are supported out of the box.
strip_header: false
links:
- text: "CRAN"
href: https://cran.r-project.org/package=ReproStat
- text: "GitHub"
href: https://github.com/ntiGideon/ReproStat
- text: "JSS article"
href: https://github.com/ntiGideon/ReproStat
- text: "Report a bug"
href: https://github.com/ntiGideon/ReproStat/issues
sidebar:
structure: [links, license, citation, authors, dev]
# ---------------------------------------------------------------------------
# Navigation bar
# ---------------------------------------------------------------------------
navbar:
bg: primary
type: dark
structure:
left: [intro, articles, reference, news]
right: [search, github]
components:
intro:
text: "Get Started"
href: articles/ReproStat-intro.html
articles:
text: "Articles"
menu:
- text: "─── Foundations ───────────────"
- text: "Introduction & Quick Start"
href: articles/ReproStat-intro.html
- text: "Interpreting ReproStat Outputs"
href: articles/interpreting-reprostat.html
- text: "─── Going Deeper ───────────────"
- text: "Choosing a Modeling Backend"
href: articles/backend-guide.html
- text: "Practical Workflow Patterns"
href: articles/workflow-patterns.html
reference:
text: "Reference"
href: reference/index.html
news:
text: "Changelog"
href: news/index.html
github:
icon: fa-github
href: https://github.com/ntiGideon/ReproStat
aria-label: "View source on GitHub"
# ---------------------------------------------------------------------------
# Articles (vignettes)
# ---------------------------------------------------------------------------
articles:
- title: "Foundations"
desc: >
Start here if you are new to ReproStat. These articles explain what the
package does, how to run your first analysis, and how to read the outputs
it produces.
contents:
- ReproStat-intro
- interpreting-reprostat
- title: "Going Deeper"
desc: >
These articles cover modelling backend selection and practical patterns
for incorporating reproducibility diagnostics into applied analysis
workflows.
contents:
- backend-guide
- workflow-patterns
# ---------------------------------------------------------------------------
# Reference (function documentation)
# ---------------------------------------------------------------------------
reference:
# ── Overview ──────────────────────────────────────────────────────────────
- title: "Package Overview"
desc: >
High-level entry point and the S3 class that all diagnostic functions
operate on. Start here if you are reading the reference for the first
time.
contents:
- ReproStat-package
- run_diagnostics
- print.reprostat
# ── Perturbation ──────────────────────────────────────────────────────────
- title: "Data Perturbation"
desc: >
`perturb_data()` is the building block underneath `run_diagnostics()`.
Use it directly when you need fine-grained control over how the data
are perturbed — for example, to pass the perturbed datasets to an
external modelling pipeline.
Three strategies are available:
- **Bootstrap** (`"bootstrap"`) — draws *n* rows with replacement,
mimicking ordinary sampling variability.
- **Subsampling** (`"subsample"`) — draws *m = ⌊ρn⌋* rows without
replacement, stressing robustness to sample composition.
- **Noise injection** (`"noise"`) — adds Gaussian noise scaled to each
predictor's standard deviation, simulating measurement error.
contents:
- perturb_data
# ── Stability metrics ─────────────────────────────────────────────────────
- title: "Stability Metrics"
desc: >
Four complementary views of how model outputs move across perturbation
runs. Each function takes a `reprostat` object returned by
`run_diagnostics()` and returns a numeric summary.
| Function | Question answered | Unit |
|---|---|---|
| `coef_stability()` | How much do coefficient estimates vary? | variance (lower = more stable) |
| `pvalue_stability()` | How often is each predictor significant? | frequency in [0, 1] |
| `selection_stability()` | Do predictors keep the same direction/inclusion? | proportion in [0, 1] |
| `prediction_stability()` | How much do predictions change? | variance (lower = more stable) |
`pvalue_stability()` and `selection_stability()` measure *different*
things: the former asks about the stability of a binary significance
decision; the latter asks about the direction or inclusion pattern of each
predictor.
contents:
- coef_stability
- pvalue_stability
- selection_stability
- prediction_stability
# ── Composite RI ──────────────────────────────────────────────────────────
- title: "Reproducibility Index"
desc: >
The Reproducibility Index (RI) aggregates the four stability components
into a single 0–100 score using a per-component normalisation and a
simple average. `ri_confidence_interval()` estimates uncertainty in
that score by resampling the stored perturbation draws — no additional
model fitting required.
**RI quick-reference guide**
| RI | Interpretation |
|---|---|
| 90–100 | Highly stable under the chosen perturbation design |
| 70–89 | Moderately stable; overall pattern is dependable |
| 50–69 | Mixed stability; inspect component breakdown |
| < 50 | Low stability; results may be fragile |
These are interpretive anchors, not universal cutoffs. Always inspect
the component decomposition alongside the aggregate score.
contents:
- reproducibility_index
- ri_confidence_interval
# ── Model comparison ──────────────────────────────────────────────────────
- title: "Cross-Validation Ranking Stability"
desc: >
`cv_ranking_stability()` evaluates *model-selection* stability: given
several candidate formulas, which one wins most consistently across
repeated K-fold cross-validation? It records each model's rank in every
repeat and summarises the distribution of those ranks.
Two summary statistics are particularly useful:
- **`top1_frequency`** — proportion of repeats in which a model ranked
first. High values mean the model is a consistently strong choice.
- **`mean_rank`** — average rank across all repeats (lower is better).
It is possible for the model with the lowest mean error to *not* have
the highest top-1 frequency.
Supports the same four backends as `run_diagnostics()`.
contents:
- cv_ranking_stability
- plot_cv_stability
- plot_cv_stability_gg
# ── Visualization ─────────────────────────────────────────────────────────
- title: "Visualization"
desc: >
Two families of plotting helpers are provided. The base-graphics
functions (`plot_stability()`, `plot_cv_stability()`) have no external
dependencies. The **ggplot2** variants (`plot_stability_gg()`,
`plot_cv_stability_gg()`) return `ggplot` objects that can be further
customised with standard ggplot2 layers and themes.
Both families are called for their side effects; the ggplot2 variants
additionally return a `ggplot` object invisibly.
contents:
- plot_stability
- plot_stability_gg
# ---------------------------------------------------------------------------
# Changelog (NEWS.md)
# ---------------------------------------------------------------------------
news:
one_page: true
cran_dates: true
releases:
- text: "ReproStat 0.1.1"
href: https://github.com/ntiGideon/ReproStat/releases/tag/v0.1.1
- text: "ReproStat 0.1.0"
href: https://github.com/ntiGideon/ReproStat/releases/tag/v0.1.0
# ---------------------------------------------------------------------------
# Footer
# ---------------------------------------------------------------------------
footer:
structure:
left: [package]
right: [developed_by, built_with]
components:
package: >
**ReproStat** is free software released under the
[GPL (≥ 3)](https://www.gnu.org/licenses/gpl-3.0.html) licence.
# ---------------------------------------------------------------------------
# Authors
# ---------------------------------------------------------------------------
authors:
"Gideon Nti Boateng":
href: "mailto:gidiboateng200@gmail.com"
# ---------------------------------------------------------------------------
# Search
# ---------------------------------------------------------------------------
search:
exclude: ["news/index.html"]