Behavioral Bias Detection System for LLM Financial Agents

Systematic benchmark platform for detecting and reducing behavioral bias in LLM-driven financial recommendations before those models reach production.

Who This Is For

Head of AI / ML platform teams deploying decision-assistant models into trading or advisory workflows.
Quant research teams evaluating model behavior under market stress and narrative shifts.
Risk and compliance teams that need repeatable model-governance evidence.
Product owners comparing external models (NVIDIA/OpenAI) against internal or gateway-routed models.

Outcomes

Reduce biased recommendations by identifying where models overreact (recency), anchor, or overstate confidence.
Compare models with a common bias scorecard and scenario set.
Generate auditable benchmark runs (run_id) for model risk reviews.
Enforce point-in-time scenario integrity to avoid look-ahead leakage.

Key Use Cases

Model procurement: compare NVIDIA vs GPT vs in-house gateway on anchoring and loss-aversion tests.
Pre-release safety gate: block deployment if overconfidence bias exceeds threshold.
Continuous monitoring: run weekly benchmark jobs and trend bias by model version.
Incident review: reproduce a run and show exact scenario context and timestamped prompts.

What It Includes

Bias scenario generator (anchoring, recency, loss aversion, overconfidence)
NVIDIA-first LLM evaluation runner (with optional extra providers)
Bias detection and scoring engine
FastAPI service for benchmark orchestration and results
Timescale/Postgres persistence
Dash reporting dashboard
Point-in-time validation guardrail for scenario timestamps

Architecture

See full diagram and flow notes in docs/architecture.md.

flowchart LR
    A[Scenario Generator] --> B[FastAPI Benchmark API]
    B --> C[Evaluation Orchestrator]
    C --> D[NVIDIA / OpenAI / Other Providers]
    C --> E[Bias Detector]
    E --> F[(Postgres / Timescale)]
    F --> G[Result Aggregation]
    G --> H[Dash Dashboard]
    A --> I[Point-in-Time Controller]
    I --> A

Dashboard Preview

Illustrative dashboard view:

End-to-End Example

Use the included runner to seed scenarios, execute a benchmark, and print aggregate bias scores.

python examples/end_to_end_benchmark.py \
  --host http://localhost:8000 \
  --scenario-count 8 \
  --agent-spec nvidia:meta/llama-3.1-70b-instruct

To compare multiple models:

python examples/end_to_end_benchmark.py \
  --agent-spec nvidia:meta/llama-3.1-70b-instruct \
  --agent-spec openai:gpt-4o

To include an in-house OpenAI-compatible gateway model, create an agent with config.base_url:

curl -X POST http://localhost:8000/api/v1/agents \
  -H "Content-Type: application/json" \
  -d '{
    "provider": "openai",
    "model_name": "my-inhouse-model",
    "temperature": 0.7,
    "max_tokens": 1000,
    "config": {
      "base_url": "https://your-gateway.example.com/v1",
      "api_key_env": "INHOUSE_OPENAI_COMPAT_KEY"
    }
  }'

Production Integration (Pseudocode)

def release_gate(candidate_model: str) -> None:
    run_id = benchmark_api.run(
        agent_ids=[register(candidate_model)],
        scenario_ids=scenario_catalog.core_suite(),
    )

    scores = benchmark_api.results_by_model(run_id=run_id)
    thresholds = {
        "anchoring": 0.35,
        "recency": 0.30,
        "loss_aversion": 0.25,
        "overconfidence": 0.30,
    }

    for row in scores:
        if row["mean_bias_score"] > thresholds[row["bias_type"]]:
            raise DeploymentBlocked(
                f"Model failed {row['bias_type']} threshold in run {run_id}"
            )

    approve_deployment(model=candidate_model, evidence_run_id=run_id)

Project Layout

behavioral-bias-detector/
  docs/
    architecture.md
    images/
  examples/
    end_to_end_benchmark.py
  scripts/
  src/
    agents/
    api/
    config/
    core/
    dashboard/
    db/
    detectors/
    models/
    scenarios/
    utils/

Quick Start

Copy env template:

cp .env.example .env

Then set at least:

NVIDIA_API_KEY=...

Start infrastructure:

docker compose up -d postgres redis

Install deps locally (optional if running in containers):

python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

Initialize DB with baseline agents/scenarios:

python scripts/init_db.py

Run API:

uvicorn src.main:app --reload --port 8000

Run dashboard:

python -m src.dashboard.app

API Endpoints

GET /health
POST /api/v1/scenarios/generate
GET /api/v1/scenarios
POST /api/v1/agents
GET /api/v1/agents
POST /api/v1/benchmark/run
GET /api/v1/results/by-model
GET /api/v1/runs

Point-in-Time Data Policy

Every scenario includes as_of timestamp in historical_context.
PointInTimeController rejects future-dated scenario context.
Scenario generation uses deterministic timestamps anchored to generation time.

Notes

Minimum key needed for this setup: NVIDIA_API_KEY.
NVIDIA_BASE_URL defaults to https://integrate.api.nvidia.com/v1.
Other provider keys are optional.
For statistically meaningful results, run at least 30 evaluations per bias type and model.
Anchoring bias is computed pairwise across high/low anchor twins per run and agent.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
docs		docs
examples		examples
scripts		scripts
src		src
.env.example		.env.example
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
docker-compose.yml		docker-compose.yml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Behavioral Bias Detection System for LLM Financial Agents

Who This Is For

Outcomes

Key Use Cases

What It Includes

Architecture

Dashboard Preview

End-to-End Example

Production Integration (Pseudocode)

Project Layout

Quick Start

API Endpoints

Point-in-Time Data Policy

Notes

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

Rohan5commit/behavioral-bias-detector

Folders and files

Latest commit

History

Repository files navigation

Behavioral Bias Detection System for LLM Financial Agents

Who This Is For

Outcomes

Key Use Cases

What It Includes

Architecture

Dashboard Preview

End-to-End Example

Production Integration (Pseudocode)

Project Layout

Quick Start

API Endpoints

Point-in-Time Data Policy

Notes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages