Help configuring Ollama/Continue to split 7B model between 4GB VRAM and 24GB RAM (Exit Status 2) #11788
Unanswered
opaulomatias asked this question in Help
Replies: 1 comment
not sure about this one tbh, but it might be a limitation with the VRAM. ollama's auto-split should handle it, but some models just need more VRAM than it can offer. maybe check their docs or discord for any specific flags or settings.
Hello everyone,
I'm trying to set up Continue to run local models via Ollama, specifically qwen2.5-coder:7b, but I keep running into memory crashes when trying to use file context, and I'm hoping to find a way to properly balance the load between my VRAM and system RAM.

My Hardware:
- 4 GB VRAM
- 24 GB system RAM
The Problem:
If I run the 3B model, everything works perfectly. However, when I load the 7B model and try to use @index.html or @codebase, Continue instantly throws this error:

"llama runner process has terminated: exit status 2"

What I've Tried:
- Lowered the context window in config.yaml by setting num_ctx: 2048 for the 7B model, but it still crashes the moment I attach a file.
- Set num_gpu: 0. Same results.

My Question:
Since Ollama normally auto-splits models, is there a specific config.yaml configuration or Ollama parameter I can use to force the 7B model to use my 4 GB of VRAM for speed, while safely offloading the rest (and the context window) to my 24 GB of RAM without triggering the out-of-memory crash?

Any guidance on how to optimize this specific hardware split would be hugely appreciated!
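Not an answer from the thread, but one way to sketch an explicit split using standard Ollama Modelfile parameters: `num_gpu` in Ollama is the number of model layers offloaded to the GPU (not a boolean), so a partial value keeps the remaining layers in system RAM. The layer count of 12 below is a hypothetical starting point for a 4 GB card and would need tuning downward if the runner still crashes:

```
# Modelfile — hypothetical low-VRAM variant of qwen2.5-coder:7b
FROM qwen2.5-coder:7b

# Keep the context window small so the KV cache fits alongside the layers
PARAMETER num_ctx 2048

# Offload only 12 layers to the 4 GB GPU; the rest stay in system RAM
PARAMETER num_gpu 12
```

Build it with `ollama create qwen2.5-coder-7b-lowvram -f Modelfile`, then point Continue's model entry at `qwen2.5-coder-7b-lowvram` instead of the original tag.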