Orderbook native price estimators fallback by squadgazzz · Pull Request #4161 · cowprotocol/services

squadgazzz · 2026-02-16T10:52:37Z

Background

The orderbook's native price estimator is currently configured to use a Forwarder estimator, which is basically the Autopilot's API. In case Autopilot is down, quote competition can't proceed without native prices, and no new orders can be placed during that time.

Description

Adds an optional fallback native price estimator that kicks in when the primary estimator experiences sustained failures. This protects native price availability during primary estimator outages.

The fallback estimator tracks consecutive ProtocolInternal errors from the primary. After a configurable threshold (3 errors), it switches to the fallback estimator and periodically probes the primary to detect recovery.

Changes

New FallbackNativePriceEstimator, which wraps a primary and fallback estimator with automatic failover logic:
- Tracks consecutive ProtocolInternal errors on the primary
- Switches to fallback after 3 consecutive errors
- Probes primary every 60s while in fallback mode
- Recovers to primary when a probe succeeds, otherwise, continue using the fallback
Forwarder error mapping: HTTP send failures now return ProtocolInternal instead of a generic error, so the fallback estimator can detect them
New CLI argument --fallback-native-price-estimators on the orderbook to optionally configure fallback estimators
New factory method caching_native_price_estimator_from_inner to allow injecting a pre-built inner estimator (with fallback wrapping) into the caching layer

How to test

New unit and e2e tests.

gemini-code-assist

Code Review

This pull request introduces a fallback mechanism for the native price estimator to improve reliability during primary estimator outages. While the implementation is well-structured, includes comprehensive tests, and uses a state machine for failover, two critical areas require attention. A high-severity issue was found regarding the use of .unwrap() on a Mutex, which could lead to a service panic if the lock is poisoned; while panicking is acceptable for unrecoverable errors, using .expect() improves debuggability. Additionally, a potential 'thundering herd' issue was identified in the probe logic that could lead to a burst of requests when recovering from a failure. Addressing these concerns will significantly improve the robustness and stability of the system.

crates/shared/src/price_estimation/native/fallback.rs

jmg-duarte

Nits only, nice job

jmg-duarte · 2026-02-16T16:13:07Z

crates/shared/src/price_estimation/native/fallback.rs

+        let mut state = self.state.lock().unwrap();
+        if let Err(PriceEstimationError::ProtocolInternal(err)) = result {
+            let State::Primary {
+                consecutive_errors, ..
+            } = &mut *state
+            else {
+                return false;
+            };
+            *consecutive_errors += 1;
+            if *consecutive_errors >= CONSECUTIVE_ERRORS_THRESHOLD {
+                tracing::info!(
+                    ?err,
+                    "primary native price estimator down after {} consecutive errors, switching \
+                     to fallback",
+                    *consecutive_errors
+                );
+                *state = State::Fallback {
+                    last_probe: Instant::now(),
+                };
+                return true;
+            }
+            tracing::debug!(
+                ?err,
+                consecutive_errors = *consecutive_errors,
+                "primary native price estimator error, not yet switching to fallback"
+            );
+            false
+        } else {
+            if let State::Primary {
+                consecutive_errors, ..
+            } = &mut *state
+            {
+                *consecutive_errors = 0;
+            }
+            false
+        }
+    }


I find that using early returns will improve readability here

Suggested change

let mut state = self.state.lock().unwrap();

if let Err(PriceEstimationError::ProtocolInternal(err)) = result {

let State::Primary {

consecutive_errors, ..

} = &mut *state

else {

return false;

};

*consecutive_errors += 1;

if *consecutive_errors >= CONSECUTIVE_ERRORS_THRESHOLD {

tracing::info!(

?err,

"primary native price estimator down after {} consecutive errors, switching \

to fallback",

*consecutive_errors

);

*state = State::Fallback {

last_probe: Instant::now(),

};

return true;

}

tracing::debug!(

?err,

consecutive_errors = *consecutive_errors,

"primary native price estimator error, not yet switching to fallback"

);

false

} else {

if let State::Primary {

consecutive_errors, ..

} = &mut *state

{

*consecutive_errors = 0;

}

false

}

}

/// Returns `true` if the fallback should be used.

fn should_use_fallback(&self, result: &NativePriceEstimateResult) -> bool {

let mut state = self.state.lock().unwrap();

let State::Primary {

ref mut consecutive_errors,

} = *state

else {

return false;

};

let Err(err) = result else {

*consecutive_errors = 0;

return false;

};

*consecutive_errors += 1;

if *consecutive_errors >= CONSECUTIVE_ERRORS_THRESHOLD {

tracing::info!(

?err,

"primary native price estimator down after {} consecutive errors, switching \

to fallback",

*consecutive_errors

);

*state = State::Fallback {

last_probe: Instant::now(),

};

return true;

}

tracing::debug!(

?err,

consecutive_errors = *consecutive_errors,

"primary native price estimator error, not yet switching to fallback"

);

false

}

jmg-duarte · 2026-02-16T16:24:01Z

crates/shared/src/price_estimation/native/forwarder.rs

    crate::price_estimation::PriceEstimationError,
    alloy::primitives::Address,
-    anyhow::Context,
+    anyhow::Context as _,


Why change the import style?

jmg-duarte · 2026-02-16T16:31:21Z

crates/shared/src/price_estimation/native/fallback.rs

+    fn token() -> Address {
+        Address::with_last_byte(1)
+    }
+
+    fn timeout() -> Duration {
+        Duration::from_secs(5)
+    }


both these functions are const fns

Suggested change

fn token() -> Address {

Address::with_last_byte(1)

}

fn timeout() -> Duration {

Duration::from_secs(5)

}

const TOKEN: Address = Address::with_last_byte(1);

const TIMEOUT: Duration = Duration::from_secs(5);

squadgazzz added 6 commits February 16, 2026 09:22

Init

401c3ee

Consecutive errors

92aad9d

Minor simplification

1cc086a

Nits

beb9387

e2e test

265c5d1

Fix

541d48b

squadgazzz marked this pull request as ready for review February 16, 2026 13:46

squadgazzz requested a review from a team as a code owner February 16, 2026 13:46

gemini-code-assist bot reviewed Feb 16, 2026

View reviewed changes

crates/shared/src/price_estimation/native/fallback.rs Show resolved Hide resolved

crates/shared/src/price_estimation/native/fallback.rs Show resolved Hide resolved

squadgazzz added 2 commits February 16, 2026 13:51

Docs

90005ab

Fix

95de395

jmg-duarte approved these changes Feb 16, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Orderbook native price estimators fallback#4161

Orderbook native price estimators fallback#4161
squadgazzz wants to merge 8 commits intomainfrom
orderbook/native-prices-estimator-fallback

squadgazzz commented Feb 16, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

jmg-duarte left a comment

Uh oh!

jmg-duarte Feb 16, 2026

Uh oh!

jmg-duarte Feb 16, 2026

Uh oh!

jmg-duarte Feb 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

squadgazzz commented Feb 16, 2026

Background

Description

Changes

How to test

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

jmg-duarte left a comment

Choose a reason for hiding this comment

Uh oh!

jmg-duarte Feb 16, 2026

Choose a reason for hiding this comment

Uh oh!

jmg-duarte Feb 16, 2026

Choose a reason for hiding this comment

Uh oh!

jmg-duarte Feb 16, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants