FFE: add delivery delay test proving Agent bypass gap#6396
Draft
leoromanovsky wants to merge 1 commit intomainfrom
Draft
FFE: add delivery delay test proving Agent bypass gap#6396leoromanovsky wants to merge 1 commit intomainfrom
leoromanovsky wants to merge 1 commit intomainfrom
Conversation
Adds a backend-mode FFE scenario and a test that demonstrates FFE_FLAGS is not delivered to the tracer within 5 seconds when the Agent uses a 60s background refresh interval. This codifies the root cause described in the RC fast-lane proposal: the Agent's cache bypass fires on new clients, not new products, so FFE_FLAGS sits undelivered until the next background poll.
Contributor
|
|
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Motivation
Datadog's feature flagging SDK (FFE) relies on Remote Configuration to deliver flag definitions to tracers. In production, tracers wait 60-130+ seconds at startup before flag configuration arrives. The root cause is a structural mismatch in the Agent: the RC cache bypass fires when a new client connects, but not when an existing client registers a new product. All tracer products (APM, ASM, FFE) share a single RC client. By the time FFE_FLAGS is registered, the client is already active -- no bypass fires, and the Agent only discovers FFE_FLAGS on its next background poll (50-60s later).
This PR adds a system-test that codifies this delay so we can prove the problem exists today, and later prove it is fixed when the Agent ships a product-aware bypass.
Changes
Decisions