fix(accounts/keystore): fix flaky TestUpdatedKeyfileContents notification race #2231

Open
gzliudan wants to merge 1 commit into XinFinOrg:dev-upgrade from gzliudan:fix-xdc-TestUpdatedKeyfileContents
Conversation

@gzliudan
Collaborator

Proposed changes

TestUpdatedKeyfileContents was intermittently failing with:

  • Emptying account file failed
  • wasn't notified of new accounts

Root cause: waitForAccounts required the account list to match and a ks.changes notification to be readable at the same instant. Because the cache update can become visible before the notification is delivered on the channel, the check raced against channel delivery.

This change keeps the same timeout window but waits until both conditions are observed, which preserves test intent while removing the flaky timing dependency.
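
The approach described above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: fakeStore, waitForBoth, notifyAfter, the 10 ms polling interval, and the second error message are all hypothetical names and values chosen for the example, and the real helper works on the keystore's account type rather than strings.

```go
package main

import (
	"errors"
	"reflect"
	"time"
)

// fakeStore is a hypothetical stand-in for the keystore: it exposes a
// snapshot of the account list and a change-notification channel.
type fakeStore struct {
	accounts func() []string
	changes  chan struct{}
}

// waitForBoth polls until the account list has matched want AND a change
// notification has been received, in either order, or until timeout elapses.
func waitForBoth(s *fakeStore, want []string, timeout time.Duration) error {
	var haveAccounts, haveChange bool
	deadline := time.Now().Add(timeout)
	for time.Now().Before(deadline) {
		if !haveAccounts {
			haveAccounts = reflect.DeepEqual(s.accounts(), want)
		}
		if !haveChange {
			select {
			case <-s.changes:
				haveChange = true
			default:
				// No notification yet: keep polling instead of failing.
			}
		}
		if haveAccounts && haveChange {
			return nil
		}
		time.Sleep(10 * time.Millisecond) // assumed polling interval
	}
	if !haveChange {
		return errors.New("wasn't notified of new accounts")
	}
	return errors.New("account list never matched")
}

// notifyAfter delivers one change notification after delay d, simulating a
// notification that arrives later than the cache update becomes visible.
func notifyAfter(s *fakeStore, d time.Duration) {
	go func() {
		time.Sleep(d)
		s.changes <- struct{}{}
	}()
}
```

With this shape, a notification that arrives a few milliseconds after the account list already matches is still accepted, and the "wasn't notified" error is only reported once the timeout has elapsed.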

Validation:

  • go test ./accounts/keystore -run '^TestUpdatedKeyfileContents$' -count=100

Types of changes

What types of changes does your code introduce to the XDC network?
Put an x in the boxes that apply

  • build: Changes that affect the build system or external dependencies
  • ci: Changes to CI configuration files and scripts
  • chore: Changes that don't change source code or tests
  • docs: Documentation only changes
  • feat: A new feature
  • fix: A bug fix
  • perf: A code change that improves performance
  • refactor: A code change that neither fixes a bug nor adds a feature
  • revert: Revert something
  • style: Changes that do not affect the meaning of the code
  • test: Adding missing tests or correcting existing tests

Impacted Components

Which parts of the codebase does this PR touch?
Put an x in the boxes that apply

  • Consensus
  • Account
  • Network
  • Geth
  • Smart Contract
  • External components
  • Not sure (Please specify below)

Checklist

Put an x in the boxes once you have confirmed the actions below (or provide reasons for not doing so):

  • This PR has sufficient test coverage (unit/integration test) OR I have provided reason in the PR description for not having test coverage
  • Tested on a private network from the genesis block and monitored the chain operating correctly for multiple epochs.
  • Provide an end-to-end test plan in the PR description on how to manually test it on the devnet/testnet.
  • Tested the backwards compatibility.
  • Tested with XDC nodes running this version co-exist with those running the previous version.
  • Relevant documentation has been updated as part of this PR
  • N/A

Copilot AI left a comment

Pull request overview

This PR addresses flakiness in accounts/keystore’s TestUpdatedKeyfileContents by changing the waitForAccounts helper to decouple “accounts list matches expected” from “keystore change notification received”, removing a timing dependency between cache visibility and channel delivery.

Changes:

  • Updates waitForAccounts to wait (within the same timeout window) until both the expected account list is observed and a ks.changes notification is observed.
  • Defers the “wasn't notified of new accounts” failure until the timeout elapses (instead of failing immediately on a single check).

Comment on lines +74 to 86
    if !haveAccounts {
        list = ks.Accounts()
        haveAccounts = reflect.DeepEqual(list, wantAccounts)
    }
    if !haveChange {
        select {
        case <-ks.changes:
            haveChange = true
        default:
            return errors.New("wasn't notified of new accounts")
        }
    }
    if haveAccounts && haveChange {
        return nil

Copilot AI Mar 25, 2026

haveAccounts is latched true after the first time ks.Accounts() matches wantAccounts, and the helper stops re-reading the account list. Since the cache update path can expose intermediate states (e.g., scanAccounts processes updates as deleteByFile then add with separate locks), a transient match can be observed and then later diverge; this helper would still return success once a change is seen, or return the "wasn't notified" error even if accounts no longer match. Recompute list/equality each loop (or at least re-check immediately before returning and before deciding the final error) so success requires the current account list equals wantAccounts while still allowing the change and match to happen in either order.
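
One way to read this suggestion is sketched below: only the notification is latched, while the account list is re-read and compared on every pass, so success requires the current list to equal the expected one. This is an illustrative sketch under assumptions, not the PR's code: snapshotSource, waitForCurrentMatch, the interval parameter, and the second error message are hypothetical, and strings stand in for the keystore's account type.

```go
package main

import (
	"errors"
	"reflect"
	"time"
)

// snapshotSource is a hypothetical stand-in for the keystore.
type snapshotSource struct {
	list    func() []string
	changes chan struct{}
}

// waitForCurrentMatch succeeds only when a change notification has been seen
// and the account list read in the same pass equals want. The notification
// is latched; the list comparison is never latched.
func waitForCurrentMatch(src *snapshotSource, want []string, timeout, interval time.Duration) error {
	haveChange := false
	for t0 := time.Now(); time.Since(t0) < timeout; time.Sleep(interval) {
		// Re-read on every pass so a transient match cannot stick.
		matches := reflect.DeepEqual(src.list(), want)
		if !haveChange {
			select {
			case <-src.changes:
				haveChange = true
			default:
				// Not notified yet; try again on the next pass.
			}
		}
		if matches && haveChange {
			return nil
		}
	}
	if !haveChange {
		return errors.New("wasn't notified of new accounts")
	}
	return errors.New("account list does not match")
}
```

In this variant a list that matches once and then diverges before the notification arrives ends in an error rather than a success, while the match and the notification may still be observed in either order.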

@coderabbitai

coderabbitai bot commented Mar 25, 2026

Important

Review skipped

Auto reviews are disabled on base/target branches other than the default branch.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

@gzliudan gzliudan force-pushed the fix-xdc-TestUpdatedKeyfileContents branch from c3b0ad3 to 495921f Compare March 25, 2026 06:15