
perf(informer): add TransformFuncs to reduce cache memory usage #2667

Open
theakshaypant wants to merge 1 commit into tektoncd:main from theakshaypant:feat/add-repo-informer-cache

Conversation

@theakshaypant (Member) commented Apr 9, 2026

📝 Description of the Change

Add cache transform functions for the Repository and PipelineRun informers, stripping large unnecessary fields before objects enter the informer cache. Inspired by tektoncd/pipeline#9316.

For Repository objects, ManagedFields, Annotations and Status are stripped. The reconciler never reads Repository annotations or Status from the lister; Status is always fetched fresh via direct API call before updates.

For PipelineRun objects, ManagedFields and large Spec and Status fields are stripped. The watcher only needs Annotations, Spec.Status (pending check), Status.Conditions, and timing fields. All other data is fetched directly from the API when needed.

Benchmark results with production-realistic objects show an 89% JSON size reduction for Repository objects (5.6KB to 600B) and 94% for PipelineRun objects (10.7KB to 677B), with corresponding 8-10x reductions in heap allocation per cached object.

🔗 Linked GitHub Issue

N/A

🧪 Testing Strategy

  • Unit tests
  • Integration tests
  • End-to-end tests
  • Manual testing
  • Not Applicable

Ran a script to simulate a high-load environment on the watcher, creating ~5000 PipelineRuns with bloated annotations over 10 minutes. The resulting heap profile no longer shows the informer accounting for the majority of memory usage, as it did in the same test before this change.
(heap profile screenshot: watcher-memory-with-transformfunc)

🤖 AI Assistance

AI assistance can be used for various tasks, such as code generation, documentation, or testing. Please indicate whether you have used AI assistance for this PR and provide details if applicable.

  • I have not used any AI assistance for this PR.
  • I have used AI assistance for this PR.

Important

Slop will simply be rejected. If you are using AI assistance, you need to make sure you understand the generated code and that it meets the project's standards. You need to at least know how to run the code and deploy it (if needed). See
startpaac to make it easy
to deploy and test your code changes.

If the majority of the code in this PR was generated by an AI, please add a Co-authored-by trailer to your commit message.
For example:

Co-authored-by: Claude noreply@anthropic.com

✅ Submitter Checklist

  • 📝 My commit messages are clear, informative, and follow the project's How to write a git commit message guide. The Gitlint linter validates this in CI.
  • ✨ I have ensured my commit message prefix (e.g., fix:, feat:) matches the "Type of Change" I selected above.
  • ♽ I have run make test and make lint locally to check for and fix any
    issues. For an efficient workflow, I have considered installing
    pre-commit and running pre-commit install to
    automate these checks.
  • 📖 I have added or updated documentation for any user-facing changes.
  • 🧪 I have added sufficient unit tests for my code changes.
  • 🎁 I have added end-to-end tests where feasible. See README for more details.
  • 🔎 I have addressed any CI test flakiness or provided a clear reason to bypass it.
  • If adding a provider feature, I have filled in the following and updated the provider documentation:
    • GitHub App
    • GitHub Webhook
    • Gitea/Forgejo
    • GitLab
    • Bitbucket Cloud
    • Bitbucket Data Center

@theakshaypant changed the title from "perf(informer): add TransformFuncs to reduce cache memory usage" to "[WIP] perf(informer): add TransformFuncs to reduce cache memory usage" on Apr 9, 2026
@codecov-commenter commented Apr 9, 2026

⚠️ Please install the Codecov GitHub app to ensure uploads and comments are reliably processed by Codecov.

Codecov Report

❌ Patch coverage is 73.58491% with 14 lines in your changes missing coverage. Please review.
✅ Project coverage is 58.86%. Comparing base (de6de63) to head (9966baa).

Files with missing lines            | Patch % | Lines
pkg/reconciler/controller.go        | 0.00%   | 8 Missing ⚠️
pkg/informer/transform/transform.go | 86.66%  | 4 Missing and 2 partials ⚠️
❗ Your organization needs to install the Codecov GitHub app to enable full functionality.
Additional details and impacted files
@@            Coverage Diff             @@
##             main    #2667      +/-   ##
==========================================
+ Coverage   58.82%   58.86%   +0.04%     
==========================================
  Files         204      205       +1     
  Lines       20134    20186      +52     
==========================================
+ Hits        11844    11883      +39     
- Misses       7525     7536      +11     
- Partials      765      767       +2     

☔ View full report in Codecov by Sentry.

@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request introduces new cache transform functions (RepositoryForCache and PipelineRunForCache) in pkg/informer/transform to reduce memory usage by stripping unnecessary fields from Repository and PipelineRun objects before they are stored in informer caches. The pkg/reconciler/controller.go is updated to apply these transforms to the respective informers. The review comments point out a critical issue where the transform functions modify objects in-place, which could lead to race conditions or unexpected behavior. It is suggested to use DeepCopy() to operate on copies of the objects for safety.

Comment on lines +40 to +49
repo, ok := obj.(*pacv1alpha1.Repository)
if !ok {
return obj, nil
}

repo.ManagedFields = nil
repo.Annotations = nil
repo.Status = nil

return repo, nil
Severity: high

The transform function modifies the input object repo in-place. This can lead to race conditions or unexpected behavior if other parts of the system hold a reference to the original object. It's safer to operate on a copy of the object. Please add a repo.DeepCopy() at the beginning of the function body. The Tekton implementation this PR was inspired by also uses DeepCopy for safety.

repo, ok := obj.(*pacv1alpha1.Repository)
if !ok {
	return obj, nil
}
repo = repo.DeepCopy()

repo.ManagedFields = nil
repo.Annotations = nil
repo.Status = nil

return repo, nil

@theakshaypant (Member, Author) replied:
Intended behaviour to strip these fields from the cache.

Comment on lines +74 to +96
pr, ok := obj.(*tektonv1.PipelineRun)
if !ok {
return obj, nil
}

pr.ManagedFields = nil

// Strip large Spec fields — watcher only checks Spec.Status (pending state)
pr.Spec.PipelineRef = nil
pr.Spec.PipelineSpec = nil
pr.Spec.Params = nil
pr.Spec.Workspaces = nil
pr.Spec.TaskRunSpecs = nil
pr.Spec.TaskRunTemplate = tektonv1.PipelineTaskRunTemplate{}
pr.Spec.Timeouts = nil

// Strip large Status fields — watcher only reads Conditions, StartTime, CompletionTime
pr.Status.PipelineSpec = nil
pr.Status.ChildReferences = nil
pr.Status.Provenance = nil
pr.Status.SpanContext = nil

return pr, nil
Severity: high

The transform function modifies the input object pr in-place. This can lead to race conditions or unexpected behavior if other parts of the system hold a reference to the original object. It's safer to operate on a copy of the object. Please add a pr.DeepCopy() at the beginning of the function body. The Tekton implementation this PR was inspired by also uses DeepCopy for safety.

pr, ok := obj.(*tektonv1.PipelineRun)
if !ok {
	return obj, nil
}
pr = pr.DeepCopy()

pr.ManagedFields = nil

// Strip large Spec fields — watcher only checks Spec.Status (pending state)
pr.Spec.PipelineRef = nil
pr.Spec.PipelineSpec = nil
pr.Spec.Params = nil
pr.Spec.Workspaces = nil
pr.Spec.TaskRunSpecs = nil
pr.Spec.TaskRunTemplate = tektonv1.PipelineTaskRunTemplate{}
pr.Spec.Timeouts = nil

// Strip large Status fields — watcher only reads Conditions, StartTime, CompletionTime
pr.Status.PipelineSpec = nil
pr.Status.ChildReferences = nil
pr.Status.Provenance = nil
pr.Status.SpanContext = nil

return pr, nil

@theakshaypant (Member, Author) replied:
Intended behaviour to strip these fields from the cache.

@theakshaypant (Member, Author) commented:

Since the Repository Status field has already been deprecated, we could also target removing it altogether. That would, however, require a major refactoring effort in the e2e tests, which use the Repository status to confirm whether a PipelineRun has completed.

@theakshaypant changed the title from "[WIP] perf(informer): add TransformFuncs to reduce cache memory usage" to "perf(informer): add TransformFuncs to reduce cache memory usage" on Apr 10, 2026
@pipelines-as-code commented:

🤖 AI Analysis - pr-complexity-rating

To provide an accurate assessment, please provide the diff/file changes associated with PR #2667.

Since the current metadata only shows a merge commit and pipeline success, I cannot evaluate the specific code logic. However, based on the branch name feat/add-repo-informer-cache, here is the template ready for your input:

📊 PR Review Complexity

Dimension        | Score | Rationale
Size             | TBD   | Pending diff analysis
Logic complexity | TBD   | Pending cache implementation details
Risk             | TBD   | Caching layer changes often involve consistency/concurrency risks
Cross-cutting    | TBD   | Likely affects informer patterns and repository state
Test coverage    | TBD   | Needs verification of cache invalidation tests

Overall difficulty: [TBD]

Summary

[Awaiting diff] This PR appears to implement a cache for repository informers in pipelines-as-code. Reviewers should focus on cache invalidation logic, thread safety, and potential memory impact.

Suggested reviewers focus

  • Cache Lifecycle Management: Ensure cache expiration and invalidation logic prevent stale repository data.
  • Concurrency: Look for potential race conditions if the informer cache is accessed by multiple controller loops.

Please paste the code diff or file list to complete this triage.


Generated by Pipelines-as-Code LLM Analysis

@pipelines-as-code commented:

🤖 AI Analysis - pr-complexity-rating

Based on the provided metadata, this pull request appears to be a merge commit synchronizing a feature branch (feat/add-repo-informer-cache) with the main branch.

📊 PR Review Complexity

Dimension        | Score | Rationale
Size             | 1     | This is a merge commit; typically involves no direct code changes, only synchronization.
Logic complexity | 1     | No new logic introduced in this specific commit.
Risk             | 1     | Minimal risk as it is a sync of existing branches.
Cross-cutting    | 1     | Confined to branch synchronization.
Test coverage    | 5     | The CI pipeline (go-testing-dj2vn) passed successfully.

Overall difficulty: Easy

Summary

This PR is a merge commit from main into feat/add-repo-informer-cache. It serves to bring the feature branch up to date with the latest changes in the upstream repository.

Suggested reviewers focus

  • No code review is required for this specific commit. The reviewer should focus on verifying that the merge did not introduce any unexpected conflicts and that the feature branch is ready for final testing or integration.

Generated by Pipelines-as-Code LLM Analysis

@theakshaypant force-pushed the feat/add-repo-informer-cache branch from 58ed503 to 049275e on April 12, 2026 06:06
@theakshaypant (Member, Author) commented:

Push to see the PR Complexity Rating in action.

@theakshaypant force-pushed the feat/add-repo-informer-cache branch from 049275e to 9966baa on April 14, 2026 07:22
@chmouel (Member) commented Apr 14, 2026

@theakshaypant fyi i disabled it... need to do a rework of that feature

Add cache transform functions for the Repository and PipelineRun
informers, stripping large unnecessary fields before objects enter
the informer cache. Inspired by tektoncd/pipeline#9316.

For Repository objects, ManagedFields, Annotations and Status are
stripped. The reconciler never reads Repository annotations or
Status from the lister; Status is always fetched fresh via direct
API call before updates.

For PipelineRun objects, ManagedFields and large Spec and Status
fields are stripped. The watcher only needs Annotations, Spec.Status
(pending check), Status.Conditions, and timing fields. All other
data is fetched directly from the API when needed.

Benchmark results with production-realistic objects show an 89% JSON
size reduction for Repository objects (5.6KB to 600B) and 94% for
PipelineRun objects (10.7KB to 677B), with corresponding 8-10x
reductions in heap allocation per cached object.

Signed-off-by: Akshay Pant <akpant@redhat.com>
Assisted-by: Claude <noreply@anthropic.com>
@theakshaypant force-pushed the feat/add-repo-informer-cache branch from 9966baa to 781ae7a on April 17, 2026 08:25
@theakshaypant marked this pull request as ready for review on April 17, 2026 08:26