Conversation
This PR will trigger a minor release when merged.
ramboz
left a comment
A few nitpicking comments, but looks good otherwise
solaris007
left a comment
Thanks for this, @rpapani - really solid work on a non-trivial piece of infrastructure. A few things I especially liked:
- The credential stripping from the remote URL after standard repo clone - nice defensive practice
- `mkdtempSync` for unique temp paths + the prefix validation in `cleanup()` to prevent accidental deletion of arbitrary paths
- Error sanitization for Bearer tokens and basic-auth URLs in git output
- `GIT_TERMINAL_PROMPT=0` to prevent Lambda hangs
- Thorough test suite covering both BYOG and standard flows
The main concerns are around the token lifecycle (the authorization_code grant + no expiry tracking combo could leave the client in an unrecoverable state) and credential exposure during push/pull for standard repos. See the inline comments for details.
One broader question: now that we have a Cloud Manager client that can talk to the CM Repo API, could we leverage this to also enrich our site detection? I'm thinking things like - does a site actually exist in CM, what's its configuration, which programs/repos are associated with it? That kind of context could really help with discovery and onboarding flows. Would be great to hear if that's on the radar or if the CM API surface supports it.
```js
client_id: clientId,
client_secret: clientSecret,
code: clientCode,
grant_type: 'authorization_code',
```
Two things here that I'd like to understand:
- `grant_type: 'authorization_code'` with a static `code` from env vars - in standard OAuth 2.0, authorization codes are single-use. If the cached token expires and we try to re-exchange the same code, IMS should reject it and the client can't recover. If this is an IMS-specific reusable code pattern (I've seen some Adobe service-to-service configs do this), could you add a comment explaining that? Otherwise, should this be `client_credentials`?
- The `expires_in` from the IMS response is ignored - the token gets cached forever on the instance. For warm Lambdas that can live for hours, we'd eventually use a stale token. Combined with #1, this means one expired token = dead client. Something like this would help:
```js
if (this.accessToken && Date.now() < this.tokenExpiresAt) {
  return this.accessToken;
}
```
grant_type: 'authorization_code' with a static code from env vars - in standard OAuth 2.0, authorization codes are single-use.
I'm able to use the same access_token for multiple API calls to CM, so I'm not sure this is single-use. The API response gives
```json
{
  "access_token": "eyJhbGciOiJSU...o4eSRY6a0Kg",
  "refresh_token": "eyJhbGciOiJSU..._doaZSAOICPn8pRjDgjhPs_Q",
  "token_type": "bearer",
  "expires_in": 86399
}
```
So I assumed this token is valid for the whole day; since a Lambda's lifetime is 15 min, I used the same grant_type the other clients in spacecat-shared are using.

Just realized that in cm-client I'm making the IMS call directly instead of leveraging the IMS client like in brand-client, though the IMS client also uses the same authorization_code.

To be consistent with the other clients, I'll change cm-client to use the IMS client - please let me know if I should avoid using authorization_code indirectly through the IMS client.
The expires_in from the IMS response is ignored - the token gets cached forever on the instance
good point, will fix
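For that fix, something along these lines might work - a sketch only, with hypothetical field names, a pluggable `fetchToken` standing in for the actual IMS call, and a 60s refresh skew so a token never expires mid-request:

```javascript
// Sketch: cache the IMS token together with its expiry so warm Lambdas
// refresh instead of reusing a stale token. Response field names follow
// the IMS payload shown above (access_token, expires_in).
class TokenCache {
  constructor(fetchToken) {
    this.fetchToken = fetchToken; // async () => ({ access_token, expires_in })
    this.accessToken = null;
    this.tokenExpiresAt = 0;
  }

  async getAccessToken(now = Date.now()) {
    const skewMs = 60 * 1000; // refresh a minute early
    if (this.accessToken && now < this.tokenExpiresAt - skewMs) {
      return this.accessToken;
    }
    const { access_token: token, expires_in: expiresIn } = await this.fetchToken();
    this.accessToken = token;
    this.tokenExpiresAt = now + expiresIn * 1000;
    return token;
  }
}
```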
Thanks for clarifying, @rpapani - and good catch on the existing IMS client.
So I dug into this a bit more. The authorization_code with a static code is indeed an Adobe IMS convention for service-to-service integrations - it's not a standard OAuth single-use code, it's more like a long-lived service credential that IMS lets you exchange repeatedly. All existing SpaceCat clients use this same pattern, so you're in good company.
Switching to use spacecat-shared-ims-client is definitely the right call - centralizes the IMS integration and keeps things consistent with brand-client and others. One thing to be aware of though: the IMS client has the same token expiry gap (caches the token but never checks expires_in). For a 15-min Lambda with a 24h token that's fine in practice, but ideally we'd fix that in the IMS client itself so all consumers benefit. That can be a separate effort though - no need to block this PR on it.
One broader heads-up worth flagging for the team: Adobe's legacy JWT/Service Account credentials have reached end of life as of June 30, 2025, with a hard March 1, 2026 deadline for forced auto-conversion (migration guide, JWT deprecation notice). The recommended server-to-server pattern going forward is grant_type: client_credentials via /ims/token/v3 (implementation guide) - which the IMS client already supports via getServiceAccessTokenV3(). The v4 authorization_code endpoint may be tied to the legacy credential type. Might be worth checking whether the CM Repo Service TA supports OAuth Server-to-Server credentials, and if so, using getServiceAccessTokenV3() instead. That way we're future-proof and don't have to scramble when the deadline hits.
tl;dr:
- Yes, switch to IMS client - good idea
- The `authorization_code` pattern is fine for now, it's an established Adobe IMS thing
- Consider using `getServiceAccessTokenV3()` (`client_credentials`) if the TA supports it - the v4 path may stop working after March 2026
- Token expiry tracking can be fixed in the IMS client as a separate PR
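For reference, the `client_credentials` request shape for `/ims/token/v3` would look roughly like this - a sketch of the request builder only (the host is Adobe's usual IMS endpoint; the scope value and function name here are placeholders):

```javascript
// Sketch: build a form-encoded OAuth Server-to-Server token request for
// Adobe IMS /ims/token/v3. Parameter names follow standard OAuth 2.0
// client_credentials; scope and helper name are illustrative.
function buildV3TokenRequest({ clientId, clientSecret, scope }) {
  const body = new URLSearchParams({
    grant_type: 'client_credentials',
    client_id: clientId,
    client_secret: clientSecret,
    scope,
  });
  return {
    url: 'https://ims-na1.adobelogin.com/ims/token/v3',
    method: 'POST',
    headers: { 'Content-Type': 'application/x-www-form-urlencoded' },
    body: body.toString(),
  };
}
```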
Consider using getServiceAccessTokenV3() (client_credentials) if the TA supports it - the v4 path may stop working after March 2026
when I'm using POST /ims/token/v4 with grant_type: client_credentials, it's failing with

```json
{
  "error": "unauthorized_client"
}
```
Can I use v4 for now and once I get input from CM team, will switch to oauth?
That unauthorized_client error makes total sense - it means the CM Repo Service Technical Account is provisioned as a legacy "Service Account (JWT)" credential in Adobe Developer Console, not as an "OAuth Server-to-Server" credential. Only the OAuth S2S credential type supports client_credentials.
Using authorization_code via ImsClient (which is what getServiceAccessToken() does) is perfectly fine for now - that's what all other SpaceCat clients use. No need to change anything for this PR.
When you hear back from the CM team, the path forward would be:
- Someone with Dev Console access adds an "OAuth Server-to-Server" credential to the CM Repo Service integration
- Switch from `getServiceAccessToken()` to `getServiceAccessTokenV3()` (which uses `/ims/token/v3` with `client_credentials`)
- Configure the `IMS_SCOPE` env var (ImsClient picks it up for the v3 flow)
But that's a separate effort entirely. The authorization_code flow isn't tied to the JWT deprecation timeline (it's a different credential type), so there's no urgency here.
```js
this.log.info(`Cloning CM repository: program=${programId}, repo=${repositoryId}, type=${repoType}`);

const args = await this.#buildAuthGitArgs('clone', programId, repositoryId, { imsOrgId, repoType, repoUrl });
this.#execGit([...args, clonePath]);
```
If the clone throws here (network timeout, auth failure, etc.), the mkdtempSync-created directory at clonePath gets leaked. applyPatch handles this nicely with try/finally below - would be great to do the same here. Lambda's /tmp is 512MB by default, and with retries those orphaned dirs could pile up.
In applyPatch, the tmp file is a patch file, so I clean it up in a finally block; but in the case of clone, the tmp directory holds the cloned repo, so it shouldn't be cleaned until the client has used the cloned path, either for applying a patch, zipping, etc. I have cleanup(clonePath), which consumers can call to clean this up.

I can clean the tmp directory when there's an exception if you want it cleaned, but it's in ephemeral storage, so I'm not sure that's really important.
Lambda's /tmp is 512MB by default, and with retries those orphaned dirs could pile up.
Good point, I've thought about this. I've seen some repos at 460 MB, so we'll run into issues with repos that size. We don't seem to have a way to configure this in hedy, so I'm planning to open a pull request in hedy to support it and then update our workers with that configuration. I'm thinking of using 1-2 GB for the ephemeral storage - would that be fine?
To clarify - I'm not suggesting cleaning up the happy path. You're absolutely right that the clone dir is the output and the consumer owns its lifecycle via cleanup(clonePath). That design is solid.
The suggestion is specifically about the error path: if #execGit throws during the clone itself, the mkdtempSync-created directory is already on disk but contains only a partial or empty clone that no caller will ever use. Since the method throws, the caller never gets clonePath back and has no way to call cleanup() on it. unzipRepository already handles this pattern nicely - it cleans up extractPath in its catch block. Something like:
```js
try {
  this.#execGit([...args, clonePath]);
} catch (error) {
  rmSync(clonePath, { recursive: true, force: true });
  throw error;
}
```

On ephemeral storage - good call on the hedy PR. For sizing, think about peak concurrent usage: clone (~460 MB) + .git history, and if unzipRepository runs in the same invocation (zip on disk + extracted tree simultaneously), that can spike to ~1 GB easily. I'd recommend 3 GB as a starting point - comfortable headroom without being wasteful. AWS Lambda supports up to 10 GB, so there's room to grow. Monitoring TmpStorageUsed in CloudWatch would give you the data to right-size it.
Also worth considering - could we reduce clone size with git flags for flows that don't need full history? The import flow needs .git history for downstream, but the autofix flow (clone -> branch -> patch -> push -> PR) likely doesn't. Options like --depth 1 --single-branch for autofix clones could cut that 460 MB down dramatically and ease the /tmp pressure. Even for the import flow, --single-branch --no-tags could help if downstream only needs one branch's history.
could we reduce clone size with git flags for flows that don't need full history?
git history is required for our learning agent to understand whether the suggestions are taken as-is or updated, what code changes contributed to page performance, customer coding style, etc. Most of the time we need the production branch; not sure if the other branches are required atm. Can I do this optimization in a separate ticket?
I think we can be more granular with that requirement.
We can probably limit the full git history to the main branch and only for the onboarding phase to build the project knowledge and best practices once.
For regular fixes, we can reuse the learned best practices as-is and potentially just update them every 6 months or so. We could plan for the usual smaller storage and only use a larger one for those "edge cases", if that helps keep COGS lower.
Totally makes sense - full history for the learning agent is a legit requirement. Commit-level analysis of whether suggestions were adopted, coding style patterns, performance impact - you need that history for all of that.
Separate ticket sounds good, no need to block this PR.
One thought for that ticket though - even keeping full history, --single-branch could be a quick win if you typically only need the production branch. It fetches the complete history for that branch but skips remote tracking refs for all the others. Paired with --no-tags, it could meaningfully reduce transfer size without losing any commit history on the branch you care about.
And since these flags are additive, you could configure them per-flow: full clone for the learning agent, --depth 1 --single-branch for autofix (where you just need a working copy to apply a patch and push), etc.
```js
async push(clonePath, programId, repositoryId, {
  imsOrgId, repoType, repoUrl, ref,
} = {}) {
  const pushArgs = await this.#buildAuthGitArgs('push', programId, repositoryId, { imsOrgId, repoType, repoUrl });
```
Nice job stripping creds from the remote URL after clone (line 294) - but for push and pull, #buildAuthGitArgs re-embeds user:token in the URL and doesn't clean up afterward. The creds end up in /proc/PID/cmdline while git runs.
Any reason not to use the same -c http.extraheader approach with Authorization: Basic <base64> for standard repos too? That would make all repo types consistent and avoid creds in the URL entirely.
Any reason not to use the same -c http.extraheader approach with Authorization: Basic for standard repos too? That would make all repo types consistent and avoid creds in the URL entirely.
Great suggestion - I didn't think of this. Tried it locally and it worked great; will implement.
but for push and pull, #buildAuthGitArgs re-embeds user:token in the URL and doesn't clean up afterward. The creds end up in /proc/PID/cmdline while git runs.
even with -c extraheader, this problem is still there, right? To avoid it entirely, should I use the GIT_CONFIG_COUNT approach (like below), or is there a better approach?
```
GIT_CONFIG_COUNT=2
GIT_CONFIG_KEY_0=http.extraheader
GIT_CONFIG_VALUE_0=Authorization: Bearer TOKEN
GIT_CONFIG_KEY_1=http.extraheader
GIT_CONFIG_VALUE_1=x-api-key: my-key
```
Great instinct - you're right that -c http.extraheader still exposes credentials in /proc/PID/cmdline (and ps output), so from a cmdline perspective it's the same exposure as URL-embedded creds.
GIT_CONFIG_COUNT is exactly the mechanism designed to solve this. It was added in Git 2.31.0 (commit d8d77153) specifically for this reason - the commit message even calls out the ps(1) credential leakage from -c args. With env vars, credentials live in /proc/PID/environ which is restricted to same-UID/root (needs CAP_SYS_PTRACE), while /proc/PID/cmdline is world-readable. That's a meaningful step up.
One thing to verify first though: which git version does our Lambda layer provide? The public lambci/git-lambda-layer ships Git 2.29.0, and GIT_CONFIG_COUNT needs 2.31+. If we're on 2.29, the env vars would be silently ignored and auth would just fail. Worth checking git --version in a Lambda exec to confirm.
For this PR, I think the -c http.extraheader approach for standard repos (which you've already agreed to) is the right move - it's a clear improvement over URL-embedded creds, works with any git version, and makes all repo types consistent. The /proc/PID/cmdline exposure is a real but low-severity concern in Lambda's threat model: no multi-tenant process namespace, and the execution is ephemeral.
If we want to close that gap later, there are two options:
- `GIT_CONFIG_COUNT` - cleanest, no file I/O, but needs git >= 2.31
- Temp config file with `include.path` - works with any git version, similar to what GitHub Actions v6 does
Either way, that's a follow-up - the -c extraheader approach is solid for this PR.
solaris007
left a comment
Nice work on the update, @rpapani - you addressed every item from the first review round. The ImsClient integration, extraheader-based auth for both repo types, timeout handling, sanitization, and clone error cleanup all look solid. Good stuff.
One non-blocking thought for a follow-up:
In-memory zipRepository - archiver currently buffers the entire ZIP into memory via Buffer.concat(chunks). For a 460 MB repo with .git history, the compressed buffer plus the archiver working set could push past Lambda's default 1 GB memory limit (especially if the clone is also still on disk at that point). Might be worth considering a stream-to-S3 approach or writing the zip to /tmp first and then uploading, so memory stays flat regardless of repo size. Not a blocker for this PR - just something to keep an eye on once you're dealing with the larger repos in prod.
@solaris007, @ramboz here's the hedy PR - adobe/helix-deploy#890, it'll be great if you can provide the feedback on that.
For ASO auto-fix purposes, we need the authorUrl (for content opportunities) and the code repo (for code opportunities) used in the prod environment. If we know the program id from the customer domain, then we can obtain the environment, author URL, and code repo/branch using CM APIs; the manual process is documented in this wiki. We're working with the CM team to get access to CM APIs for our
…path of the code for the site
🎉 This PR is included in version @adobe/spacecat-shared-cloud-manager-client-v1.0.0 🎉 The release is available on:

Your semantic-release bot 📦🚀
## [@adobe/spacecat-shared-data-access-v3.2.0](https://github.com/adobe/spacecat-shared/compare/@adobe/spacecat-shared-data-access-v3.1.0...@adobe/spacecat-shared-data-access-v3.2.0) (2026-02-21)

### Features

* cloud manager client ([#1335](#1335)) ([2e4b013](2e4b013))
🎉 This PR is included in version @adobe/spacecat-shared-data-access-v3.2.0 🎉 The release is available on:

Your semantic-release bot 📦🚀
`spacecat-shared-cloud-manager-client` — Shared client for Adobe Cloud Manager: git operations (clone, zip, branch, apply patch from S3, push) and CM Repo REST API (create pull request). Supports BYOG (GitHub, Bitbucket, GitLab, Azure DevOps) and standard (non-BYOG) repos. Used by code-import and autofix flows.

Requirements
API
Security notes:
- Temp directories are created with `mkdtempSync` under the OS temp directory, producing unique, unpredictable paths safe from symlink attacks and concurrent-run collisions.
- After cloning, `git remote set-url origin <repoUrl>` strips basic-auth credentials from the stored remote, so `git remote -v` never exposes secrets.
- Temporary patch files are cleaned up in a `finally` block.
- Bearer tokens in git output are sanitized to `[REDACTED]`, and basic-auth credentials in URLs are masked with `***`.

Validation Status (In Progress)
Example Code config for test site in dev