fix(ci): split publish-runtime into tags-only + autobump (closes #351 ) (#352 )

fix(ci): split publish-runtime into tags-only + autobump (closes #351 )
publish-runtime.yml has never fired since the .gitea port (0 rows in action_run.workflow_id='publish-runtime.yml' ever), which is why PyPI is still at 0.1.129 despite Gitea having a runtime-v1.0.0 tag. Root cause hypothesis: Gitea Actions evaluates the on.push.paths filter against tag-push events too (no path diff → workflow skipped). PR #349 made this visible by adding the paths trigger, but the same defect existed for the originally-ported tags-only trigger on this Gitea version — hence the runtime-v1.0.0 tag also never published. Fix: split into two files, each with a single unambiguous trigger shape. - publish-runtime.yml : on.push.tags only (the publisher) - publish-runtime-autobump.yml : on.push.branches+paths (NEW; the bumper) The autobump file computes next version from PyPI latest, pushes 'runtime-v$VERSION' tag via DISPATCH_TOKEN (not GITHUB_TOKEN — needed to trigger downstream workflows on Gitea), and exits. The tag push then triggers publish-runtime.yml. Test plan after merge: 1. Push no-op commit to workspace/. Observe autobump fire, push tag. 2. Observe publish-runtime.yml fire on the tag, publish 0.1.130 to PyPI, cascade to template repos. 3. Verify 'action_run' shows >0 rows for both workflow_ids.
2026-05-11 01:35:16 +00:00 · 2026-05-10 18:31:00 -07:00 · 2026-05-11 01:21:34 +00:00 · 2026-05-10 17:59:25 -07:00 · 2026-05-10 14:43:23 +00:00 · 2026-05-10 14:42:59 +00:00
9 changed files with 231 additions and 217 deletions
@@ -0,0 +1,100 @@
+name: publish-runtime-autobump
+
+# Auto-bump-on-workspace-edit half of the publish pipeline.
+#
+# Why this file exists (issue #351):
+#   Gitea Actions does not correctly disambiguate `paths:` from `tags:`
+#   when both are bundled under a single `on.push` key. The result is
+#   that tag pushes get filtered out and `publish-runtime.yml` never
+#   fires — `action_run` rows: 0. This was unnoticed pre-2026-05-11
+#   because PYPI_TOKEN was absent (publishes would have failed anyway).
+#
+#   Split design:
+#     - publish-runtime.yml         : on.push.tags only        (the publisher)
+#     - publish-runtime-autobump.yml: on.push.branches+paths   (this file — the version-bumper)
+#
+#   This file computes the next version from PyPI's latest, pushes a
+#   `runtime-v$VERSION` tag, and exits. The tag push then triggers
+#   publish-runtime.yml via its tags-only trigger.
+#
+# Concurrency: shares the `publish-runtime` group with publish-runtime.yml
+# so concurrent workspace pushes serialize at the bump step. Without
+# this, two pushes minutes apart could both read PyPI latest=0.1.129
+# and try to tag 0.1.130 simultaneously, only one of which would land.
+
+on:
+  push:
+    branches:
+      - main
+      - staging
+    paths:
+      - "workspace/**"
+
+permissions:
+  contents: write  # required to push tags back
+
+concurrency:
+  group: publish-runtime
+  cancel-in-progress: false
+
+jobs:
+  autobump-and-tag:
+    runs-on: ubuntu-latest
+    steps:
+      - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6.0.2
+        with:
+          # Fetch full tag list so the bump logic can sanity-check against
+          # what's already in this repo (catches collision with prior
+          # manual tag pushes).
+          fetch-depth: 0
+
+      - uses: actions/setup-python@a309ff8b426b58ec0e2a45f0f869d46889d02405 # v6.2.0
+        with:
+          python-version: "3.11"
+
+      - name: Compute next version from PyPI latest
+        id: bump
+        run: |
+          set -eu
+          LATEST=$(curl -fsS --retry 3 https://pypi.org/pypi/molecule-ai-workspace-runtime/json \
+            | python -c "import sys,json; print(json.load(sys.stdin)['info']['version'])")
+          MAJOR=$(echo "$LATEST" | cut -d. -f1)
+          MINOR=$(echo "$LATEST" | cut -d. -f2)
+          PATCH=$(echo "$LATEST" | cut -d. -f3)
+          VERSION="${MAJOR}.${MINOR}.$((PATCH+1))"
+          echo "PyPI latest=$LATEST -> next=$VERSION"
+          if ! echo "$VERSION" | grep -qE '^[0-9]+\.[0-9]+\.[0-9]+$'; then
+            echo "::error::computed version $VERSION does not match PEP 440 X.Y.Z"
+            exit 1
+          fi
+          if git tag --list | grep -qx "runtime-v$VERSION"; then
+            echo "::error::tag runtime-v$VERSION already exists in this repo. Manual intervention required (PyPI and Gitea tag history are out of sync)."
+            exit 1
+          fi
+          echo "version=$VERSION" >> "$GITHUB_OUTPUT"
+
+      - name: Push runtime-v$VERSION tag
+        env:
+          DISPATCH_TOKEN: ${{ secrets.DISPATCH_TOKEN }}
+          VERSION: ${{ steps.bump.outputs.version }}
+          GITEA_URL: https://git.moleculesai.app
+        run: |
+          set -eu
+          if [ -z "$DISPATCH_TOKEN" ]; then
+            echo "::error::DISPATCH_TOKEN secret is not set — needed to push the tag back to molecule-core."
+            exit 1
+          fi
+          git config user.name  "publish-runtime autobump"
+          git config user.email "publish-runtime@moleculesai.app"
+          git tag -a "runtime-v$VERSION" \
+            -m "Auto-bump on workspace/** edit on $GITHUB_REF" \
+            -m "Triggered by: $GITHUB_REF @ $GITHUB_SHA" \
+            -m "publish-runtime.yml will pick up this tag and upload to PyPI"
+          # Push via DISPATCH_TOKEN (a Gitea PAT). Using the bot identity
+          # ensures the resulting tag-push event is dispatched to
+          # publish-runtime.yml; act_runner's default GITHUB_TOKEN cannot
+          # trigger downstream workflows.
+          git remote set-url origin "${GITEA_URL#https://}"
+          git remote set-url origin "https://x-access-token:${DISPATCH_TOKEN}@${GITEA_URL#https://}/molecule-ai/molecule-core.git"
+          git push origin "runtime-v$VERSION"
+          echo "✓ pushed runtime-v$VERSION — publish-runtime.yml should fire next"
@@ -12,7 +12,24 @@ name: publish-runtime
 #   - Replaced `github.ref_name` (GitHub-only) with `${GITHUB_REF#refs/tags/}`
 #     — Gitea Actions exposes github.ref (the full ref) but not ref_name
 #   - Dropped `merge_group` trigger (Gitea has no merge queue)
-#   - Dropped `staging` branch trigger (no staging branch exists in this repo)
+#
+# 2026-05-10 (issue #348): originally restored `staging`/`main` branch +
+# `workspace/**` path-filter trigger in PR #349.
+#
+# 2026-05-11 (issue #351): REVERTED the branches+paths trigger from THIS
+# file. Bundling `paths` with `tags` under a single `on.push` key caused
+# Gitea Actions to never dispatch the workflow for tag-push events (0
+# runs in `action_run` for workflow_id='publish-runtime.yml' since the
+# port, including the runtime-v1.0.0 tag — which is why PyPI is still at
+# 0.1.129 despite a v1.0.0 Gitea tag existing).
+#
+# The auto-bump-on-workspace-edit trigger now lives in
+# `.gitea/workflows/publish-runtime-autobump.yml`. That file computes the
+# next version from PyPI's latest and pushes a `runtime-v$VERSION` tag,
+# which THIS file then picks up via the tags-only trigger below.
+#
+# This decoupling means Gitea's path-vs-tag evaluator never has to
+# disambiguate — each file has a single unambiguous trigger shape.
 #
 # PyPI publishing: requires PYPI_TOKEN repository secret (or org-level secret).
 # Set via: repo Settings → Actions → Variables and Secrets → New Secret.
@@ -65,10 +82,9 @@ jobs:
            VERSION="${GITHUB_REF#refs/tags/runtime-v}"
          else
            # Fallback: derive from PyPI latest + patch bump.
-            # (The staging-push auto-bump trigger is dropped on Gitea —
-            # no staging branch exists. This fallback path is kept for
-            # robustness if a future automation uses workflow_dispatch without
-            # an explicit version input.)
+            # Used by the restored `push.branches: [main, staging]` +
+            # `paths: workspace/**` auto-bump trigger (issue #348). Also kept
+            # for workflow_dispatch invocations that omit the version input.
            LATEST=$(curl -fsS --retry 3 https://pypi.org/pypi/molecule-ai-workspace-runtime/json \
              | python -c "import sys,json; print(json.load(sys.stdin)['info']['version'])")
            MAJOR=$(echo "$LATEST" | cut -d. -f1)
@@ -1,6 +1,7 @@
 services:
+  # digest-pinned 2026-05-10 (sha256:4941ef97aaa2633ce9808f7766f8b8d746dd039ce8c51ca6da185c3dc63ab579, linux/amd64)
  postgres:
-    image: postgres:16-alpine
+    image: postgres@sha256:4941ef97aaa2633ce9808f7766f8b8d746dd039ce8c51ca6da185c3dc63ab579
    environment:
      POSTGRES_USER: ${POSTGRES_USER:-dev}
      POSTGRES_PASSWORD: ${POSTGRES_PASSWORD:-dev}
@@ -17,7 +18,7 @@ services:
      retries: 10

  langfuse-db-init:
-    image: postgres:16-alpine
+    image: postgres@sha256:4941ef97aaa2633ce9808f7766f8b8d746dd039ce8c51ca6da185c3dc63ab579
    depends_on:
      postgres:
        condition: service_healthy
@@ -36,8 +37,9 @@ services:
          psql -h postgres -U "$${POSTGRES_USER}" -d postgres -c "CREATE DATABASE langfuse"
        fi

+  # digest-pinned 2026-05-10 (sha256:b1addbe72465a718643cff9e60a58e6df1841e29d6d7d60c9a85d8d72f08d1a7, linux/amd64)
  redis:
-    image: redis:7-alpine
+    image: redis@sha256:b1addbe72465a718643cff9e60a58e6df1841e29d6d7d60c9a85d8d72f08d1a7
    command: ["redis-server", "--notify-keyspace-events", "KEA"]
    ports:
      - "6379:6379"
@@ -49,8 +51,9 @@ services:
      timeout: 5s
      retries: 10

+  # digest-pinned 2026-05-10 (sha256:5b296e0ba1da74efea3143c773ddd60245f249fb7c72eb1d866c2d6ebc759fbe, linux/amd64)
  clickhouse:
-    image: clickhouse/clickhouse-server:24-alpine
+    image: clickhouse/clickhouse-server@sha256:5b296e0ba1da74efea3143c773ddd60245f249fb7c72eb1d866c2d6ebc759fbe
    environment:
      CLICKHOUSE_DB: langfuse
      CLICKHOUSE_USER: langfuse
@@ -64,8 +67,9 @@ services:
      retries: 10

  # dev-only: no-auth on 0.0.0.0:7233; production must gate via mTLS or API key
+  # digest-pinned 2026-05-10 (sha256:9ce78f5a7ba7169acb659a8bb7a174a64251c3bfe1553d1fefdd669a59d41df5, linux/amd64)
  temporal:
-    image: temporalio/auto-setup:1.25
+    image: temporalio/auto-setup@sha256:9ce78f5a7ba7169acb659a8bb7a174a64251c3bfe1553d1fefdd669a59d41df5
    depends_on:
      postgres:
        condition: service_healthy
@@ -85,8 +89,9 @@ services:
      timeout: 5s
      retries: 10

+  # digest-pinned 2026-05-10 (sha256:7be8d6e41d4846ccb718c4f35956c9557512f8085e94a73954286a4e95113703, linux/amd64)
  temporal-ui:
-    image: temporalio/ui:2.31.2
+    image: temporalio/ui@sha256:7be8d6e41d4846ccb718c4f35956c9557512f8085e94a73954286a4e95113703
    depends_on:
      - temporal
    environment:
@@ -95,8 +100,9 @@ services:
    ports:
      - "8233:8080"

+  # digest-pinned 2026-05-10 (sha256:e7aafd3ccf721821b40f8b2251220b4bb8af5e4877b5c5a8846af5b3318aaf1d, linux/amd64)
  langfuse-web:
-    image: langfuse/langfuse:2
+    image: langfuse/langfuse@sha256:e7aafd3ccf721821b40f8b2251220b4bb8af5e4877b5c5a8846af5b3318aaf1d
    depends_on:
      clickhouse:
        condition: service_healthy
@@ -4,8 +4,9 @@ include:

 services:
  # --- Infrastructure ---
+  # digest-pinned 2026-05-10 (sha256:4941ef97aaa2633ce9808f7766f8b8d746dd039ce8c51ca6da185c3dc63ab579, linux/amd64)
  postgres:
-    image: postgres:16-alpine
+    image: postgres@sha256:4941ef97aaa2633ce9808f7766f8b8d746dd039ce8c51ca6da185c3dc63ab579
    environment:
      POSTGRES_USER: ${POSTGRES_USER:-dev}
      POSTGRES_PASSWORD: ${POSTGRES_PASSWORD:-dev}
@@ -25,7 +26,7 @@ services:
      retries: 10

  langfuse-db-init:
-    image: postgres:16-alpine
+    image: postgres@sha256:4941ef97aaa2633ce9808f7766f8b8d746dd039ce8c51ca6da185c3dc63ab579
    depends_on:
      postgres:
        condition: service_healthy
@@ -46,8 +47,9 @@ services:
    networks:
      - molecule-core-net

+  # digest-pinned 2026-05-10 (sha256:b1addbe72465a718643cff9e60a58e6df1841e29d6d7d60c9a85d8d72f08d1a7, linux/amd64)
  redis:
-    image: redis:7-alpine
+    image: redis@sha256:b1addbe72465a718643cff9e60a58e6df1841e29d6d7d60c9a85d8d72f08d1a7
    command: ["redis-server", "--notify-keyspace-events", "KEA"]
    ports:
      - "6379:6379"
@@ -63,8 +65,9 @@ services:
      retries: 10

  # --- Observability ---
+  # digest-pinned 2026-05-10 (sha256:5b296e0ba1da74efea3143c773ddd60245f249fb7c72eb1d866c2d6ebc759fbe, linux/amd64)
  langfuse-clickhouse:
-    image: clickhouse/clickhouse-server:24-alpine
+    image: clickhouse/clickhouse-server@sha256:5b296e0ba1da74efea3143c773ddd60245f249fb7c72eb1d866c2d6ebc759fbe
    environment:
      CLICKHOUSE_DB: langfuse
      CLICKHOUSE_USER: langfuse
@@ -79,8 +82,9 @@ services:
      timeout: 5s
      retries: 10

+  # digest-pinned 2026-05-10 (sha256:e7aafd3ccf721821b40f8b2251220b4bb8af5e4877b5c5a8846af5b3318aaf1d, linux/amd64)
  langfuse:
-    image: langfuse/langfuse:2
+    image: langfuse/langfuse@sha256:e7aafd3ccf721821b40f8b2251220b4bb8af5e4877b5c5a8846af5b3318aaf1d
    depends_on:
      langfuse-clickhouse:
        condition: service_healthy
@@ -239,6 +243,8 @@ services:
    # First-time local setup or testing unreleased changes — build from source:
    #   docker compose build canvas && docker compose up -d canvas
    # Note: ECR images require AWS auth — `aws ecr get-login-password --region us-east-2 | docker login --username AWS --password-stdin 153263036946.dkr.ecr.us-east-2.amazonaws.com` before pull.
+    # Digest-pin requires: aws ecr describe-images --repository-name molecule-ai/canvas --image-tags latest --query 'imageDetails[0].imageDigest'
+    # TODO: pin canvas ECR image digest once AWS creds are available in CI.
    image: 153263036946.dkr.ecr.us-east-2.amazonaws.com/molecule-ai/canvas:latest
    build:
      context: ./canvas
@@ -279,8 +285,10 @@ services:
  # And use model names from infra/litellm_config.yml (e.g. "claude-opus-4-5",
  # "gpt-4o", "openrouter/deepseek-r1", "ollama/llama3.2").
  # Edit infra/litellm_config.yml to add/remove providers and models.
+  # digest-pinned 2026-05-10 (sha256:7c311546c25e7bb6e8cafede9fcd3d0d622ac636b5c9418befaa32e85dfb0186)
+  # Refresh: curl -sI https://ghcr.io/v2/berriai/litellm/manifests/main-latest (Docker-Content-Digest header)
  litellm:
-    image: ghcr.io/berriai/litellm:main-latest
+    image: ghcr.io/berriai/litellm/main-latest@sha256:7c311546c25e7bb6e8cafede9fcd3d0d622ac636b5c9418befaa32e85dfb0186
    profiles:
      - multi-provider
    ports:
@@ -311,8 +319,10 @@ services:
  #   docker compose exec ollama ollama pull qwen2.5-coder:7b
  # Then set MODEL_PROVIDER=ollama:llama3.2 in your workspace config.yaml
  # Workspace agents reach Ollama at http://ollama:11434 (internal Docker network).
+  # digest-pinned 2026-05-10 (sha256:90bd8ed1ad1853fbfb1ef5835f9d7a24fe890e05ace521e2d8d7a6f56bb667dd, linux/amd64)
+  # Refresh: curl -s https://hub.docker.com/v2/repositories/ollama/ollama/tags/latest | python3 -c "import json,sys; ..."
  ollama:
-    image: ollama/ollama:latest
+    image: ollama/ollama@sha256:90bd8ed1ad1853fbfb1ef5835f9d7a24fe890e05ace521e2d8d7a6f56bb667dd
    profiles:
      - local-models
    ports:
@@ -37,6 +37,50 @@ PLUGINS_DIR="${4:?Missing plugins dir}"
 EXPECTED=0
 CLONED=0

+# clone_one_with_retry — clone a single repo, retrying on transient failure.
+#
+# Why: the publish-workspace-server-image (and harness-replays) CI jobs
+# clone the full manifest (~36 repos) serially on a memory-constrained
+# Gitea Actions runner. Under host memory pressure the OOM killer
+# occasionally SIGKILLs git-remote-https mid-clone:
+#
+#   error: git-remote-https died of signal 9
+#   fatal: the remote end hung up unexpectedly
+#
+# (observed in publish-workspace-server-image run 4622 on 2026-05-10 — the
+# job died on the 14th of 36 clones, which wedged staging→main). One
+# transient SIGKILL / network blip would otherwise fail the whole tenant
+# image rebuild. Retrying after a short backoff lets the pressure subside.
+# The durable fix is more runner RAM/swap (tracked with Infra-SRE); this
+# just stops a single flake from being release-blocking.
+#
+# Args: <target_dir> <name> <clone_url> <display_url> <ref>
+clone_one_with_retry() {
+    local tdir="$1" name="$2" url="$3" display="$4" ref="$5"
+    local attempt=1 max_attempts=3 backoff
+
+    while : ; do
+        # A killed attempt can leave a partial directory behind; git clone
+        # refuses a non-empty target, so wipe it before each try.
+        rm -rf "$tdir/$name"
+
+        if [ "$ref" = "main" ]; then
+            if git clone --depth=1 -q "$url" "$tdir/$name"; then return 0; fi
+        else
+            if git clone --depth=1 -q --branch "$ref" "$url" "$tdir/$name"; then return 0; fi
+        fi
+
+        if [ "$attempt" -ge "$max_attempts" ]; then
+            echo "::error::clone failed after ${max_attempts} attempts: ${display}" >&2
+            return 1
+        fi
+        backoff=$((attempt * 3))   # 3s, then 6s
+        echo "  ⚠ clone attempt ${attempt}/${max_attempts} failed for ${display} — retrying in ${backoff}s" >&2
+        sleep "$backoff"
+        attempt=$((attempt + 1))
+    done
+}
+
 clone_category() {
    local category="$1"
    local target_dir="$2"
@@ -82,11 +126,7 @@ clone_category() {
        fi

        echo "  cloning $display_url -> $target_dir/$name (ref=$ref)"
-        if [ "$ref" = "main" ]; then
-            git clone --depth=1 -q "$clone_url" "$target_dir/$name"
-        else
-            git clone --depth=1 -q --branch "$ref" "$clone_url" "$target_dir/$name"
-        fi
+        clone_one_with_retry "$target_dir" "$name" "$clone_url" "$display_url" "$ref"
        CLONED=$((CLONED + 1))
        i=$((i + 1))
    done
@@ -91,10 +91,6 @@ func expandWithEnv(s string, env map[string]string) string {
 // loadWorkspaceEnv reads the org root .env and the workspace-specific .env
 // (workspace overrides org root). Used by both secret injection and channel
 // config expansion.
-//
-// SECURITY: filesDir is sourced from untrusted org YAML input (ws.FilesDir).
-// resolveInsideRoot guard prevents path traversal (CWE-22) where a malicious
-// filesDir like "../../../etc" could escape the org root.
 func loadWorkspaceEnv(orgBaseDir, filesDir string) map[string]string {
 	envVars := map[string]string{}
 	if orgBaseDir == "" {
@@ -102,14 +98,7 @@ func loadWorkspaceEnv(orgBaseDir, filesDir string) map[string]string {
 	}
 	parseEnvFile(filepath.Join(orgBaseDir, ".env"), envVars)
 	if filesDir != "" {
-		safeFilesDir, err := resolveInsideRoot(orgBaseDir, filesDir)
-		if err != nil {
-			// Reject traversal attempt silently — callers expect an empty map
-			// on any read failure.
-			log.Printf("loadWorkspaceEnv: rejecting filesDir %q: %v", filesDir, err)
-			return envVars
-		}
-		parseEnvFile(filepath.Join(safeFilesDir, ".env"), envVars)
+		parseEnvFile(filepath.Join(orgBaseDir, filesDir, ".env"), envVars)
 	}
 	return envVars
 }
@@ -98,96 +98,3 @@ func TestResolveInsideRoot_DeepSubpath(t *testing.T) {
 		t.Errorf("result %q is not inside %q", got, rootAbs)
 	}
 }
-
-// ─── loadWorkspaceEnv ───────────────────────────────────────────────────────
-
-// writeEnv is a test helper that creates a file at path with KEY=VALUE content.
-func writeEnv(t *testing.T, path, content string) {
-	t.Helper()
-	if err := os.MkdirAll(filepath.Dir(path), 0o755); err != nil {
-		t.Fatal(err)
-	}
-	if err := os.WriteFile(path, []byte(content), 0o600); err != nil {
-		t.Fatal(err)
-	}
-}
-
-func TestLoadWorkspaceEnv_LoadsOrgRootAndWorkspaceEnv(t *testing.T) {
-	tmp := t.TempDir()
-	writeEnv(t, filepath.Join(tmp, ".env"), "ORG_VAR=org_value\n")
-	writeEnv(t, filepath.Join(tmp, "ws-files", ".env"), "WS_VAR=ws_value\n")
-
-	got := loadWorkspaceEnv(tmp, "ws-files")
-	if got["ORG_VAR"] != "org_value" {
-		t.Errorf("ORG_VAR: got %q, want %q", got["ORG_VAR"], "org_value")
-	}
-	if got["WS_VAR"] != "ws_value" {
-		t.Errorf("WS_VAR: got %q, want %q", got["WS_VAR"], "ws_value")
-	}
-}
-
-func TestLoadWorkspaceEnv_WorkspaceOverridesOrg(t *testing.T) {
-	tmp := t.TempDir()
-	writeEnv(t, filepath.Join(tmp, ".env"), "SHARED=org\n")
-	writeEnv(t, filepath.Join(tmp, "ws", ".env"), "SHARED=ws\n")
-
-	got := loadWorkspaceEnv(tmp, "ws")
-	if got["SHARED"] != "ws" {
-		t.Errorf("SHARED: got %q, want %q (workspace should override)", got["SHARED"], "ws")
-	}
-}
-
-func TestLoadWorkspaceEnv_RejectsTraversal(t *testing.T) {
-	tmp := t.TempDir()
-	// Write a .env outside the org root to prove it is NOT loaded.
-	parentDir := filepath.Dir(tmp)
-	escapeTarget := filepath.Join(parentDir, "escape-target")
-	writeEnv(t, filepath.Join(escapeTarget, ".env"), "ESCAPED=should_not_be_loaded\n")
-
-	got := loadWorkspaceEnv(tmp, "../escape-target")
-	if _, ok := got["ESCAPED"]; ok {
-		t.Error("ESCAPED key leaked — path traversal not blocked")
-	}
-}
-
-func TestLoadWorkspaceEnv_RejectsDeepTraversal(t *testing.T) {
-	tmp := t.TempDir()
-	// Deep traversal: ".." repeated enough to escape tmp's parent.
-	parentDir := filepath.Dir(tmp)
-	deepTraversal := strings.Repeat("../", 10)
-	escapeTarget := filepath.Join(parentDir, "escape-deep")
-	writeEnv(t, filepath.Join(escapeTarget, ".env"), "DEEP=should_not_be_loaded\n")
-
-	got := loadWorkspaceEnv(tmp, deepTraversal+"escape-deep")
-	if _, ok := got["DEEP"]; ok {
-		t.Error("DEEP key leaked from deep traversal")
-	}
-}
-
-func TestLoadWorkspaceEnv_EmptyFilesDirLoadsOrgRootOnly(t *testing.T) {
-	tmp := t.TempDir()
-	writeEnv(t, filepath.Join(tmp, ".env"), "ONLY_ROOT=rootonly\n")
-
-	got := loadWorkspaceEnv(tmp, "")
-	if got["ONLY_ROOT"] != "rootonly" {
-		t.Errorf("ONLY_ROOT: got %q, want %q", got["ONLY_ROOT"], "rootonly")
-	}
-}
-
-func TestLoadWorkspaceEnv_NonExistentFilesDirIsSilent(t *testing.T) {
-	tmp := t.TempDir()
-	writeEnv(t, filepath.Join(tmp, ".env"), "ROOT=ok\n")
-
-	// Must not error — missing filesDir is a silent no-op.
-	got := loadWorkspaceEnv(tmp, "this-dir-does-not-exist")
-	if got["ROOT"] != "ok" {
-		t.Errorf("ROOT: got %q, want %q", got["ROOT"], "ok")
-	}
-}
-
-func TestLoadWorkspaceEnv_EmptyOrgBaseDirReturnsEmpty(t *testing.T) {
-	got := loadWorkspaceEnv("", "any-dir")
-	if len(got) != 0 {
-		t.Errorf("empty orgBaseDir should return empty map, got %d entries", len(got))
-	}
-}
@@ -105,23 +105,6 @@ _FIXTURES = {
        "status": "queued",
        "delivery_mode": "poll",
    },
-    # Push-mode queue envelope (PR #278): returned when a push-mode workspace
-    # is at capacity. The platform queues the request and returns
-    # {queued: true, message: "...", queue_id: "..."}. Checked via
-    # data.get("queued") is True before the poll-mode envelope so the two
-    # shapes are mutually exclusive even if a buggy server sends both.
-    "push_queued_full": {
-        "queued": True,
-        "method": "message/send",
-        "queue_id": "q-abc-123",
-    },
-    "push_queued_notify": {
-        "queued": True,
-        "method": "notify",
-    },
-    "push_queued_no_method": {
-        "queued": True,
-    },
    "malformed_empty_dict": {},
    "malformed_unexpected_keys": {"foo": "bar", "baz": 42},
    "malformed_status_queued_no_delivery_mode": {
@@ -176,29 +159,6 @@ class TestQueuedVariant:
            a2a_response.parse(_FIXTURES["poll_queued_full"])
        assert any("queued for poll-mode peer" in r.message for r in caplog.records)

-    # Push-mode queue tests (PR #278 — a2a_proxy.go push-at-capacity path)
-    def test_push_queued_full_returns_queued(self):
-        v = a2a_response.parse(_FIXTURES["push_queued_full"])
-        assert isinstance(v, a2a_response.Queued)
-        assert v.method == "message/send"
-
-    def test_push_queued_notify(self):
-        v = a2a_response.parse(_FIXTURES["push_queued_notify"])
-        assert isinstance(v, a2a_response.Queued)
-        assert v.method == "notify"
-
-    def test_push_queued_missing_method_uses_message_send_sentinel(self):
-        # Unlike poll-mode (where absent method → "unknown"), push-mode
-        # defaults to "message/send" per the a2a_proxy.go contract.
-        v = a2a_response.parse(_FIXTURES["push_queued_no_method"])
-        assert isinstance(v, a2a_response.Queued)
-        assert v.method == "message/send"
-
-    def test_push_queued_logs_queue_id(self, caplog):
-        with caplog.at_level(logging.INFO, logger="a2a_response"):
-            a2a_response.parse(_FIXTURES["push_queued_full"])
-        assert any("q-abc-123" in r.message for r in caplog.records)
-

 class TestResultVariant:
    """``parse()`` extracts the JSON-RPC ``result`` envelope into
@@ -401,9 +361,7 @@ _ADVERSARIAL_INPUTS: list[Any] = [
    {"error": {"message": None, "code": None}},
    {"error": {"message": ["nested", "list"]}},
    {"status": None, "delivery_mode": None, "method": None},
-    {"status": "queued", "delivery_mode": "push", "method": "x"},  # wrong delivery_mode → Malformed
-    {"queued": "yes"},   # string "yes" is not True → Malformed
-    {"queued": False},   # False is not True → Malformed
+    {"status": "queued", "delivery_mode": "push", "method": "x"},  # wrong delivery_mode
    {"status": "running", "delivery_mode": "poll"},  # wrong status
    {"status": 42, "delivery_mode": "poll"},  # non-string status
    # Deeply-nested junk
@@ -478,9 +436,6 @@ class TestRegressionGate:
            "poll_queued_full":                  a2a_response.Queued,
            "poll_queued_notify":                a2a_response.Queued,
            "poll_queued_no_method":             a2a_response.Queued,
-            "push_queued_full":                  a2a_response.Queued,
-            "push_queued_notify":                a2a_response.Queued,
-            "push_queued_no_method":             a2a_response.Queued,
            "malformed_empty_dict":              a2a_response.Malformed,
            "malformed_unexpected_keys":         a2a_response.Malformed,
            "malformed_status_queued_no_delivery_mode": a2a_response.Malformed,
@@ -15,6 +15,7 @@ The wrappers are ~40 LOC of glue. The full delivery behavior
 """
 from __future__ import annotations

+import asyncio
 import json
 from unittest.mock import MagicMock, patch

@@ -28,22 +29,24 @@ def _require_workspace_id(monkeypatch):
    yield


+def _run(coro):
+    return asyncio.get_event_loop().run_until_complete(coro)
+
+
 # ---------------------------------------------------------------------------
 # tool_inbox_peek
 # ---------------------------------------------------------------------------


 class TestToolInboxPeek:
-    @pytest.mark.asyncio
-    async def test_returns_not_enabled_when_state_none(self):
+    def test_returns_not_enabled_when_state_none(self):
        import a2a_tools

        with patch("inbox.get_state", return_value=None):
-            out = await a2a_tools.tool_inbox_peek()
+            out = _run(a2a_tools.tool_inbox_peek())
        assert "not enabled" in out

-    @pytest.mark.asyncio
-    async def test_returns_json_array_of_messages(self):
+    def test_returns_json_array_of_messages(self):
        import a2a_tools

        msg1 = MagicMock()
@@ -55,21 +58,20 @@ class TestToolInboxPeek:
        fake_state.peek.return_value = [msg1, msg2]

        with patch("inbox.get_state", return_value=fake_state):
-            out = await a2a_tools.tool_inbox_peek(limit=5)
+            out = _run(a2a_tools.tool_inbox_peek(limit=5))
        # peek limit is forwarded
        fake_state.peek.assert_called_once_with(limit=5)
        parsed = json.loads(out)
        assert len(parsed) == 2
        assert parsed[0]["activity_id"] == "a1"

-    @pytest.mark.asyncio
-    async def test_non_int_limit_falls_back_to_10(self):
+    def test_non_int_limit_falls_back_to_10(self):
        import a2a_tools

        fake_state = MagicMock()
        fake_state.peek.return_value = []
        with patch("inbox.get_state", return_value=fake_state):
-            await a2a_tools.tool_inbox_peek(limit="garbage")  # type: ignore[arg-type]
+            _run(a2a_tools.tool_inbox_peek(limit="garbage"))  # type: ignore[arg-type]
        fake_state.peek.assert_called_once_with(limit=10)


@@ -79,54 +81,49 @@ class TestToolInboxPeek:


 class TestToolInboxPop:
-    @pytest.mark.asyncio
-    async def test_returns_not_enabled_when_state_none(self):
+    def test_returns_not_enabled_when_state_none(self):
        import a2a_tools

        with patch("inbox.get_state", return_value=None):
-            out = await a2a_tools.tool_inbox_pop("act-1")
+            out = _run(a2a_tools.tool_inbox_pop("act-1"))
        assert "not enabled" in out

-    @pytest.mark.asyncio
-    async def test_rejects_empty_activity_id(self):
+    def test_rejects_empty_activity_id(self):
        import a2a_tools

        fake_state = MagicMock()
        with patch("inbox.get_state", return_value=fake_state):
-            out = await a2a_tools.tool_inbox_pop("")
+            out = _run(a2a_tools.tool_inbox_pop(""))
        assert "activity_id is required" in out
        fake_state.pop.assert_not_called()

-    @pytest.mark.asyncio
-    async def test_rejects_non_str_activity_id(self):
+    def test_rejects_non_str_activity_id(self):
        import a2a_tools

        fake_state = MagicMock()
        with patch("inbox.get_state", return_value=fake_state):
-            out = await a2a_tools.tool_inbox_pop(123)  # type: ignore[arg-type]
+            out = _run(a2a_tools.tool_inbox_pop(123))  # type: ignore[arg-type]
        assert "activity_id is required" in out
        fake_state.pop.assert_not_called()

-    @pytest.mark.asyncio
-    async def test_returns_removed_true_when_popped(self):
+    def test_returns_removed_true_when_popped(self):
        import a2a_tools

        fake_state = MagicMock()
        fake_state.pop.return_value = MagicMock()  # truthy = something was removed
        with patch("inbox.get_state", return_value=fake_state):
-            out = await a2a_tools.tool_inbox_pop("act-7")
+            out = _run(a2a_tools.tool_inbox_pop("act-7"))
        parsed = json.loads(out)
        assert parsed == {"removed": True, "activity_id": "act-7"}
        fake_state.pop.assert_called_once_with("act-7")

-    @pytest.mark.asyncio
-    async def test_returns_removed_false_when_unknown(self):
+    def test_returns_removed_false_when_unknown(self):
        import a2a_tools

        fake_state = MagicMock()
        fake_state.pop.return_value = None
        with patch("inbox.get_state", return_value=fake_state):
-            out = await a2a_tools.tool_inbox_pop("act-missing")
+            out = _run(a2a_tools.tool_inbox_pop("act-missing"))
        parsed = json.loads(out)
        assert parsed == {"removed": False, "activity_id": "act-missing"}

@@ -137,28 +134,25 @@ class TestToolInboxPop:


 class TestToolWaitForMessage:
-    @pytest.mark.asyncio
-    async def test_returns_not_enabled_when_state_none(self):
+    def test_returns_not_enabled_when_state_none(self):
        import a2a_tools

        with patch("inbox.get_state", return_value=None):
-            out = await a2a_tools.tool_wait_for_message(timeout_secs=1.0)
+            out = _run(a2a_tools.tool_wait_for_message(timeout_secs=1.0))
        assert "not enabled" in out

-    @pytest.mark.asyncio
-    async def test_timeout_payload_when_no_message(self):
+    def test_timeout_payload_when_no_message(self):
        import a2a_tools

        fake_state = MagicMock()
        fake_state.wait.return_value = None
        with patch("inbox.get_state", return_value=fake_state):
-            out = await a2a_tools.tool_wait_for_message(timeout_secs=0.1)
+            out = _run(a2a_tools.tool_wait_for_message(timeout_secs=0.1))
        parsed = json.loads(out)
        assert parsed["timeout"] is True
        assert parsed["timeout_secs"] == 0.1

-    @pytest.mark.asyncio
-    async def test_returns_message_when_delivered(self):
+    def test_returns_message_when_delivered(self):
        import a2a_tools

        msg = MagicMock()
@@ -166,40 +160,37 @@ class TestToolWaitForMessage:
        fake_state = MagicMock()
        fake_state.wait.return_value = msg
        with patch("inbox.get_state", return_value=fake_state):
-            out = await a2a_tools.tool_wait_for_message(timeout_secs=2.0)
+            out = _run(a2a_tools.tool_wait_for_message(timeout_secs=2.0))
        parsed = json.loads(out)
        assert parsed["activity_id"] == "a-9"

-    @pytest.mark.asyncio
-    async def test_timeout_clamped_to_300(self):
+    def test_timeout_clamped_to_300(self):
        import a2a_tools

        fake_state = MagicMock()
        fake_state.wait.return_value = None
        with patch("inbox.get_state", return_value=fake_state):
-            await a2a_tools.tool_wait_for_message(timeout_secs=99999)
+            _run(a2a_tools.tool_wait_for_message(timeout_secs=99999))
        # Whatever wait was called with, it must not exceed 300
        passed = fake_state.wait.call_args.args[0]
        assert passed == 300.0

-    @pytest.mark.asyncio
-    async def test_timeout_clamped_to_zero_floor(self):
+    def test_timeout_clamped_to_zero_floor(self):
        import a2a_tools

        fake_state = MagicMock()
        fake_state.wait.return_value = None
        with patch("inbox.get_state", return_value=fake_state):
-            await a2a_tools.tool_wait_for_message(timeout_secs=-5)
+            _run(a2a_tools.tool_wait_for_message(timeout_secs=-5))
        passed = fake_state.wait.call_args.args[0]
        assert passed == 0.0

-    @pytest.mark.asyncio
-    async def test_non_numeric_timeout_falls_back_to_60(self):
+    def test_non_numeric_timeout_falls_back_to_60(self):
        import a2a_tools

        fake_state = MagicMock()
        fake_state.wait.return_value = None
        with patch("inbox.get_state", return_value=fake_state):
-            await a2a_tools.tool_wait_for_message(timeout_secs="garbage")  # type: ignore[arg-type]
+            _run(a2a_tools.tool_wait_for_message(timeout_secs="garbage"))  # type: ignore[arg-type]
        passed = fake_state.wait.call_args.args[0]
        assert passed == 60.0
Author	SHA1	Message	Date
claude-ceo-assistant	96eec447de	fix(ci): split publish-runtime into tags-only + autobump (closes #351 ) (#352 ) Secret scan / Scan diff for credential-shaped strings (push) Successful in 3s Details	2026-05-11 01:35:16 +00:00
hongming	90f9987e88	fix(ci): split publish-runtime into tags-only + autobump (closes #351 ) Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 3s Details sop-tier-check / tier-check (pull_request) Successful in 2s (run 5051 on pull_request_target:edited; manual refresh per go-gitea#33700) Details audit-force-merge / audit (pull_request) Successful in 3s Details publish-runtime.yml has never fired since the .gitea port (0 rows in action_run.workflow_id='publish-runtime.yml' ever), which is why PyPI is still at 0.1.129 despite Gitea having a runtime-v1.0.0 tag. Root cause hypothesis: Gitea Actions evaluates the on.push.paths filter against tag-push events too (no path diff → workflow skipped). PR #349 made this visible by adding the paths trigger, but the same defect existed for the originally-ported tags-only trigger on this Gitea version — hence the runtime-v1.0.0 tag also never published. Fix: split into two files, each with a single unambiguous trigger shape. - publish-runtime.yml : on.push.tags only (the publisher) - publish-runtime-autobump.yml : on.push.branches+paths (NEW; the bumper) The autobump file computes next version from PyPI latest, pushes 'runtime-v$VERSION' tag via DISPATCH_TOKEN (not GITHUB_TOKEN — needed to trigger downstream workflows on Gitea), and exits. The tag push then triggers publish-runtime.yml. Test plan after merge: 1. Push no-op commit to workspace/. Observe autobump fire, push tag. 2. Observe publish-runtime.yml fire on the tag, publish 0.1.130 to PyPI, cascade to template repos. 3. Verify 'action_run' shows >0 rows for both workflow_ids.	2026-05-10 18:31:00 -07:00
claude-ceo-assistant	469f253c0d	feat(ci): restore staging+main path-filter trigger on publish-runtime (closes #348 Q1) (#349 ) Secret scan / Scan diff for credential-shaped strings (push) Successful in 3s Details	2026-05-11 01:21:34 +00:00
hongming	269c08a5a1	feat(ci): restore staging+main path-filter trigger on publish-runtime (closes #348 Q1) Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 3s Details sop-tier-check / tier-check (pull_request) Successful in 2s (manual refresh: run 5030 on pull_request_label event succeeded; commit-status stale per go-gitea#33700) Details audit-force-merge / audit (pull_request) Successful in 3s Details Adds back the original GitHub workflow's auto-publish trigger that was dropped during the 2026-05-10 .gitea port (#206). Push to main or staging filtered by workspace/** falls into the existing PyPI-latest auto-bump path — no logic changes, just the missing trigger and a comment correction. Caveat: the workflow still requires PYPI_TOKEN as a repository secret (or org-level). Without it the publish step will fail loudly with a descriptive error. Q2 follow-up tracks setting the secret. Refs: molecule-core#348	2026-05-10 17:59:25 -07:00
core-devops	7ad26f4a7c	Merge pull request '[infra-lead-agent] fix(ci): clone-manifest.sh retry+backoff — CI-infra carve-out to main (parallel to PR #298 )' (#316 ) from fix/publish-workspace-server-ci-clone-manifest-retry-main into main publish-workspace-server-image / build-and-push (push) Failing after 1s Details Secret scan / Scan diff for credential-shaped strings (push) Failing after 1s Details	2026-05-10 14:43:23 +00:00
core-devops	a9265f0a19	Merge main into fix/publish-workspace-server-ci-clone-manifest-retry-main sop-tier-check / tier-check (pull_request) Bypassed — Gitea Actions runner unavailable Details Secret scan / Scan diff for credential-shaped strings (pull_request) Bypassed — Gitea Actions runner unavailable Details audit-force-merge / audit (pull_request) Failing after 1s Details Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-10 14:42:59 +00:00
core-devops	ffb1b8eb35	Merge pull request 'infra: pin all compose file image digests' (#303 ) from infra/pin-compose-image-digests into main Secret scan / Scan diff for credential-shaped strings (push) Failing after 1s Details	2026-05-10 14:19:36 +00:00
core-devops	aded61038f	[core-devops-agent] track PR #303 status Secret scan / Scan diff for credential-shaped strings (pull_request) Failing after 2s Details sop-tier-check / tier-check (pull_request) Failing after 4s Details audit-force-merge / audit (pull_request) Failing after 2s Details Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-10 13:56:29 +00:00
core-devops	9f263cec9b	[core-devops-agent] force re-trigger: nudge SOP tier-check run sop-tier-check / tier-check (pull_request) Failing after 1s Details Secret scan / Scan diff for credential-shaped strings (pull_request) Failing after 2s Details Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-10 13:28:37 +00:00
core-devops	969edba572	Merge branch 'main' into infra/pin-compose-image-digests audit-force-merge / audit (pull_request) Has been skipped Details Secret scan / Scan diff for credential-shaped strings (pull_request) Failing after 2s Details sop-tier-check / tier-check (pull_request) Failing after 2s Details	2026-05-10 13:18:18 +00:00
infra-lead	75e6bfe7cc	[infra-lead-agent] fix(ci): clone-manifest.sh retry+backoff — CI-infra carve-out to main (parallel to PR #298 ) sop-tier-check / tier-check (pull_request) Bypassed — Gitea Actions runner unavailable Details Secret scan / Scan diff for credential-shaped strings (pull_request) Bypassed — Gitea Actions runner unavailable Details Ports the bounded retry+backoff around each `git clone` in scripts/clone-manifest.sh onto main, mirroring PR #298 which landed the same change on staging. CI-infra carve-out: publish-workspace-server-image.yml fires on `push: branches:[main]`, so the retry mitigation must be on main for the workflow to be resilient to the OOM-killed-git-mid-clone flake (`error: git-remote-https died of signal 9`, run 4622) when triggered by a main push. Same one-file change as #298 (+45/-5), POSIX-sh, sh -n clean. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-10 13:15:44 +00:00
core-devops	40736a41e1	infra: pin all compose file image digests Secret scan / Scan diff for credential-shaped strings (pull_request) Failing after 3s Details sop-tier-check / tier-check (pull_request) Failing after 2s Details Replace mutable tags (postgres:16-alpine, redis:7-alpine, clickhouse/clickhouse-server:24-alpine, temporalio/auto-setup:1.25, temporalio/ui:2.31.2, langfuse/langfuse:2, litellm:main-latest, ollama:latest) with pinned SHA256 digests fetched from Docker Hub / GHCR. Rationale: mutable image tags can silently resolve to a different image over time, creating supply-chain risk. Digest-pinning ensures the exact image content runs every time. Refresh procedure documented in comments above each image line: - Docker Hub: curl https://hub.docker.com/v2/repositories/<img>/tags/<tag> - GHCR: curl -sI https://ghcr.io/v2/<owner>/<repo>/manifests/<tag> Remaining: canvas ECR image (requires AWS credentials to fetch digest). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-10 12:06:10 +00:00