Hosted MCP at chieflab.io/api/mcp
45 production tools focused on the launch-operator wedge. JSON-RPC 2.0, HTTP transport, llms.txt, .well-known/mcp.json, smithery.yaml. api.chieflab.io is the proxy-free Vercel-direct primary endpoint; chieflab.io/api/mcp is the brand-domain fallback.
proof: POST chieflab.io/api/mcp with tools/list — returns 45 tools (customer-smoke gates this number).
chieflab_launch_product end-to-end loop
Plan → create → approve → publish → measure → recommend, in real artifacts. X publish via Zernio (real post 69f8a74fc50416d0f77f852e). Email via Resend (real messageId ba122a9c-2843-4d38-b998-a345990911dc). 24h post-launch review pulls Zernio + GA4 + Search Console and recommends next iteration.
proof: CLOSED_LOOP_VERIFIED.md in repo lists every artifact ID.
Signed reviewUrl + workspace Inbox at /app
Two human-control surfaces, one backend. Signed HMAC reviewUrl (no login, 7-day TTL) for the agent-shareable approval. Workspace Inbox for owner-side multi-run control. Approve / reject / inline feedback — rejection feedback flows into per-tenant memory.
proof: Try it: POST chieflab.io/api/sandbox/launch and click the returned reviewUrl.
Live OAuth reads — GA4 + Search Console
Real OAuth flow, encrypted token storage, real fetch against analyticsdata.googleapis.com and webmasters/v3. Wired into chieflab_post_launch_review.
proof: Connect from /app/connections — read snapshot returns real data.
Approved publish — Zernio (social) + Resend (email)
Real per-platform publishing across LinkedIn / X / Threads / IG / FB / Bluesky / TikTok via Zernio. Real email send via Resend (mail.chieflab.io is verified; onboarding@resend.dev is the bootstrap fallback).
proof: Approve a publishAction or sendAction — the connector fires immediately.
Per-tenant brand context + voice memory
Inline brand: { audience, voice, pillars } threads into agent drafting prompts. Approved/rejected voice samples persist. The tenancy spine — every run, action, secret, memory entry is workspace+tenant scoped.
proof: chieflab_create_tenant + chieflab_set_tenant_context — second run uses the context.
Image generation (Gemini 2.5 Flash, brand-grounded)
Opt-in via imagesNeeded > 0. Three modes: brief / prompt for caller's image model (free, default), BYOK image gen (free), hosted Gemini ($0.04 / image). No surprise bills.
proof: Sandbox launch with imagesNeeded: 1 — returns a generated image URL.
8 install paths — verified configs
Cursor (one-click cursor:// deeplink), Claude Desktop, Codex, Lovable, Bolt, OpenClaw, Hermes, Base44. Direct HTTPS works for any agent that can fetch.
proof: Each config block is in /install/<runtime> and re-tested on every release.
50/50 customer-perspective smoke checks
npm run smoke:customer runs after every prod deploy and gates the Cloudflare Pages function bundle that proxies chieflab.io/api/* to the API. A documented prior regression makes this script load-bearing.
proof: Run scripts/customer-smoke.mjs — non-zero exit on any regression.
LaunchBench — public reproducible benchmark
Apache 2.0 benchmark in benchmark/. 20 real product URLs, 6-dimension rubric, deterministic LLM-judge prompt. Compares ChiefLab vs raw Sonnet, raw GPT-5, Sonnet+chieflab-launch skill.
proof: cd benchmark && node run.mjs — leaderboard.md is the artifact.
P10 scaffolding — Multi-agent GTM orchestrator + 8 agents
Eight specialized GTM sub-agents under ChiefMO (launch / social / email / blog-seo / creative / analytics / experiment / approval). Typed handoffs, per-tenant shared memory, declared dependencies. Orchestrator routes; agents never call each other directly. 8 MCP tools: chiefmo_gtm_run_start + 7 read/write helpers. Public agents page at /agents. Architecture spec: docs/proposals/p10-multi-agent-gtm.md. NB: this row is the SCAFFOLD only; the dogfood + design-partner proof is in Beta below.
proof: node scripts/gtm-smoke.mjs — orchestrator end-to-end smoke green. node scripts/security-mismatch-smoke.mjs — workspace isolation contract on every GTM tool green (18 assertions).
Stripe webhook (subscription state sync) — closes P7 revenue gap
POST /webhooks/stripe verifies the Stripe-Signature header (HMAC-SHA256, 5-min replay tolerance) and applies customer.subscription.{created,updated,deleted} + invoice.paid to chieflab_workspace_owners. Idempotent via chieflab_stripe_events. STRIPE_WEBHOOK_SECRET env required; STRIPE_PRICE_TO_PLAN env maps price ids to plan names.
proof: node scripts/stripe-webhook-smoke.mjs — 7/7 green (signature missing/malformed/stale/wrong-secret all 400; valid sig 200; replay 200; invoice.paid 200).
P21 — Cold-start usability sweep
Strategic fix: manualFallback default surfaces paste-ready briefs inline when zero connectors are wired (was: missing_connector error). Brand-scrape pre-pass + anti-fabrication rail (3 stacked rules: no refusals, no fabrication, honest degradation with _inferred_ markers). chieflab_help as first-contact tool. _meta.category on all tools so cold agents discover primary first. (The earlier experimental.progress capability was removed when the official MCP SDK rejected unknown experimental fields; see /status for the current initialize check.) chieflab_redraft for multi-turn revision (two-call pattern: brief → render → commit). Smart channel narrowing on thin context — drafts 1 channel instead of 5-7 when scrape is thin AND repoContext is light.
proof: Cold-stranger smoke 26/26 PASS post-deploy (scripts/cold-stranger-smoke.mjs). Spec ↔ implementation parity green (12/12 reason codes match). Drafts verified honest: zero fabrications, _inferred_ markers applied, no brand conflation.
P22.1 — BYOK + cost transparency
Provider-key vault: chieflab_set_provider_key / chieflab_list_provider_keys / chieflab_revoke_provider_key. Supported providers: gemini, resend, zernio, anthropic, openai. costEstimate on every chieflab_launch_product response with per-provider source (byok | hosted | no_key | n/a) + per-provider USD. Per-workspace daily cost cap (CHIEFLAB_DAILY_COST_CAP_USD, default $5) prevents runaway hosted bills.
proof: BYOK lifecycle verified live: set Gemini key → costEstimate.gemini.source: byok → revoke → costEstimate.gemini.source: hosted. Migration trail at supabase/migrations/202605120{100,300}_connector_secrets_*.sql.
P23.1 — Reply loop (chieflab-reply operator)
Inbound: chieflab_record_engagement (push from agent or webhook) → chieflab_inbox → chieflab_draft_reply (brain voice + launch context + anti-template rules) → approval gate → chiefmo_send_reply (channel adapter routes to Zernio for social / Resend for email). Brain integration: every approved reply becomes a voice sample. Same architecture for the outbound counterpart.
proof: 5 new tools (chieflab_record_engagement / chieflab_inbox / chieflab_draft_reply / chiefmo_send_reply / chieflab_dismiss_engagement) live on chieflab.io/api/mcp. Migration: supabase/migrations/202605120200_engagement_events.sql.
P24.1 — Outbound operator (chieflab-outbound)
Cold prospecting: chieflab_record_prospect → chieflab_outbound_inbox → chieflab_draft_outbound (brain voice + product scrape + anti-canned-opener rules + ≤120w cold-email format) → approval gate → chiefmo_send_outbound (Resend adapter; LinkedIn InMail / Twitter DM stubbed for later). When the prospect replies, the chieflab-reply loop catches it via chieflab_record_engagement. Closed selling loop end-to-end.
proof: Full outbound loop verified live with Resend test address delivered@resend.dev: 6/7 steps green; step 5 (real Resend send) requires workspace BYOK Resend key (config gap, not code). Migration: supabase/migrations/202605120400_outbound_prospects.sql.
P25.1 — Security audits + namespace cleanup + status/brain aggregators
Approval-gate adversarial audit (scripts/approval-gate-bypass-test.mjs): 4 executors × 7 bypass attempts — 0 breaches. reviewUrl HMAC adversarial audit (scripts/review-url-hmac-test.mjs): 8 attacks — 0 breaches. chieflab_* surface promoted to primary; legacy chiefmo_* names demoted to category=legacy (back-compat retained). chieflab_status: single aggregator returning pending approvals + ripe measurements + new engagements + queued prospects + recent launches + today's hosted spend. chieflab_brain_summary: plain-English narrative of what the workspace brain has learned (the moat made visible).
proof: scripts/approval-gate-bypass-test.mjs and scripts/review-url-hmac-test.mjs both exit 0. Audits at docs/APPROVAL_BYPASS_AUDIT_2026-05-12.md and docs/REVIEW_URL_HMAC_AUDIT_2026-05-12.md.