Product · QA
@Probe evaluate ticket PROJ-15792 for completeness
Status: NOT READY — There are blockers that must be addressed before implementation. Issues Found (16): missing acceptance criteria, no error handling spec, incomplete edge cases, no performance requirements defined...
Engineering
Upstream cache control is not working as described in the ticket — is this actually a bug?
Yes, this is a bug in the response caching logic. The issue lies in how the gateway handles the absence of upstream cache control headers. The problematic code is in gateway/res_cache.go — it incorrectly defaults to caching...
Engineering · QA
If I adopt your fix, what tests should I write to prove it?
1. Test Case: Cache Control Enabled, Header ABSENT (Verifies the Bug Fix)
2. Test Case: Header PRESENT (Verifies Correct Behavior)
3. Test Case: Control Disabled (Verifies No Regression)
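A table-driven Go sketch of those three cases, assuming hypothetical names — shouldCache and the caching semantics when control is disabled are guesses at what gateway/res_cache.go exposes, and upstreamCacheHeader is the constant referenced elsewhere in this thread:

    package gateway

    import (
        "net/http"
        "testing"
    )

    // TestUpstreamCacheControl covers the three cases above. Assumed
    // semantics: with control enabled, cache only when the upstream
    // header is present; with control disabled, the default is to cache.
    func TestUpstreamCacheControl(t *testing.T) {
        cases := []struct {
            name           string
            controlEnabled bool
            headerValue    string // "" means the header is absent
            wantCache      bool
        }{
            {"enabled, header absent (bug fix)", true, "", false},
            {"enabled, header present", true, "max-age=60", true},
            {"disabled (no regression)", false, "", true},
        }
        for _, tc := range cases {
            t.Run(tc.name, func(t *testing.T) {
                res := &http.Response{Header: http.Header{}}
                if tc.headerValue != "" {
                    res.Header.Set(upstreamCacheHeader, tc.headerValue)
                }
                if got := shouldCache(res, tc.controlEnabled); got != tc.wantCache {
                    t.Errorf("shouldCache = %v, want %v", got, tc.wantCache)
                }
            })
        }
    }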
Engineering
How does the developer portal connect to the admin dashboard, and how are API keys created?
1. Portal-Dashboard Connection: The Portal acts as a client to the Dashboard API via REST. 2. API Key Creation Flow: When a developer requests a key through the portal UI, it sends a request to the Dashboard API, which generates the key object and stores it in Redis...
Product · UX
Which customers are interested in JWT authentication? Pull from Zendesk, Jira, and onboarding docs.
Acme Corp: Looking to migrate to JWT long term. GlobalTech: Uses JWT with tokens minted by Auth0. MegaSoft: JWT listed as a potential auth method. DataFlow Inc: Strategy is JWT Auth with Okta and Entra ID...
Engineering
Memory consumption increased between v5.8.2 and v5.8.5 — what could cause it?
Based on the code changes between v5.8.2 and v5.8.5, here are PRs that might be related: PR #2847 modified buffer handling in the request pipeline, PR #2891 changed connection pooling defaults, PR #2903 added eager-loading...
Product
Break this refined ticket down into small engineering tasks with story point estimates
Task 1: Fix the cache control logic in gateway/res_cache.go
• Estimated Effort: 1 Point (one-line code change)
Task 2: Add regression tests for cache behavior
• Estimated Effort: 3 Points
Task 3: Update API docs
• Estimated Effort: 1 Point
Technical Writer
Build release notes for v5.10.1 from the merged PRs and Jira tickets
Release Notes for v5.10.1
Changed: Upgraded Go version to enhance security.
Fixed: Addressed high-impact CVEs. Fixed critical issue where Hybrid mode would hang. Patched "Zip Slip" vulnerability in bundle decompression...
Engineering
Propose a fix for this caching bug — show me the exact code change needed
The bug can be fixed by ensuring that cacheThisRequest is explicitly set to false when upstream cache control is enabled and the required header is missing. Change the logic in gateway/res_cache.go:
cacheThisRequest = res.Header.Get(upstreamCacheHeader) != ""
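For context, a hedged sketch of how that one-line check might sit in the surrounding decision — the shouldCache wrapper and the controlEnabled flag are assumptions, not the actual structure of res_cache.go:

    package gateway

    import "net/http"

    // shouldCache is an assumed wrapper; only the header check itself
    // comes from the proposed fix.
    func shouldCache(res *http.Response, controlEnabled bool) bool {
        if !controlEnabled {
            return true // control off: keep the existing default behavior
        }
        // The fix: cache only when the upstream header is actually present.
        return res.Header.Get(upstreamCacheHeader) != ""
    }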
Business Ops
Answer this 47-question RFP security questionnaire — search Confluence, docs, and code for evidence
Systematically searched Confluence, docs, and the codebase. Compiled answers for 43/47 questions with source references. 4 questions flagged for manual review. Estimated 12 hours of work reduced to 20 minutes...
Engineering · QA
Assess our test coverage for certificate filtering by comparing open bugs to existing tests
CRITICAL: No acceptance criteria defined for certificate statuses. CRITICAL: Available filter states not documented. ERROR: Non-functional requirements like performance not specified. WARNING: Edge cases for expired certificates not covered...
UX · Engineering
I'm designing a real-time collaborative editor for the dashboard — is that technically feasible with our current architecture?
Short answer: not without significant changes. The dashboard uses REST polling on a 30s interval (dashboard/api/poller.ts). There's no WebSocket infrastructure — the backend has no pub/sub layer. Real-time collab would require: 1. Adding a WebSocket gateway (estimate: 3-5 weeks). 2. Conflict resolution (OT or CRDT). 3. Redis pub/sub for state sync. Alternative: you could do near-real-time with optimistic UI + 2s polling, which works within current architecture. That's a 1-week effort.
UX
Can the settings page support drag-and-drop reordering? The design assumes it but I want to check before we finalize mocks.
The settings page renders items from a static config object in settings/config.ts — there's no sort order field in the database. Drag-and-drop would require: 1. Adding a "position" column to the settings table. 2. A new API endpoint for reorder (PATCH /settings/reorder). 3. Frontend: the current list uses v-for with no key tracking — needs refactoring. Estimated effort: 2-3 days backend, 1-2 days frontend. The simpler option: alphabetical or category-based sorting, which needs zero backend changes.
Engineering
CI tests are failing — can you pinpoint which change broke them and when was the last green build?
The failures began on November 5th, immediately following the merge of pull request #143. This PR updated the GitHub Actions runner from ubuntu-22.04 to ubuntu-24.04. The operating system upgrade is the direct cause of the CI failures...
Engineering
Why does the rate limiter return 429 for authenticated users even when they're under quota?
Found the issue in middleware/rate_limit.go:142. The token bucket lookup uses client IP instead of API key when the X-Forwarded-For header is present. Authenticated users behind a shared proxy hit the same bucket. Fix: use the session key as primary identifier...
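A minimal sketch of that identifier change, with assumed types and helper names — the real lookup lives around middleware/rate_limit.go:142:

    package middleware

    import (
        "net"
        "net/http"
    )

    // Session stands in for the real session object; APIKey is assumed.
    type Session struct{ APIKey string }

    func clientIP(r *http.Request) string {
        host, _, err := net.SplitHostPort(r.RemoteAddr)
        if err != nil {
            return r.RemoteAddr
        }
        return host
    }

    // bucketKey prefers the authenticated key, so callers behind a shared
    // proxy (same X-Forwarded-For chain) no longer collide on one bucket.
    func bucketKey(r *http.Request, s *Session) string {
        if s != nil && s.APIKey != "" {
            return "key:" + s.APIKey
        }
        return "ip:" + clientIP(r) // fallback for anonymous traffic only
    }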
QA · Engineering
Read ticket PROJ-8410, then find the original task and spec it was built from. Think like QA — cover positive, negative, and edge cases. Update the ticket with detailed test cases, then check tests/ for gaps in automated coverage.
Done. Read PROJ-8410 (webhook retry logic). Traced back to PROJ-7201 (original spec) and RFC-031 (design doc). Generated 14 test cases: 5 positive (successful retry, backoff timing, max retries), 4 negative (malformed payload, timeout, auth failure), 5 edge (concurrent retries, partial failure, clock skew). Updated PROJ-8410 with all cases. Gap analysis: tests/webhook_test.go covers 6/14 — missing timeout, concurrent, and all edge cases.
Engineering
Nightly CI failed on main — analyze the failure, identify root cause, and post a summary to #engineering with steps to fix
Build #4721 failed at 03:12 UTC. Root cause: integration test test_api_gateway_timeout depends on external sandbox endpoint that returned 503. Not a code regression — infrastructure issue. Posted to #engineering: "Nightly build failed — external sandbox was down. Tests are green when sandbox is healthy. Recommend: add retry logic to integration suite or mock the external dependency. Created INFRA-892 for the sandbox team."
QA · Engineering
@Probe analyze ticket PROJ-16554 and compare its acceptance criteria with PR #7745. Provide a list of test cases and edge cases.
Analyzed PROJ-16554 (rate limit policy per API key) and PR #7745 (38 files changed). AC comparison: 3/7 acceptance criteria fully covered by PR. Missing: per-key quota reset behavior, concurrent request handling under burst, and error response format for 429s. Generated 18 test cases: 8 positive (basic rate limiting, quota tracking, key rotation), 6 negative (expired keys, malformed headers, zero-quota edge), 4 edge (burst at quota boundary, clock skew between nodes, race condition on counter increment).
Product · Engineering
@Probe quantify the cost to deliver PROJ-15859 — time in each kanban state, time on PRs and tests in GitHub, number of people involved and for how long
PROJ-15859 (SSO integration): Total cycle time: 34 days. Breakdown: Backlog → In Dev: 8 days (waited for sprint). In Dev: 14 days across 3 engineers (2 PRs: #6201 primary, #6240 hotfix). PR review time: 6.2 days avg (first review comment after 2.1 days). QA: 4 days (failed first pass, 2 days to fix, 1 day re-test). Done: deployed day 34. GitHub effort: 847 lines added, 312 removed, 23 review comments. People: 3 engineers (1 primary, 2 reviewers), 1 QA, 1 PM. Estimated cost at blended rate: ~$18,400.
Product
@Probe for PROJ-15859 — focus on refinement. What was missed in the original spec, what was added during dev, what AC would you recommend, and how did this influence coding time?
Refinement gaps: Original spec had 4 AC — missing: session timeout handling, multi-IdP fallback, SAML vs OIDC toggle behavior, and error states for expired assertions. During dev: 3 unplanned items added (SCIM provisioning hook, group mapping, audit log entries) — these added ~6 days. Recommended AC that would have caught this: 1. Define supported IdP protocols upfront. 2. Specify session lifecycle (creation, refresh, timeout, revocation). 3. Include admin UI states for config errors. 4. Define audit log schema for SSO events. Impact: ~40% of coding time was spent on requirements discovered mid-development.
Product
Review the proposed architecture in epic PROJ-220 — give me your honest opinion on feasibility, risks, and what's missing from the spec
Epic PROJ-220 (event-driven notification system): Feasibility: 7/10. The proposed Kafka-based architecture is sound but over-engineered for current throughput (200 events/sec). Risks: 1. No fallback if Kafka goes down. 2. Schema evolution strategy is undefined. 3. Consumer lag monitoring not addressed. Missing: SLA requirements, capacity planning, rollback strategy, data retention policy. Recommendation: start with Redis Streams, migrate to Kafka when you hit 10k events/sec.
Platform
Our deploy workflow in GitHub Actions started failing intermittently — find the pattern and root cause across recent runs
Analyzed last 40 runs of deploy.yml. Failures correlate with runs triggered between 02:00-04:00 UTC — the container registry rate-limits pulls during that window. 12/15 failures show "429 Too Many Requests" in the docker pull step. Fix: add registry mirror as fallback, or switch to pre-cached base images. Created OPS-1847 with details.
Automation · Product
@Probe remind me every Monday 9am: check Zendesk tickets tagged "churn-risk" and cross-reference with open Jira bugs for those accounts
Scheduled. Every Monday at 9:00 AM I'll pull Zendesk tickets tagged "churn-risk," match them to customer accounts in Jira, and DM you a summary: which accounts have open bugs, severity, and how long they've been waiting. First report arrives this Monday.
Automation · Engineering
@Probe follow up on this thread tomorrow — check if the hotfix was deployed and the error rate dropped in Datadog
Got it. Tomorrow at 10:00 AM I'll check: 1. Whether PR #1847 was merged and deployed to production. 2. The error rate for /api/checkout in Datadog over the last 24h. I'll reply in this thread with the status. If the error rate is still elevated, I'll flag it in #engineering.
Automation · QA
@Probe every Friday 4pm: check all tickets moved to "Ready for QA" this week — flag any missing acceptance criteria or test cases
Scheduled weekly QA gate check. Every Friday at 4:00 PM I'll scan all tickets moved to "Ready for QA" during the week, evaluate each for: acceptance criteria completeness, defined edge cases, and linked test cases. Tickets that fail get flagged with comments and a summary posted to #qa-team. First run this Friday.
Automation · Product
@Probe every hour: check newly created Jira tickets, find related Zendesk cases, classify the issue, and add early analysis as a comment
Hourly ticket enrichment active. Every hour I'll scan Jira for tickets created in the last 60 minutes, search Zendesk for related customer cases by keyword and account, classify each issue (bug, feature request, regression, config error), and post a comment with: related Zendesk tickets, affected customers, preliminary root cause from the codebase, and suggested priority. Already processed 4 new tickets — PROJ-4210 linked to 3 Zendesk cases from the same customer, classified as regression in the billing module.
Automation
@Probe set up a daily standup digest for #backend-team: open PRs, blocking issues, tickets in review, and anything stale for 3+ days
Daily standup digest configured for #backend-team at 9:15 AM. Each morning I'll post: open PRs with review status, tickets blocked or waiting on external teams, items in code review for 2+ days, and anything in "In Progress" with no activity for 3+ days. Team members can reply to the digest thread with updates.
Platform · Engineering
Correlate the Datadog spike in p99 latency with recent deployments and config changes — what went out in the last 6 hours?
Latency spike began at 14:22 UTC. Timeline: 13:45 — PR #892 merged (added request logging middleware). 14:10 — deploy to prod completed. 14:22 — p99 jumped from 120ms to 890ms. The new middleware calls JSON.stringify on every request body including file uploads. Confirmed: disabling the middleware in staging restores normal latency. Recommend: skip serialization for multipart requests.
Platform
We have 14 microservices — which ones are missing health checks, structured logging, or graceful shutdown? Audit all repos.
Scanned all 14 service repos. Missing health checks: payment-service, notification-worker, analytics-ingest (3/14). Missing structured logging: legacy-api, pdf-generator still use fmt.Println (2/14). Missing graceful shutdown: notification-worker, batch-processor, analytics-ingest — all exit immediately on SIGTERM (3/14). Priority: analytics-ingest fails all 3 checks. Created PLAT-320 through PLAT-327 with per-service remediation steps.
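For reference, the usual Go shape of the graceful-shutdown fix those services need — a generic sketch, not the repos' actual code:

    package main

    import (
        "context"
        "log"
        "net/http"
        "os"
        "os/signal"
        "syscall"
        "time"
    )

    func main() {
        srv := &http.Server{Addr: ":8080"}
        go func() {
            if err := srv.ListenAndServe(); err != http.ErrServerClosed {
                log.Fatalf("listen: %v", err)
            }
        }()

        // Instead of exiting immediately on SIGTERM, drain in-flight
        // requests within a deadline.
        stop := make(chan os.Signal, 1)
        signal.Notify(stop, syscall.SIGTERM, syscall.SIGINT)
        <-stop

        ctx, cancel := context.WithTimeout(context.Background(), 15*time.Second)
        defer cancel()
        if err := srv.Shutdown(ctx); err != nil {
            log.Printf("shutdown: %v", err)
        }
    }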
Product
Give me solution design, functional AC, and non-functional AC for this ticket
Solution Design: Configure the container registry to reject pushes to existing tags and update the CI/CD pipeline to use unique tags per build.
Functional: All Docker images tagged with Git commit SHA. Deployment processes updated to use new tags.
Non-functional: Image publishing must not be significantly slowed...
QA
Write unit and integration tests that prove the acceptance criteria for PROJ-4521
9 tests across 3 categories. Unit: verify tag uniqueness validation, test rejection of duplicate tags, validate rollback on failure. Integration: end-to-end pipeline with unique tag generation. Cross-project: dependency resolution with new tagging scheme...
Engineering
Are these two bugs related? Jira PROJ-9234 and Zendesk ticket #17962
Based on analysis, these are NOT related. PROJ-9234 is about the runtime behavior of the gateway process (how it stops). Zendesk #17962 is about data representation in the dashboard UI (how session tokens are displayed). Completely different subsystems and problem areas...
Product
Estimate these 5 tasks using Fibonacci points — factor in codebase complexity and cross-service deps
Based on code analysis and historical PRs of similar scope: Task 1: 1 point (single-file config change). Task 2: 3 points (touches 4 files across 2 services). Task 3: 2 points. Task 4: 5 points (requires migration). Task 5: 3 points. Total: 14 points.
Engineering
These 4 tickets look similar — trace the code and tell me if they share a root cause. Can we fix them in one PR?
After tracing the code paths: 3 of the 4 tickets share the same root cause — a race condition in the session store when Redis connections reset. PROJ-8821 is unrelated (UI rendering issue). The 3 related bugs can be fixed in one effort by adding a mutex in storage/redis.go...
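A sketch of that mutex fix with assumed names — the real struct lives in storage/redis.go, and go-redis is a guess at the client library in use:

    package storage

    import (
        "sync"

        "github.com/redis/go-redis/v9"
    )

    type SessionStore struct {
        mu     sync.Mutex // serializes reconnects
        client *redis.Client
    }

    // reconnect replaces the client under the lock, so two goroutines
    // reacting to the same connection reset can't race each other.
    func (s *SessionStore) reconnect(opts *redis.Options) {
        s.mu.Lock()
        defer s.mu.Unlock()
        if s.client != nil {
            s.client.Close()
        }
        s.client = redis.NewClient(opts)
    }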
Engineering
Show me where API authentication lives in the codebase and walk me through the request lifecycle
Authentication lives in middleware/auth_check.go. The request lifecycle: 1. Listener accepts connection (server/http.go). 2. Router matches endpoint (mux/router.go). 3. Middleware chain executes: rate-limit → auth → transform → proxy. 4. Response flows back through the chain...
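The chain pattern that lifecycle implies, as a generic Go sketch — the names here are illustrative, not taken from mux/router.go:

    package mux

    import "net/http"

    type Middleware func(http.Handler) http.Handler

    // Chain wraps h so the first middleware listed runs first on the way
    // in and last on the way out.
    func Chain(h http.Handler, mws ...Middleware) http.Handler {
        for i := len(mws) - 1; i >= 0; i-- {
            h = mws[i](h)
        }
        return h
    }

    // Usage, mirroring the order above:
    //   handler := Chain(proxy, rateLimit, authCheck, transform)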
All
Give me a digest of key technical decisions from last week — pull from Slack, RFCs, and design docs
Week of Jan 27: 1. Platform team decided to migrate from Redis 6 to Redis 7 (thread in #platform). 2. RFC-042 approved: new circuit breaker pattern for upstream calls. 3. Decision to deprecate v1 REST API by Q3. 4. Security team flagged OpenSSL upgrade as P1...
UX · Engineering
The new design has inline editing for all table cells. Before I push back on scope — what does the current table component actually support?
The DataTable component (components/DataTable.vue) supports: read-only rendering, column sorting, pagination, and row selection. No inline editing — cells are plain <span> elements. The component doesn't emit cell-level events, only row-level (row-click, row-select). Adding inline editing means: new cell renderer, per-field validation, debounced auto-save, and undo support. Easier path: an "Edit" button per row that opens a side panel — the SidePanel component already exists and handles form state.
UX · Product
Design wants to show "time since last activity" on user profiles — do we even track that? What data is available?
Partially. The users table has "last_login_at" (updated on auth) and "updated_at" (updated on profile changes). But there's no general activity tracking — no page views, no click events, no session duration. The closest thing: audit_log table records API calls with user_id and timestamp, but it's write-heavy actions only (create, update, delete). For the design: you can show "Last login: 3 days ago" today. Showing "Last active" would need a new activity tracking service — that's a separate project.
Engineering
Run a first-pass security scan — find hardcoded secrets, deprecated crypto, and SQL injection risks
Found: 3 instances of hardcoded API keys in config files (config/test_fixtures.go). 7 uses of deprecated crypto/md5 for hashing (should use SHA-256). 2 instances of SQL string concatenation in reporting module. 1 use of eval() in template engine...
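Two of those finding types have mechanical fixes; a hedged Go illustration with hypothetical names:

    package security

    import (
        "crypto/sha256"
        "database/sql"
        "encoding/hex"
    )

    // hashToken: crypto/sha256 in place of the deprecated crypto/md5.
    func hashToken(token string) string {
        sum := sha256.Sum256([]byte(token))
        return hex.EncodeToString(sum[:])
    }

    // findReports: placeholder binding instead of string concatenation,
    // which closes the injection path ($1 assumes a Postgres driver).
    func findReports(db *sql.DB, owner string) (*sql.Rows, error) {
        return db.Query("SELECT id, title FROM reports WHERE owner = $1", owner)
    }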
Engineering
List every public function in pkg/api/handler.go that's missing from our API reference docs
Comparing handler.go exports with docs/api-reference/: 4 undocumented endpoints found: HandleBatchRequest(), HandleWebhookCallback(), HandleGraphQLPlayground(), HandleMetricsExport(). All added after v5.8 release — docs were not updated...
Product
Diff these two job spec drafts in Confluence — what's different, what's missing from each?
Key differences: Spec A requires 5+ years experience, Spec B requires 3+. Spec A includes on-call rotation, Spec B doesn't mention it. 4 items in Spec A missing from Spec B: security clearance requirement, travel expectations, budget ownership, team size...
Business Ops
List every GPL and AGPL dependency across all repos — flag anything in production code
Scanning all dependency manifests and lock files across 47 repos. Found: 23 GPL-licensed dependencies (12 are test-only), 4 AGPL packages requiring legal review, 156 Apache-2.0 (no action needed). Flagged: 2 transitive AGPL deps in production code...
Business Ops
I need the remote work policy and the process for requesting a new laptop — find both
Remote work policy: Confluence > People Ops > Remote Work Guidelines (last updated Oct 2024). Equipment requests: submit through ServiceNow portal > IT Hardware > New Equipment Request. Standard approval takes 2-3 business days, expedited available for new hires...
Business Ops
Pull engineering effort data from Jira for Project X this quarter — we need it for R&D tax credit filing
Project X, Q4 2024: 847 tickets completed across 12 engineers. 2,340 story points delivered. Breakdown by category: New feature development (62%), bug fixes (18%), infrastructure (12%), documentation (8%). Qualifying R&D activities: 74% of total effort...
Business Ops
User can't access staging — check known issues, recent infra changes, and open a ticket if needed
Checked: 1. No active incidents in #status channel. 2. Found: VPN config was updated yesterday (Jira OPS-2341) — new certificate required. 3. Resolution: User needs to re-download VPN profile from portal. Created ticket OPS-2387 to update the onboarding docs...
Engineering
Check tickets assigned to me in this sprint for project TT. Analyze status, comments, and pending PRs — tell me what needs my attention before standup
Sprint "TT-24.3": You have 6 tickets. 2 need action: PROJ-3341 (In Review) — reviewer left 3 comments yesterday, 1 blocking change requested in auth module. PROJ-3358 (In Progress) — blocked by PROJ-3290 which was merged 2h ago, you can unblock now. PRs: #487 approved and ready to merge. #491 has failing CI — same sandbox issue from last night. Suggestion: merge #487, rebase #491, then address review comments on PROJ-3341.
Technical Writer · Engineering
Ticket PROJ-6100 changed the webhook retry logic — update the docs, open a PR, and link it back to the Jira ticket
Done. Read PROJ-6100 and the merged PR #672. Changes: max retries increased from 3 to 5, backoff changed from fixed to exponential, new X-Retry-Count header added. Updated docs/webhooks/retry-policy.md with new behavior table and code examples. Opened PR #701 "Update webhook retry docs for PROJ-6100". Linked PR back to PROJ-6100 in Jira. Also flagged: the API changelog needs an entry — created PROJ-6105 for that.
Engineering
I just taught you how our deployment pipeline works. Save this as a runbook so you can help the whole team with deployments going forward.
Saved to knowledge base as "Deployment Runbook — Production Pipeline". Contents: 1. Pre-deploy checklist (migrations, feature flags, config). 2. Deploy steps (merge to main → CI → staging canary → prod rolling). 3. Rollback procedure (revert commit, re-run pipeline, verify health checks). 4. Common issues and fixes. This is now available to all team members who ask about deployments.
QA · Engineering
Generate TDD-style test stubs for the acceptance criteria in PROJ-5500 — I want to write tests before I write the implementation
Generated 11 test stubs from PROJ-5500 acceptance criteria. Organized into 3 files: test_rate_limit_policy.go (4 tests: applies correct limit per tier, rejects over-quota, resets at window boundary, handles concurrent requests), test_rate_limit_api.go (4 tests: CRUD for policies, validation errors, conflict detection), test_rate_limit_integration.go (3 tests: end-to-end with Redis, failover behavior, metrics emission). All stubs have descriptive names and TODO comments mapping back to specific AC items.
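One stub as a shape example — the test name and AC reference are illustrative, not quoted from PROJ-5500:

    package ratelimit

    import "testing"

    func TestAppliesCorrectLimitPerTier(t *testing.T) {
        // TODO(PROJ-5500, AC-1): a key on a given tier succeeds up to the
        // tier limit; the next request returns 429.
        t.Skip("pending implementation")
    }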
Product · QA
@Probe analyze Zendesk tickets updated this week. Summarize by severity, resolution type (product fix vs environment issue vs docs gap). For product issues — suggest test cases. For docs issues — suggest fixes. Delegate each ticket analysis individually.
Analyzed 47 Zendesk tickets updated this week. Summary: 12 Critical (3 product bugs, 7 environment/config, 2 docs gaps). 19 High (5 product, 9 environment, 5 docs). 16 Medium/Low. Product bugs (8 total): ZD-29841 — race condition in webhook delivery, suggested 3 regression tests. ZD-29903 — pagination broken on filtered results, 2 test cases covering offset+filter combo. ZD-29877 — API returns 500 on empty array body, 1 negative test. Docs gaps (7): ZD-29856 — migration guide missing step for Redis 7 upgrade. ZD-29901 — rate limit headers undocumented. Created 8 Jira tickets for product fixes and 7 docs PRs drafted.
Engineering · Product
@Probe check the In Dev time for PROJ-15859. What were the blockers? When was the first human PR review (ignore bot comments)? How long from In Dev to QA? How long to fix when QA failed? Show me the cycles and reaction times.
PROJ-15859 flow analysis: In Dev entered: Jan 12. First commit: Jan 12 (same day). PR #6201 opened: Jan 18 (6 days coding). First human review comment: Jan 20 — 2 days wait (3 bot comments before that filtered out). Review cycles: Round 1: 4 comments, 2 blocking — fixed in 1 day. Round 2: approved Jan 22. Moved to QA: Jan 23 (11 days In Dev). QA failed: Jan 24 — edge case in group mapping. Fix PR #6240: opened Jan 25, merged Jan 26. QA re-test passed: Jan 27. Blockers: 2-day wait for first review, 1 day for QA feedback. Reaction times: avg 1 day to respond to review comments, same-day QA fix turnaround. Total QA cycle: 4 days (including 1 failure + retest).
Platform
Our GitHub Actions matrix build takes 47 minutes — analyze the workflow, find bottlenecks, and propose how to cut it in half
Analyzed ci.yml and last 20 runs. Bottlenecks: 1. Docker layer cache miss on every run — no cache-from configured (adds 8min). 2. Integration tests run sequentially across 3 DBs — can parallelize with matrix strategy (saves 12min). 3. Linting runs after build instead of concurrently (wastes 4min). 4. npm ci re-downloads every time — add actions/cache for node_modules (saves 3min). Proposed workflow cuts build from 47min to ~20min. PR draft ready for review.
Platform · Engineering
Grafana shows memory climbing on the API pods until OOMKill every 18 hours — trace the leak through logs, metrics, and recent code changes
Cross-referenced Grafana memory charts, Loki container logs, and git history. The leak started 5 days ago after PR #1204 merged — it added an in-memory response cache with no TTL eviction. Cache grows unbounded: 200MB after 6h, 1.2GB at OOMKill. The cache key includes query params, so unique API calls generate unique entries forever. Fix: add TTL-based eviction (already implemented in cache/lru.go but not wired up). One-line fix in api/middleware.go:89.
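For illustration, a minimal TTL-evicting cache in Go — a sketch of the missing eviction behavior, not the actual contents of cache/lru.go:

    package cache

    import (
        "sync"
        "time"
    )

    type entry struct {
        val     []byte
        expires time.Time
    }

    type TTLCache struct {
        mu   sync.Mutex
        ttl  time.Duration
        data map[string]entry
    }

    func NewTTLCache(ttl time.Duration) *TTLCache {
        return &TTLCache{ttl: ttl, data: make(map[string]entry)}
    }

    // Get lazily evicts expired entries on read; a periodic sweep would
    // still be needed for keys that are never read again.
    func (c *TTLCache) Get(key string) ([]byte, bool) {
        c.mu.Lock()
        defer c.mu.Unlock()
        e, ok := c.data[key]
        if !ok || time.Now().After(e.expires) {
            delete(c.data, key)
            return nil, false
        }
        return e.val, true
    }

    func (c *TTLCache) Set(key string, val []byte) {
        c.mu.Lock()
        defer c.mu.Unlock()
        c.data[key] = entry{val: val, expires: time.Now().Add(c.ttl)}
    }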
Platform
Map our Terraform infrastructure to the services that depend on it — which resources are used by which teams and repos?
Scanned 8 Terraform modules and cross-referenced with service configs, Dockerfiles, and env vars across all repos. Mapping: RDS cluster → used by 4 services (user-api, billing, analytics, admin). ElastiCache → 3 services (session-store, rate-limiter, user-api). SQS queues → 5 services across 2 teams. Orphaned resources found: 2 unused S3 buckets (last write 8 months ago), 1 idle NAT gateway costing $32/day. Owner gaps: 3 Lambda functions have no team tag. Created spreadsheet and PLAT-412 for cleanup.
Automation · Product
@Probe every time a Zendesk ticket is tagged "feature-request" — find the related Jira epic, add customer context, and update the epic with the request
Automation active. When a Zendesk ticket gets tagged "feature-request," I'll: 1. Extract the feature area and match it to existing Jira epics. 2. Add the customer name, plan tier, and request details as a comment on the epic. 3. If no matching epic exists, I'll create a new one in the Product Backlog and DM you. Already processed 3 tickets from today — linked to EPIC-142 and EPIC-89.
Automation · Engineering
@Probe check this PR every 6 hours — when all reviewers approve and CI is green, merge it and post to #releases
Watching PR #2104. I'll check every 6 hours for: all requested reviewers approved, CI pipeline passing, no merge conflicts. Once all conditions are met, I'll merge to main and post a summary to #releases with the changelog. If anything blocks it, I'll reply here with what's still pending.
Automation · Platform
@Probe every morning: check Datadog for any service with error rate above 1% or p99 above 500ms in the last 12 hours — post findings to #on-call
Morning health check scheduled for 8:00 AM daily. I'll query Datadog for services exceeding thresholds, correlate any anomalies with recent deploys from GitHub, and post a summary to #on-call. If everything is healthy, I'll post a green status. If issues are found, I'll include: affected service, metric values, likely cause, and link to the relevant deploy or config change.
Automation
@Probe personal digest every morning: my open PRs, tickets assigned to me, any threads I was mentioned in yesterday, and upcoming deadlines this week
Personal daily digest configured. Every morning at 8:30 AM I'll DM you: 1. Your open PRs — review status and CI results. 2. Assigned tickets — any status changes or new comments overnight. 3. Slack mentions from yesterday you haven't responded to. 4. Tickets with due dates this week. Digest arrives as a single message you can scan in 30 seconds.