Argus MCP & Plugin Integration Architecture¶

PhD-Level Analysis: The Intelligent SDLC/STLC Interface Layer¶

Version: 3.0 Date: 2026-01-29 Status: Production Implementation Review

Executive Summary¶

Argus is not a testing tool — it's an AI-powered Quality Intelligence Platform that serves as "Jarvis for engineering organizations." The MCP server and Claude Code plugin form the critical human-AI interface layer that connects developers directly to:

73 MCP tools exposing full platform capabilities
27 specialized AI agents for autonomous task execution
60+ API endpoints for every SDLC/STLC operation
10+ data source integrations (Sentry, Jira, GitHub, Datadog, etc.)
Real-time knowledge graph (Cognee + FalkorDB + Neo4j)
Multi-tier intelligence (Cache → Vector → Graph → LLM)

1. Current Implementation Status¶

1.1 MCP Server: 73 Tools Implemented ✅¶

┌─────────────────────────────────────────────────────────────────────────────────────────────┐
│                           ARGUS MCP SERVER - 73 TOOLS                                        │
├─────────────────────────────────────────────────────────────────────────────────────────────┤
│                                                                                              │
│  AUTHENTICATION (4)                 BROWSER TESTING (7)                                     │
│  ├─ argus_auth                      ├─ argus_health                                         │
│  ├─ argus_auth_complete             ├─ argus_discover                                       │
│  ├─ argus_auth_status               ├─ argus_act                                            │
│  └─ argus_auth_logout               ├─ argus_test                                           │
│                                     ├─ argus_extract                                        │
│  QUALITY INTELLIGENCE (3)           ├─ argus_agent                                          │
│  ├─ argus_quality_score             └─ argus_generate_test                                  │
│  ├─ argus_quality_stats                                                                      │
│  └─ argus_risk_scores               PRODUCTION EVENTS (4)                                   │
│                                     ├─ argus_events                                          │
│  TEST MANAGEMENT (2)                ├─ argus_event_triage                                   │
│  ├─ argus_tests                     ├─ argus_test_from_event                                │
│  └─ argus_test_review               └─ argus_batch_generate                                 │
│                                                                                              │
│  SELF-HEALING (4)                   SMART INSIGHTS (4)                                      │
│  ├─ argus_healing_config            ├─ argus_what_to_test                                   │
│  ├─ argus_healing_patterns          ├─ argus_coverage_gaps                                  │
│  ├─ argus_healing_stats             ├─ argus_dashboard                                      │
│  └─ argus_healing_review            └─ argus_ask                                            │
│                                                                                              │
│  SYNCHRONIZATION (4)                TEST EXPORT (4)                                         │
│  ├─ argus_sync_push                 ├─ argus_export                                         │
│  ├─ argus_sync_pull                 ├─ argus_export_languages                               │
│  ├─ argus_sync_status               ├─ argus_recording_to_test                              │
│  └─ argus_sync_resolve              └─ argus_recording_snippet                              │
│                                                                                              │
│  COLLABORATION (2)                  DISCOVERY (4)                                           │
│  ├─ argus_presence                  ├─ argus_discovery_start                                │
│  └─ argus_comments                  ├─ argus_discovery_flows                                │
│                                     ├─ argus_discovery_generate                             │
│  TIME TRAVEL (5)                    └─ argus_discovery_compare                              │
│  ├─ argus_time_travel_checkpoints                                                            │
│  ├─ argus_time_travel_history       CI/CD INTEGRATION (4)                                   │
│  ├─ argus_time_travel_replay        ├─ argus_cicd_test_impact                               │
│  ├─ argus_time_travel_fork          ├─ argus_cicd_deployment_risk                           │
│  └─ argus_time_travel_compare       ├─ argus_cicd_builds                                    │
│                                     └─ argus_cicd_pipelines                                 │
│  CORRELATIONS (4)                                                                            │
│  ├─ argus_correlations_timeline     API TESTING (3)                                         │
│  ├─ argus_correlations_root_cause   ├─ argus_api_discover                                   │
│  ├─ argus_correlations_insights     ├─ argus_api_generate                                   │
│  └─ argus_correlations_query        └─ argus_api_run                                        │
│                                                                                              │
│  SCHEDULING (4)                     VISUAL AI (5)                                           │
│  ├─ argus_schedule_create           ├─ argus_visual_capture                                 │
│  ├─ argus_schedule_list             ├─ argus_visual_compare                                 │
│  ├─ argus_schedule_run              ├─ argus_visual_baseline                                │
│  └─ argus_schedule_history          ├─ argus_visual_baselines                               │
│                                     └─ argus_visual_analyze                                 │
│  INFRASTRUCTURE (5)                                                                          │
│  ├─ argus_infra_overview            PROJECTS (1)                                            │
│  ├─ argus_infra_recommendations     └─ argus_projects                                       │
│  ├─ argus_infra_apply                                                                        │
│  ├─ argus_browser_pool                                                                       │
│  └─ argus_llm_usage                                                                          │
│                                                                                              │
└─────────────────────────────────────────────────────────────────────────────────────────────┘

1.2 Backend Agents: 27 Implemented ✅¶

Tier	Agent	Purpose	Status
Core	CodeAnalyzerAgent	AST-based codebase analysis	✅
Core	TestPlannerAgent	STAMP framework test specs	✅
Core	UITesterAgent v2	Browser pool testing	✅
Core	APITesterAgent	Schema validation	✅
Core	DBTesterAgent	Data integrity	✅
Core	SelfHealerAgent	4-mode healing (99.9%)	✅
Core	ReporterAgent	Multi-format reports	✅
Quality	PerformanceAnalyzerAgent	Core Web Vitals	✅
Quality	SecurityScannerAgent	OWASP Top 10	✅
Quality	AccessibilityCheckerAgent	WCAG 2.1	✅
Quality	VisualAIAgent	Screenshot comparison	✅
Quality	FlakyTestDetector	Statistical flakiness	✅
Quality	QualityAuditorAgent	Code quality	✅
Intelligence	ReporterAgent	Report generation	✅
Intelligence	RouterAgent	Multi-model routing	✅
Intelligence	TestImpactAnalyzerAgent	Change impact	✅
Intelligence	MRAnalyzerAgent	PR analysis	✅
Intelligence	NLPTestCreatorAgent	Plain English → tests	✅
Intelligence	RootCauseAnalyzerAgent	Failure diagnosis	✅
Advanced	AutoDiscoveryAgent	App crawling	✅
Advanced	SessionToTestAgent	Recording → test	✅
Orchestration	SupervisorAgent	Multi-agent coordination	✅

1.3 Integrations: 13+ Platforms ✅¶

┌─────────────────────────────────────────────────────────────────────────────────────────────┐
│                           INTEGRATION STATUS                                                 │
├──────────────────────┬───────────────┬───────────────────────────────────────────────────────┤
│  Platform            │  Status       │  Capabilities                                         │
├──────────────────────┼───────────────┼───────────────────────────────────────────────────────┤
│  GitHub              │  ✅ Complete  │  PRs, Issues, Webhooks, Check Runs, Actions           │
│  GitLab              │  ✅ Complete  │  MRs, Pipelines, Webhooks                             │
│  Jira                │  ✅ Complete  │  Issues, Stories, Acceptance Criteria, Comments       │
│  Linear              │  ✅ Complete  │  Issues, Projects, Cycles                             │
│  Sentry              │  ✅ Complete  │  Errors, Stack Traces, Fingerprints                   │
│  Datadog             │  ✅ Complete  │  APM, Metrics, Session Replay (via Hub)               │
│  New Relic           │  ✅ Complete  │  APM, Analytics (via Hub)                             │
│  Slack               │  ✅ Complete  │  Notifications, Alerts, Reports                       │
│  PagerDuty           │  ✅ Complete  │  Incidents, Correlation                               │
│  Vercel              │  ✅ Complete  │  Deployments, Preview URLs                            │
│  LaunchDarkly        │  ✅ Complete  │  Feature Flags, Conditional Testing                   │
│  Amplitude           │  ✅ Complete  │  Product Analytics, User Journeys                     │
│  Confluence          │  🔄 Partial   │  Read-only, Needs Full Sync                           │
│  FullStory           │  📋 Planned   │  Session Replay to Test                               │
│  PostHog             │  📋 Planned   │  Product Analytics                                    │
└──────────────────────┴───────────────┴───────────────────────────────────────────────────────┘

2. MCP Architecture Deep Dive¶

2.1 Deployment Architecture¶

┌─────────────────────────────────────────────────────────────────────────────────────────────┐
│                           MCP SERVER DEPLOYMENT                                              │
├─────────────────────────────────────────────────────────────────────────────────────────────┤
│                                                                                              │
│   AI IDE (Claude Code / Cursor / Windsurf / VS Code)                                        │
│                              │                                                               │
│                              │ MCP Protocol (SSE / Streamable-HTTP)                          │
│                              ▼                                                               │
│   ┌──────────────────────────────────────────────────────────────────────────────────────┐  │
│   │                    ARGUS MCP SERVER (Cloudflare Workers)                              │  │
│   │                                                                                        │  │
│   │   ┌────────────────────────────────────────────────────────────────────────────────┐ │  │
│   │   │  AUTHENTICATION LAYER                                                           │ │  │
│   │   │  • OAuth2 Device Flow (user authentication)                                     │ │  │
│   │   │  • JWT Signing (browser pool access)                                            │ │  │
│   │   │  • Token Storage (Durable Objects + KV)                                         │ │  │
│   │   └────────────────────────────────────────────────────────────────────────────────┘ │  │
│   │                                                                                        │  │
│   │   ┌────────────────────────────────────────────────────────────────────────────────┐ │  │
│   │   │  TOOL ROUTER (73 Tools)                                                         │ │  │
│   │   │  • Input validation (Zod schemas)                                               │ │  │
│   │   │  • Activity logging (non-blocking)                                              │ │  │
│   │   │  • Error handling (auth-aware)                                                  │ │  │
│   │   └────────────────────────────────────────────────────────────────────────────────┘ │  │
│   │                                                                                        │  │
│   │   ┌────────────────────────────────────────────────────────────────────────────────┐ │  │
│   │   │  STORAGE LAYER                                                                  │ │  │
│   │   │  • R2 Bucket (screenshots with signed URLs)                                     │ │  │
│   │   │  • Durable Objects (session state)                                              │ │  │
│   │   │  • KV Namespace (cross-session tokens)                                          │ │  │
│   │   └────────────────────────────────────────────────────────────────────────────────┘ │  │
│   │                                                                                        │  │
│   └──────────────────────────────────────────────────────────────────────────────────────┘  │
│                              │                                                               │
│              ┌───────────────┴───────────────┐                                              │
│              ▼                               ▼                                              │
│   ┌─────────────────────────┐     ┌─────────────────────────┐                              │
│   │  BROWSER POOL (Vultr)   │     │  BRAIN API (Railway)    │                              │
│   │                         │     │                         │                              │
│   │  • Selenium Grid        │     │  • 60+ API Endpoints    │                              │
│   │  • JWT Authentication   │     │  • 27 AI Agents         │                              │
│   │  • Screenshot Capture   │     │  • LangGraph Orchestr.  │                              │
│   │  • Action Execution     │     │  • Cognee Knowledge     │                              │
│   │  • Element Discovery    │     │  • Multi-tier Intel.    │                              │
│   │                         │     │                         │                              │
│   └─────────────────────────┘     └─────────────────────────┘                              │
│                                                                                              │
└─────────────────────────────────────────────────────────────────────────────────────────────┘

2.2 MCP Tool Categories by SDLC Phase¶

┌─────────────────────────────────────────────────────────────────────────────────────────────┐
│                           MCP TOOLS BY SDLC PHASE                                            │
├─────────────────────────────────────────────────────────────────────────────────────────────┤
│                                                                                              │
│  PLANNING                          DEVELOPMENT                     TESTING                  │
│  ───────                           ──────────                      ───────                  │
│  argus_ask                         argus_api_discover              argus_test               │
│  argus_what_to_test                argus_api_generate              argus_act                │
│  argus_coverage_gaps               argus_discovery_start           argus_discover           │
│  argus_correlations_insights       argus_discovery_flows           argus_extract            │
│  argus_dashboard                   argus_recording_snippet         argus_visual_compare     │
│                                    argus_generate_test             argus_visual_capture     │
│                                    argus_recording_to_test         argus_api_run            │
│                                                                    argus_agent              │
│                                                                                              │
│  CI/CD                             MONITORING                      MAINTENANCE              │
│  ─────                             ──────────                      ───────────              │
│  argus_cicd_test_impact            argus_events                    argus_healing_patterns   │
│  argus_cicd_deployment_risk        argus_event_triage              argus_healing_review     │
│  argus_cicd_builds                 argus_quality_score             argus_healing_stats      │
│  argus_cicd_pipelines              argus_quality_stats             argus_sync_push          │
│  argus_schedule_create             argus_risk_scores               argus_sync_pull          │
│  argus_schedule_run                argus_infra_overview            argus_time_travel_*      │
│                                    argus_browser_pool                                       │
│                                    argus_llm_usage                                          │
│                                                                                              │
│  COLLABORATION                     ANALYSIS                                                 │
│  ─────────────                     ────────                                                 │
│  argus_presence                    argus_correlations_timeline                              │
│  argus_comments                    argus_correlations_root_cause                            │
│  argus_test_review                 argus_correlations_query                                 │
│  argus_projects                    argus_visual_analyze                                     │
│                                    argus_test_from_event                                    │
│                                                                                              │
└─────────────────────────────────────────────────────────────────────────────────────────────┘

2.3 Data Flow: MCP → Brain → Knowledge Graph¶

┌─────────────────────────────────────────────────────────────────────────────────────────────┐
│                           INTELLIGENCE QUERY FLOW                                            │
├─────────────────────────────────────────────────────────────────────────────────────────────┤
│                                                                                              │
│   Developer in IDE                                                                           │
│   ────────────────                                                                           │
│   "Why did this test fail?"                                                                  │
│           │                                                                                  │
│           ▼                                                                                  │
│   ┌───────────────────────┐                                                                  │
│   │  MCP: argus_ask       │                                                                  │
│   │  or argus_event_triage│                                                                  │
│   └───────────┬───────────┘                                                                  │
│               │                                                                              │
│               ▼                                                                              │
│   ┌───────────────────────────────────────────────────────────────────────────────────────┐ │
│   │  BRAIN API: Query Router (src/intelligence/query_router.py)                           │ │
│   │                                                                                        │ │
│   │  Step 1: Intent Classification (Workers AI - <10ms)                                    │ │
│   │  ────────────────────────────────────────────────                                      │ │
│   │  • SIMILAR_ERRORS: "Find failures like this"                                           │ │
│   │  • ROOT_CAUSE: "Why did this fail?"                                                    │ │
│   │  • TEST_IMPACT: "What tests for this file?"                                            │ │
│   │  • HEALING: "How to fix this selector?"                                                │ │
│   │  • COVERAGE_GAP: "What's not tested?"                                                  │ │
│   │  • RISK_SCORE: "How risky is this change?"                                             │ │
│   │  • DOCUMENTATION: "What does this feature do?"                                         │ │
│   │                                                                                        │ │
│   │  Step 2: Tiered Resolution                                                             │ │
│   │  ────────────────────────                                                              │ │
│   │                                                                                        │ │
│   │  ┌─────────────┐    ┌─────────────┐    ┌─────────────┐    ┌─────────────┐            │ │
│   │  │  TIER 1     │    │  TIER 2     │    │  TIER 3     │    │  TIER 4     │            │ │
│   │  │  Cache      │───▶│  Vector     │───▶│  Graph      │───▶│  LLM        │            │ │
│   │  │  (Valkey)   │    │  (pgvector) │    │  (Cognee)   │    │  (Claude)   │            │ │
│   │  │  <5ms       │    │  <50ms      │    │  <200ms     │    │  3-15s      │            │ │
│   │  │  100% conf  │    │  >0.85 conf │    │  >0.7 conf  │    │  Fallback   │            │ │
│   │  └─────────────┘    └─────────────┘    └─────────────┘    └─────────────┘            │ │
│   │        │                  │                  │                  │                      │ │
│   │        └──────────────────┴──────────────────┴──────────────────┘                      │ │
│   │                                   │                                                     │ │
│   │                                   ▼                                                     │ │
│   │                          Confidence-Based Exit                                          │ │
│   │                          (Stop at first tier with conf > threshold)                    │ │
│   │                                                                                        │ │
│   └───────────────────────────────────────────────────────────────────────────────────────┘ │
│               │                                                                              │
│               ▼                                                                              │
│   ┌───────────────────────┐                                                                  │
│   │  Response to IDE      │                                                                  │
│   │  • Answer/Suggestion  │                                                                  │
│   │  • Source tier        │                                                                  │
│   │  • Confidence score   │                                                                  │
│   │  • Latency            │                                                                  │
│   └───────────────────────┘                                                                  │
│                                                                                              │
└─────────────────────────────────────────────────────────────────────────────────────────────┘

3. Knowledge Graph Architecture (Cognee)¶

3.1 Multi-Source Data Ingestion¶

┌─────────────────────────────────────────────────────────────────────────────────────────────┐
│                           COGNEE DATA PIPELINE                                               │
├─────────────────────────────────────────────────────────────────────────────────────────────┤
│                                                                                              │
│  DATA SOURCES                                                                                │
│  ───────────                                                                                 │
│                                                                                              │
│  ┌───────────┐ ┌───────────┐ ┌───────────┐ ┌───────────┐ ┌───────────┐ ┌───────────┐      │
│  │  GitHub   │ │   Jira    │ │  Sentry   │ │  Datadog  │ │ Confluence│ │  Tests    │      │
│  │  PRs      │ │  Issues   │ │  Errors   │ │  APM      │ │  Docs     │ │  Results  │      │
│  └─────┬─────┘ └─────┬─────┘ └─────┬─────┘ └─────┬─────┘ └─────┬─────┘ └─────┬─────┘      │
│        │             │             │             │             │             │              │
│        └─────────────┴─────────────┴─────────────┴─────────────┴─────────────┘              │
│                                          │                                                   │
│                                          ▼                                                   │
│  ┌───────────────────────────────────────────────────────────────────────────────────────┐  │
│  │                        REDPANDA (Kafka Topics)                                         │  │
│  │                                                                                        │  │
│  │  argus.codebase.ingested    argus.test.created     argus.test.completed               │  │
│  │  argus.healing.event        argus.integration.*    argus.dlq.*                        │  │
│  │                                                                                        │  │
│  └───────────────────────────────────────────────────────────────────────────────────────┘  │
│                                          │                                                   │
│                                          ▼                                                   │
│  ┌───────────────────────────────────────────────────────────────────────────────────────┐  │
│  │                        COGNEE WORKER (ECL Pipeline)                                    │  │
│  │                                                                                        │  │
│  │  ┌──────────────────────────────────────────────────────────────────────────────────┐│  │
│  │  │  EXTRACT                                                                          ││  │
│  │  │  • Parse JSON/YAML/XML                                                            ││  │
│  │  │  • Extract text from docs                                                         ││  │
│  │  │  • Parse code ASTs                                                                ││  │
│  │  │  • Extract entities (files, functions, tests, errors, tickets)                    ││  │
│  │  └──────────────────────────────────────────────────────────────────────────────────┘│  │
│  │                                          │                                             │  │
│  │  ┌──────────────────────────────────────────────────────────────────────────────────┐│  │
│  │  │  COGNIFY (AI Enhancement)                                                         ││  │
│  │  │  • Generate embeddings (nomic-embed-text / text-embedding-3-small)                ││  │
│  │  │  • Classify data types & intents                                                  ││  │
│  │  │  • Extract relationships (file→test, error→code, ticket→PR)                      ││  │
│  │  │  • Detect patterns (flaky tests, recurring errors)                                ││  │
│  │  │  • Compute risk scores                                                            ││  │
│  │  └──────────────────────────────────────────────────────────────────────────────────┘│  │
│  │                                          │                                             │  │
│  │  ┌──────────────────────────────────────────────────────────────────────────────────┐│  │
│  │  │  LOAD                                                                             ││  │
│  │  │  • Store in PostgreSQL (structured data via Supabase)                             ││  │
│  │  │  • Store in pgvector (embeddings for semantic search)                             ││  │
│  │  │  • Store in FalkorDB (local knowledge graph)                                      ││  │
│  │  │  • Store in Neo4j Aura (cloud graph for complex queries)                          ││  │
│  │  │  • Store in Valkey (hot cache for frequent patterns)                              ││  │
│  │  │  • Store in R2/S3 (artifacts: screenshots, videos, logs)                          ││  │
│  │  └──────────────────────────────────────────────────────────────────────────────────┘│  │
│  │                                                                                        │  │
│  └───────────────────────────────────────────────────────────────────────────────────────┘  │
│                                                                                              │
└─────────────────────────────────────────────────────────────────────────────────────────────┘

3.2 Knowledge Graph Schema¶

┌─────────────────────────────────────────────────────────────────────────────────────────────┐
│                           UNIFIED KNOWLEDGE GRAPH                                            │
├─────────────────────────────────────────────────────────────────────────────────────────────┤
│                                                                                              │
│   ENTITIES                              RELATIONSHIPS                                        │
│   ────────                              ─────────────                                        │
│                                                                                              │
│   • File (code, test, config)           • File TESTS_FOR File                               │
│   • Function                            • Commit MODIFIES File                              │
│   • Class                               • Error OCCURS_IN File                              │
│   • Commit                              • Ticket REFERENCES Commit                          │
│   • PullRequest                         • User AUTHORED Commit                              │
│   • Test                                • Build DEPLOYS_TO Environment                      │
│   • TestRun                             • Test FLAKY_IN Environment                         │
│   • TestResult                          • Pattern HEALS Error                               │
│   • Error                               • Session TRIGGERS Error                            │
│   • StackTrace                          • Document DESCRIBES Feature                        │
│   • Ticket (Jira/Linear)                • Test COVERS Function                              │
│   • User                                • File DEPENDS_ON File                              │
│   • Team                                • PR CLOSES Ticket                                  │
│   • Deployment                          • Flag AFFECTS Test                                 │
│   • Environment                         • Incident CAUSED_BY Error                          │
│   • HealingPattern                                                                           │
│   • UserSession                                                                              │
│   • Document                                                                                 │
│   • FeatureFlag                                                                              │
│   • Incident                                                                                 │
│                                                                                              │
│   VECTOR INDICES (pgvector)                                                                  │
│   ────────────────────────                                                                   │
│   • Code embeddings (semantic code search)                                                   │
│   • Error embeddings (similar error matching)                                                │
│   • Document embeddings (knowledge search)                                                   │
│   • Test embeddings (test similarity)                                                        │
│   • Healing pattern embeddings (fix matching)                                                │
│                                                                                              │
└─────────────────────────────────────────────────────────────────────────────────────────────┘

4. Cross-Source Pattern Analysis¶

4.1 Implemented Correlation Types¶

┌─────────────────────────────────────────────────────────────────────────────────────────────┐
│                           CROSS-SOURCE CORRELATIONS (Implemented)                            │
├─────────────────────────────────────────────────────────────────────────────────────────────┤
│                                                                                              │
│  PATTERN 1: ERROR → ROOT CAUSE                                                              │
│  ─────────────────────────────                                                              │
│                                                                                              │
│    Sentry Error ──┬── Stack Trace ──── Code File ──── Recent Commit ──── Author             │
│                   │                                                                          │
│                   └── Similar Errors ──── Past Fixes ──── Healing Pattern                   │
│                                                                                              │
│    MCP Tool: argus_correlations_root_cause, argus_event_triage                              │
│    Backend: RootCauseAnalyzerAgent + Cognee graph traversal                                 │
│                                                                                              │
│  ────────────────────────────────────────────────────────────────────────────────────────── │
│                                                                                              │
│  PATTERN 2: CODE CHANGE → TEST IMPACT                                                       │
│  ───────────────────────────────────                                                        │
│                                                                                              │
│    PR Files ──┬── Modified Functions ──── Tests Covering Functions                          │
│               │                                                                              │
│               └── Dependency Graph ──── Transitive Test Dependencies                        │
│                                                                                              │
│    MCP Tool: argus_cicd_test_impact                                                         │
│    Backend: TestImpactAnalyzerAgent + Precomputed Matrix (Flink)                            │
│    Latency: 20-50ms (precomputed) vs 2-3s (LLM fallback)                                    │
│                                                                                              │
│  ────────────────────────────────────────────────────────────────────────────────────────── │
│                                                                                              │
│  PATTERN 3: PRODUCTION ERROR → TEST GAP                                                     │
│  ─────────────────────────────────────                                                      │
│                                                                                              │
│    Sentry Error ──── Code Location ──── Test Coverage Check ──── No Test Found              │
│                                                                          │                   │
│                                                       argus_test_from_event                  │
│                                                                          │                   │
│                                                       Auto-Generate Test                     │
│                                                                                              │
│    MCP Tool: argus_coverage_gaps, argus_test_from_event                                     │
│    Backend: Production event indexing + Coverage analysis                                    │
│                                                                                              │
│  ────────────────────────────────────────────────────────────────────────────────────────── │
│                                                                                              │
│  PATTERN 4: FLAKY TEST → ROOT CAUSE                                                         │
│  ─────────────────────────────────                                                          │
│                                                                                              │
│    Test Run History ──── Pass/Fail Pattern ──── Environment Correlation                     │
│                │                                                                             │
│                └── Time of Day ──── Load Metrics ──── External Dependencies                 │
│                                                                                              │
│    MCP Tool: argus_healing_patterns (flaky detection)                                       │
│    Backend: FlakyTestDetectorAgent + Statistical analysis                                   │
│                                                                                              │
│  ────────────────────────────────────────────────────────────────────────────────────────── │
│                                                                                              │
│  PATTERN 5: DEPLOYMENT → RISK ASSESSMENT                                                    │
│  ──────────────────────────────────────                                                     │
│                                                                                              │
│    PR Changes ──┬── Historical Bug Rate of Files                                            │
│                 │                                                                            │
│                 ├── Test Coverage Delta                                                      │
│                 │                                                                            │
│                 ├── Author's Past Success Rate                                              │
│                 │                                                                            │
│                 └── Missing Acceptance Criteria                                             │
│                                                                                              │
│    MCP Tool: argus_cicd_deployment_risk, argus_risk_scores                                  │
│    Backend: Multi-source correlation (GitHub + Jira + Test Results)                         │
│                                                                                              │
└─────────────────────────────────────────────────────────────────────────────────────────────┘

5. Tiered Deployment Model¶

5.1 MCP/Plugin Behavior by Tier¶

┌─────────────────────────────────────────────────────────────────────────────────────────────┐
│                           MCP/PLUGIN CAPABILITIES BY TIER                                    │
├──────────────────┬──────────────────────────────────────────────────────────────────────────┤
│                  │                                                                           │
│  FREE            │  FULL MCP ACCESS (73 tools)                                              │
│  ($0)            │  • All tools available (rate-limited)                                    │
│                  │  • Cloud processing only                                                 │
│                  │  • 100 tests/month                                                       │
│                  │  • Shared healing patterns (cross-tenant anonymized)                     │
│                  │  • Basic Cognee queries                                                  │
│                  │                                                                           │
├──────────────────┼──────────────────────────────────────────────────────────────────────────┤
│                  │                                                                           │
│  STARTER         │  FULL MCP ACCESS + GitHub Integration                                    │
│  ($49/mo)        │  • argus_cicd_* tools enabled                                            │
│                  │  • Slack notifications via argus_ask                                     │
│                  │  • 500 tests/month                                                       │
│                  │  • Higher rate limits                                                    │
│                  │                                                                           │
├──────────────────┼──────────────────────────────────────────────────────────────────────────┤
│                  │                                                                           │
│  PRO             │  FULL MCP + Local MCP Server Option                                      │
│  ($99/mo)        │  • Docker deployment available                                           │
│                  │  • Local code indexing (src/mcp/local_mcp_server.py)                     │
│                  │  • Code never leaves machine (privacy mode)                              │
│                  │  • BYOK (Bring Your Own Key) for LLM                                     │
│                  │  • 2,000 tests/month                                                     │
│                  │  • Git-aware self-healing (99.9% accuracy)                               │
│                  │                                                                           │
├──────────────────┼──────────────────────────────────────────────────────────────────────────┤
│                  │                                                                           │
│  TEAM            │  FULL MCP + Team Collaboration                                           │
│  ($299/mo)       │  • argus_presence, argus_comments enabled                                │
│                  │  • SSO integration (Okta, Azure AD)                                      │
│                  │  • Dedicated Supabase namespace                                          │
│                  │  • Private cloud option                                                  │
│                  │  • Cross-project pattern sharing                                         │
│                  │  • 10,000 tests/month                                                    │
│                  │                                                                           │
├──────────────────┼──────────────────────────────────────────────────────────────────────────┤
│                  │                                                                           │
│  ENTERPRISE      │  SELF-HOSTED MCP + Brain                                                 │
│  (Custom)        │  • Deploy entire stack on-premise (K8s/OpenShift)                        │
│                  │  • On-premise Ollama/vLLM option                                         │
│                  │  • Zero external calls possible                                          │
│                  │  • Custom model routing                                                  │
│                  │  • SAML 2.0 SSO                                                          │
│                  │  • Unlimited tests                                                       │
│                  │                                                                           │
├──────────────────┼──────────────────────────────────────────────────────────────────────────┤
│                  │                                                                           │
│  AIR-GAPPED      │  100% LOCAL OPERATION                                                    │
│  (Custom)        │  • No internet required                                                  │
│                  │  • Local LLM only (Llama 3, Mistral)                                     │
│                  │  • Offline license activation                                            │
│                  │  • HSM-backed encryption                                                 │
│                  │  • STIG-hardened containers                                              │
│                  │                                                                           │
└──────────────────┴──────────────────────────────────────────────────────────────────────────┘

5.2 Local MCP Server Architecture (Pro+)¶

┌─────────────────────────────────────────────────────────────────────────────────────────────┐
│                           LOCAL MCP SERVER (Privacy Mode)                                    │
├─────────────────────────────────────────────────────────────────────────────────────────────┤
│                                                                                              │
│   DEVELOPER MACHINE                                                                          │
│   ─────────────────                                                                          │
│                                                                                              │
│   ┌───────────────────────────────────────────────────────────────────────────────────────┐ │
│   │                        LOCAL MCP SERVER (src/mcp/local_mcp_server.py)                  │ │
│   │                                                                                        │ │
│   │   ┌────────────────────────────────────────────────────────────────────────────────┐ │ │
│   │   │  LOCAL ANALYSIS (Code Never Leaves)                                             │ │ │
│   │   │                                                                                  │ │ │
│   │   │  • Tree-sitter AST parsing (all languages)                                      │ │ │
│   │   │  • Merkle tree change detection                                                 │ │ │
│   │   │  • Local embeddings (nomic-embed via Ollama)                                    │ │ │
│   │   │  • Semantic chunking                                                            │ │ │
│   │   │  • Test mapping                                                                 │ │ │
│   │   │  • Git history analysis                                                         │ │ │
│   │   └────────────────────────────────────────────────────────────────────────────────┘ │ │
│   │                                                                                        │ │
│   │   ┌────────────────────────────────────────────────────────────────────────────────┐ │ │
│   │   │  LOCAL STORAGE                                                                  │ │ │
│   │   │                                                                                  │ │ │
│   │   │  • SQLite (structured data)                                                     │ │ │
│   │   │  • LanceDB (vector embeddings)                                                  │ │ │
│   │   │  • Local file index                                                             │ │ │
│   │   └────────────────────────────────────────────────────────────────────────────────┘ │ │
│   │                                                                                        │ │
│   │   ┌────────────────────────────────────────────────────────────────────────────────┐ │ │
│   │   │  CLOUD SYNC (Metadata Only)                                                     │ │ │
│   │   │                                                                                  │ │ │
│   │   │  • Test results (no code)                                                       │ │ │
│   │   │  • Healing patterns (anonymized)                                                │ │ │
│   │   │  • Quality metrics                                                              │ │ │
│   │   │  • Obfuscated file paths (optional)                                             │ │ │
│   │   └────────────────────────────────────────────────────────────────────────────────┘ │ │
│   │                                                                                        │ │
│   └───────────────────────────────────────────────────────────────────────────────────────┘ │
│                                                                                              │
│   Key Files:                                                                                 │
│   • src/mcp/local_mcp_server.py (MCP stdio server)                                          │
│   • src/indexer/tree_sitter_parser.py (AST parsing)                                         │
│   • src/indexer/merkle_tree.py (change detection)                                           │
│   • src/indexer/local_embedder.py (on-device embeddings)                                    │
│   • src/indexer/semantic_chunker.py (code splitting)                                        │
│                                                                                              │
└─────────────────────────────────────────────────────────────────────────────────────────────┘

6. Linear: Recent Completions (Last 7 Days)¶

6.1 MCP Tools Sprint (RAP-276 → RAP-282) ✅¶

All 7 MCP tool expansions completed: - RAP-276: CI/CD Integration (4 tools) - Urgent ✅ - RAP-277: Discovery & Crawling (4 tools) - High ✅ - RAP-278: Time Travel Debugging (5 tools) - High ✅ - RAP-279: Visual AI Testing (5 tools) - Medium ✅ - RAP-280: Scheduling & Automation (4 tools) - Medium ✅ - RAP-281: Correlation & Analytics (4 tools) - Medium ✅ - RAP-282: API Testing (3 tools) - Medium ✅

6.2 Intelligence Layer (UIIL) Sprint (RAP-236 → RAP-250) ✅¶

All 15 intelligence layer tickets completed: - RAP-236: Intelligence cache module (Valkey wrapper) ✅ - RAP-237: AI Gateway caching (90% latency reduction) ✅ - RAP-238: Query router with intent detection ✅ - RAP-239: Sentry integration topic/handler ✅ - RAP-240: Precomputed results table migration ✅ - RAP-241: precomputed.py reader module ✅ - RAP-242: Jira integration handler ✅ - RAP-243: GitHub PR integration handler ✅ - RAP-244: Confluence integration handler ✅ - RAP-245: Flink test impact precomputation ✅ - RAP-246: Flink failure clustering job ✅ - RAP-247: Workers AI for intent classification ✅ - RAP-248: Vectorize edge vector search ✅ - RAP-249: Healing API intelligence layer ✅ - RAP-250: CI/CD API precomputed matrix ✅

6.3 Agent Evaluation (Backlog)¶

Agent testing tickets created but not started: - RAP-257 to RAP-275: Individual agent tests (Tier 1-3)

7. Summary: What Argus Actually Is¶

┌─────────────────────────────────────────────────────────────────────────────────────────────┐
│                           ARGUS: QUALITY INTELLIGENCE PLATFORM                               │
├─────────────────────────────────────────────────────────────────────────────────────────────┤
│                                                                                              │
│  NOT JUST TESTING → FULL SDLC/STLC INTELLIGENCE                                             │
│  ─────────────────────────────────────────────                                              │
│                                                                                              │
│  ┌─────────────────────────────────────────────────────────────────────────────────────┐   │
│  │                                                                                      │   │
│  │   INTERFACE LAYER                                                                    │   │
│  │   ───────────────                                                                    │   │
│  │   • MCP Server (73 tools, Cloudflare Workers)                                        │   │
│  │   • Claude Code Plugin                                                               │   │
│  │   • Cursor / Windsurf / VS Code support                                              │   │
│  │   • Dashboard (Next.js 15, 19 pages)                                                 │   │
│  │   • REST API (60+ endpoints)                                                         │   │
│  │                                                                                      │   │
│  └─────────────────────────────────────────────────────────────────────────────────────┘   │
│                                          │                                                   │
│                                          ▼                                                   │
│  ┌─────────────────────────────────────────────────────────────────────────────────────┐   │
│  │                                                                                      │   │
│  │   INTELLIGENCE LAYER                                                                 │   │
│  │   ──────────────────                                                                 │   │
│  │   • 27 AI Agents (specialized for different tasks)                                   │   │
│  │   • Multi-tier resolution (Cache → Vector → Graph → LLM)                             │   │
│  │   • Query router with intent classification                                          │   │
│  │   • Supervisor orchestration (LangGraph)                                             │   │
│  │   • Multi-model routing (13 providers)                                               │   │
│  │                                                                                      │   │
│  └─────────────────────────────────────────────────────────────────────────────────────┘   │
│                                          │                                                   │
│                                          ▼                                                   │
│  ┌─────────────────────────────────────────────────────────────────────────────────────┐   │
│  │                                                                                      │   │
│  │   KNOWLEDGE LAYER (Cognee)                                                           │   │
│  │   ────────────────────────                                                           │   │
│  │   • Unified knowledge graph (FalkorDB + Neo4j)                                       │   │
│  │   • Vector embeddings (pgvector)                                                     │   │
│  │   • Hot cache (Valkey)                                                               │   │
│  │   • Event streaming (Redpanda)                                                       │   │
│  │   • Stream processing (Flink)                                                        │   │
│  │                                                                                      │   │
│  └─────────────────────────────────────────────────────────────────────────────────────┘   │
│                                          │                                                   │
│                                          ▼                                                   │
│  ┌─────────────────────────────────────────────────────────────────────────────────────┐   │
│  │                                                                                      │   │
│  │   DATA SOURCES (10+ Categories)                                                      │   │
│  │   ─────────────────────────────                                                      │   │
│  │   • Code & VCS (GitHub, GitLab)                                                      │   │
│  │   • Project Management (Jira, Linear)                                                │   │
│  │   • Observability (Sentry, Datadog, New Relic)                                       │   │
│  │   • CI/CD (GitHub Actions, Vercel)                                                   │   │
│  │   • Documentation (Confluence)                                                       │   │
│  │   • Analytics (Amplitude, PostHog - planned)                                         │   │
│  │   • Communication (Slack)                                                            │   │
│  │   • Incidents (PagerDuty)                                                            │   │
│  │   • Feature Flags (LaunchDarkly)                                                     │   │
│  │   • Test Results (internal)                                                          │   │
│  │                                                                                      │   │
│  └─────────────────────────────────────────────────────────────────────────────────────┘   │
│                                                                                              │
│  KEY DIFFERENTIATORS                                                                         │
│  ───────────────────                                                                         │
│  1. MCP-first: AI IDE is the primary interface, not a web dashboard                         │
│  2. Privacy-preserving: Local MCP option keeps code on-device                               │
│  3. Multi-source intelligence: Correlates data across 10+ platforms                         │
│  4. Self-healing: 99.9% accuracy with git-aware healing                                     │
│  5. Tiered resolution: <5ms cache hits, <15% LLM fallback                                   │
│  6. Enterprise-ready: Air-gapped deployment, SOC2, HIPAA (roadmap)                          │
│                                                                                              │
└─────────────────────────────────────────────────────────────────────────────────────────────┘

8. Gaps & Roadmap¶

8.1 Current Gaps¶

Area	Status	Priority
SSO (SAML 2.0)	🔄 Needs Work	High
Fine-grained RBAC	🔄 Needs Work	High
SOC2 Type 2 Certification	🔄 In Progress	High
Air-gapped packaging	📋 Planned	Medium
FullStory integration	📋 Planned	Medium
PostHog integration	📋 Planned	Medium
Offline license activation	📋 Planned	Low

8.2 Q1 2026 Roadmap (From Linear)¶

Complete agent evaluation tests (RAP-257 to RAP-275)
SSO integration (Okta, Azure AD)
Enhanced RBAC for Enterprise
SOC2 Type 2 audit completion
Local indexing MVP refinement

Document Author: Claude (Analysis) Source: Argus Backend, Dashboard, MCP Server, argus-docs, Linear Confidence: High (based on actual code review)