Dashboard Cards
Cards are the building blocks of your dashboards. Each card shows specific information that you can resize, move, and configure.
Card Features
Every card has:
- Drag handle - Move it around
- Menu button - Configure, replace, or remove
- AI button - Ask AI about this card
- Expand button - Make it full screen
- Refresh indicator - See when data was last updated
All 120+ Card Types
The console ships with 120+ built-in cards, and you can create more using the Card Factory. Below are the main categories.
Cluster Health Cards (7)
| # | Card | What it shows |
|---|
| 1 | Cluster Health | Health status of all clusters with green/red/gray indicators |
| 2 | Cluster Metrics | Time-series graphs of CPU, memory, pods, nodes |
| 3 | Cluster Focus | Detailed view of specific cluster |
| 4 | Cluster Comparison | Side-by-side comparison of multiple clusters |
| 5 | Cluster Costs | Cost breakdown per cluster |
| 6 | Upgrade Status | Version info and available upgrades |
| 7 | Cluster Resource Tree | Hierarchical view of cluster resources |
Workload Cards (6)
| # | Card | What it shows |
|---|
| 8 | Deployment Status | Donut chart of deployment health |
| 9 | Deployment Issues | Table of deployments with problems |
| 10 | Deployment Progress | Rollout progress gauge |
| 11 | Pod Issues | Table of pods with problems (crashes, OOM, etc.) |
| 12 | Top Pods | Bar chart of top resource-consuming pods |
| 13 | App Status | Overall application health status |
Compute Cards (8)
| # | Card | What it shows |
|---|
| 14 | Compute Overview | Summary of CPU, memory, nodes, pods, GPUs |
| 15 | Resource Usage | Gauge showing CPU/memory/GPU utilization |
| 16 | Resource Capacity | Bar chart of used vs available resources |
| 17 | GPU Overview | Summary of GPU resources and utilization |
| 18 | GPU Status | Donut chart of GPU allocation |
| 19 | GPU Inventory | Table of GPU nodes with types and counts |
| 20 | GPU Workloads | Table of workloads using GPUs |
| 21 | GPU Usage Trend | Time-series graph of GPU utilization |
Storage Cards (2)
| # | Card | What it shows |
|---|
| 22 | Storage Overview | Summary of storage resources |
| 23 | PVC Status | Table of Persistent Volume Claims |
Network Cards (3)
| # | Card | What it shows |
|---|
| 24 | Network Overview | Summary of network resources |
| 25 | Service Status | Table of services |
| 26 | Cluster Network | Network status per cluster |
GitOps Cards (7)
| # | Card | What it shows |
|---|
| 27 | Helm Release Status | Status of Helm releases |
| 28 | Helm History | Event timeline of Helm deployments |
| 29 | Helm Values Diff | Compare Helm values between releases |
| 30 | Chart Versions | Available chart version updates |
| 31 | Kustomization Status | Status of Kustomize overlays |
| 32 | Overlay Comparison | Compare Kustomize overlays |
| 33 | GitOps Drift | Detect when clusters don’t match git |
ArgoCD Cards (3)
| # | Card | What it shows |
|---|
| 34 | ArgoCD Applications | Status of ArgoCD apps |
| 35 | ArgoCD Sync Status | Donut chart of sync status |
| 36 | ArgoCD Health | Health status of ArgoCD |
CNCF Ecosystem Cards (6)
| # | Card | What it shows |
|---|
| 118 | KubeVela | Controller health, OAM application delivery status, workflow progress, component/trait counts |
| 119 | KEDA | Operator health, scaled object stats with replica progress bars, trigger details |
| 120 | Strimzi Kafka | Cluster health, broker readiness, topic status, consumer group lag |
| 121 | OpenFeature | Provider health (flagd, LaunchDarkly, Split), feature flag stats, evaluation counts |
| 122 | Drasi Reactive Graph | Reactive data pipeline visualization — sources, continuous queries, reactions with animated flow lines, live SSE streaming, full CRUD, CodeMirror Cypher editor, per-language stream-consumer code samples, flow discovery. Supports drasi-server, drasi-platform, and drasi-lib. See Drasi Dashboard for full docs. |
| 123 | Keycloak Monitoring | Keycloak instance health, realm counts, client counts, user session metrics |
Operator Cards (3)
| # | Card | What it shows |
|---|
| 37 | Operator Status | Status of OLM operators |
| 38 | Operator Subscriptions | Table of operator subscriptions |
| 39 | CRD Health | Health of Custom Resource Definitions |
Namespace Cards (4)
| # | Card | What it shows |
|---|
| 40 | Namespace Overview | Summary of namespace resources |
| 41 | Namespace Quotas | Gauge of quota usage |
| 42 | Namespace RBAC | Table of RBAC rules |
| 43 | Namespace Events | Event stream for namespace |
Security & Events Cards (3)
| # | Card | What it shows |
|---|
| 44 | Security Issues | Table of security problems |
| 45 | Event Stream | Live event feed |
| 46 | User Management | Table of console users |
Live Trend Cards (4)
| # | Card | What it shows |
|---|
| 47 | Events Timeline | Time-series of events |
| 48 | Pod Health Trend | Time-series of pod health |
| 49 | Resource Trend | Time-series of resource usage |
| 50 | GPU Utilization | Time-series of GPU usage |
AI Cards (3)
| # | Card | What it shows |
|---|
| 51 | AI Issues | Issues detected by AI |
| 52 | Kubeconfig Audit | Audit of your kubeconfig |
| 53 | AI Health Check | AI health check gauge |
Alerting Cards (2)
| # | Card | What it shows |
|---|
| 54 | Active Alerts | Currently firing alerts |
| 55 | Alert Rules | Table of alert rules |
Cost Cards (3)
| # | Card | What it shows |
|---|
| 56 | Cluster Costs | Cost per cluster |
| 57 | OpenCost Overview | OpenCost integration data |
| 58 | Kubecost Overview | Kubecost integration data |
Policy Cards (2)
| # | Card | What it shows |
|---|
| 59 | OPA Policies | OPA Gatekeeper policies |
| 60 | Kyverno Policies | Kyverno policy status |
Compliance Cards (5)
| # | Card | What it shows |
|---|
| 61 | Compliance Score | Overall compliance percentage (CIS, NSA, PCI) |
| 62 | Compliance Findings | Table of compliance findings by severity |
| 63 | Security Posture | Combined security posture overview |
| 116 | Compliance Trestle (OSCAL) | OSCAL-based compliance assessment — overall score, per-profile breakdowns (NIST 800-53, FedRAMP), per-cluster status |
| 117 | Recommended Policies | AI-powered fleet-wide compliance gap analysis — identifies missing policies with-click Deploy All |
Provider Health Cards (1)
| # | Card | What it shows |
|---|
| 64 | Provider Health | Status of AI providers (Claude, OpenAI, Gemini) and cloud providers |
Workload Monitor Cards (2)
| # | Card | What it shows |
|---|
| 65 | Workload Status | Cascading cluster/namespace/workload selector with resource details |
| 66 | Resource Allocation | Resource allocation across clusters |
llm-d Inference Cards (10)

| # | Card | What it shows |
|---|
| 67 | llm-d Request Flow | Animated request flow through the inference stack with throughput/latency metrics |
| 68 | KV Cache Monitor | KV cache utilization, per-pod cache stats, aggregated/per-pod toggle |
| 69 | EPP Routing | Endpoint Picker routing decisions with RPS and routing distribution |
| 70 | P/D Disaggregation | Prefill and Decode server load, queue depth, throughput, TPOT, GPU memory |
| 71 | llm-d Benchmarks | Stacks vs Comparison vs Latency views with TTFT, throughput, bar charts |
| 72 | llm-d AI Insights | AI-generated insights about balanced P/D configuration and optimization |
| 73 | llm-d Configurator | Configure inference strategies: Intelligent Scheduling, P/D Disaggregation, Wide Expert Parallelism, Variant Autoscaling |

| # | Card | What it shows |
|---|
| 74 | llm-d Stack | Stack health, component status, model serving details with cluster discovery |
| 75 | llm-d Models | Loaded models with namespace, cluster, and GPU allocation |
| 76 | llm-d Inference Servers | Running inference servers with status and throughput |
PROW CI Cards (3)
| # | Card | What it shows |
|---|
| 77 | PROW CI Monitor | Overall PROW health: success rate, job counts (running, pending, failed) |
| 78 | PROW Jobs | Filterable job list with type, state, PR number, duration, and age |
| 79 | PROW History | Revision history with pass/fail trends |
Hardware Health Card (1)

| # | Card | What it shows |
|---|
| 80 | Hardware Health | GPU/accelerator node health with alerts, inventory, IPMI-style monitoring. Shows critical/warning counts, device search, and per-device status with disappearance tracking |
Predictive Health Card (1)

| # | Card | What it shows |
|---|
| 81 | Predictive Health Monitor | AI-powered failure prediction with offline node count, GPU issues, and predicted failures. Shows confidence levels, severity, and correlates with traffic patterns |
ML Job & Notebook Cards (2)
| # | Card | What it shows |
|---|
| 82 | ML Jobs | Running ML training jobs (Kubeflow, Ray, custom) with GPU count, ETA, and status |
| 83 | ML Notebooks | Active Jupyter/notebook servers with user, resources, and status |
Kagenti AI Agent Cards (7)
| # | Card | What it shows |
|---|
| 84 | Kagenti Overview | Agent count, MCP tools, builds, framework breakdown (LangGraph, CrewAI, AG2) |
| 85 | Agent Fleet | Searchable agent list with cluster, framework, replicas, and status |
| 86 | Agent Topology | Visual topology of agent relationships and dependencies |
| 87 | SPIFFE Identity | SPIFFE identity coverage and certificate status |
| 88 | Agent Builds | Build history with status (succeeded, failed, building) |
| 89 | Agent MCP Tools | MCP tool inventory per agent |
| 90 | Agent Logs | Aggregated agent logs with filtering |
Deploy Cards (5)
| # | Card | What it shows |
|---|
| 91 | Workloads | All workloads with status, drag-to-deploy to cluster groups |
| 92 | Cluster Groups | Target groups (production, staging, edge) with health |
| 93 | Deployment Missions | AI-assisted deployment missions with status tracking |
| 94 | Resource Marshall | Cascading cluster/namespace/workload selector for resource placement |
| 95 | Deployment History | Timeline of recent deployments with rollback options |
GPU Node Health Monitor (1)
| # | Card | What it shows |
|---|
| 96 | GPU Node Health Monitor | Proactive GPU health checks across 4 tiers (Critical, Standard, Full, Deep). CronJob management, per-node results, alert integration, AI Diagnose button |
Flatcar Container Linux Card (1)
| # | Card | What it shows |
|---|
| 97 | Flatcar Container Linux Status | Flatcar node count, OS version distribution, update status and health |
Nightly E2E Test Cards (1)
| # | Card | What it shows |
|---|
| 98 | Nightly E2E Status | Run history dots (green=pass, red=fail, amber=GPU unavailable, blue=running), per-run metadata, log/artifact links, AI Diagnose on failures |
| Card | What it shows |
|---|
| Crossplane Managed Resources | Managed resource count, provider health, composite resource status, resource table with sync/ready status |
| Cloud Native Buildpacks | Build counts, success rates, active builders, recent builds with duration and builder info |
AI Codebase Maturity (ACMM) Cards (4)
| # | Card | What it shows |
|---|
| 124 | ACMM Level | L1–L5 ring gauge showing the scanned repo’s AI codebase maturity level, role, and next-level progress |
| 125 | ACMM Balance | Weekly AI vs human contribution bar chart with a level-anchored target slider |
| 126 | ACMM Feedback Loops | Inventory of 33 feedback loops filtered by source and detected/missing status |
| 127 | ACMM Recommendations | Role-aware top-5 missing criteria prioritized for reaching the next level |
All four cards are driven by a single GitHub scan. See ACMM Dashboard for full docs.
Additional Cards (44+)
The console includes 44+ additional specialized cards across categories like:
- Events - Event timeline and filtering
- Data Compliance - Data classification and compliance checks
- Arcade - 21 Kubernetes-themed games (AI Checkers, Kube Chess, Container Tetris, etc.)
- Card History - Track card changes over time
- User Management - Console user management
- Weather, Stocks, RSS - Widget-style cards for external data
Plus any custom cards you create using the Card Factory.
Visualization Types
Cards use different ways to show data:
| Type | Icon | What it looks like |
|---|
| Gauge | ⏱️ | Circular progress indicator |
| Table | 📋 | Rows and columns of data |
| Timeseries | 📈 | Line chart over time |
| Events | 📜 | Scrolling event feed |
| Donut | 🍩 | Pie/donut chart |
| Bar | 📊 | Bar chart |
| Status | 🚦 | Status indicators (green/yellow/red) |
Adding Cards
- Click the Add Card button
- Browse by category or search
- Click a card to add it
- Drag it where you want
- Click the menu to configure it
Creating Custom Cards (Card Factory)
Don’t see the card you need? Create your own:
- Open the Card Factory
- Choose your method:
- AI-Assisted - Describe what you want in plain English
- JSON - Write a declarative card definition
- TSX Code - Write a React component (compiled at runtime)
- Preview your card
- Add it to any dashboard
Configuring Cards
Click the menu (three dots) on any card:
- Configure - Change settings like filters, refresh interval
- Replace - Swap for a different card type
- Remove - Take it off your dashboard
Common Configuration Options
- Clusters - Show data from specific clusters
- Namespaces - Filter to specific namespaces
- Refresh interval - How often to update
- Show count - How many items to display
AI Card Suggestions
In High AI mode, the console watches what you look at and suggests new cards:
- AI notices you’re focusing on pods
- It suggests adding the Pod Issues card
- You can Accept, Snooze (1 hour), or Dismiss
This helps your dashboard evolve with your needs!