Driving AI Performance Visibility and ROI: New Virtana Platform Features

Back to Blog

Most teams don’t need another dashboard; they need answers that hold up in a post-incident review and a budget meeting. This release focuses on that outcome. It makes token spend, GPU efficiency, Kubernetes health, storage performance, and network behavior visible in one operating picture you can act on. You’ll see where dollars are going, why latency spikes happen, and which fixes reduce waste without risking reliability. Support for managed Kubernetes, deeper storage and backup visibility, and practical export and sharing options keep the workflow simple: observe, prove, fix, report.

Global View: Cost and performance clarity without tool switching for AI Workloads

Token Usage Dashboard

Most teams can’t see token consumption in the same place they track performance and errors. The Token Usage Dashboard connects token usage from LLM-as-a-Service to the metrics your business cares about: cost, latency, and error rates. This makes it possible to spot inefficient prompts or bursty workloads and tie them to real dollars and user impact.

Business value: Clear line of sight from usage to cost and reliability, so product and finance can agree on what to tune or throttle.

GPU Fleet Analysis Dashboard

GPU fleets are expensive to run and easy to underutilize. This dashboard shows power, temperature, utilization, and spend across on-prem and cloud, alongside indicators that point to throttling and waste. Root-cause assistance helps teams connect hot spots or idle periods to specific jobs and dependencies.

Business value: Reduce waste, prevent slowdowns from thermal and power issues, and defend budgets with evidence.

Service Observability alerts visible in Global View

Alerts from Virtana Service Observability now appear next to signals from Container Observability, Infrastructure Observability, and third-party sources. Teams can review incidents in one place instead of bouncing between consoles.

Business value: Faster triage and fewer missed handoffs, which improve MTTR.

These are just a few of the latest capabilities available to Virtana customers. Read the Global View technical blog to learn more about the rest.

Container Observability: Day-2 confidence for managed and hybrid Kubernetes

GKE Autopilot support and partnership

Virtana Container Observability is approved to run on GKE Autopilot. You get real-time performance visibility across managed clusters, plus the ability to cross-launch for deeper pod-level investigation when needed.

Business value: Operate on Google’s managed Kubernetes with confidence while keeping the same troubleshooting workflows.

AWS FSx for NetApp ONTAP monitoring

Storage is often the hidden cause of tail latency. You can now monitor FSx for ONTAP and correlate those signals with on-prem and cloud events through Global View’s correlation policies.

Business value: One place to see storage behavior across environments, which speeds up incident resolution and capacity planning.

Azure Confluent monitoring

Track the performance of Confluent Kafka on Azure and capture alerts such as sustained drops in message counts below certain thresholds.

Business value: Earlier detection of throughput issues in data pipelines that feed services and models.

More container observation capabilities are available to Virtana customers. Read the Container Observability technical blog to learn more about the rest.

Infrastructure Observability: Resilience and topology you can act on

Dell PowerProtect integration

Infrastructure Observability now ingests metrics and inventory from Dell PowerProtect appliances using native interfaces (Agentless). A topology view shows how backup components relate to protected assets.

Business value: Understand backup health in context with the rest of the stack, verify protection status during incidents, and confirm RPO and RTO expectations with concrete data.

Dell ObjectScale integration (Early Access)

Infrastructure Observability now discovers ObjectScale clusters and collects capacity, performance, and health metrics at the cluster level. Futures include tenant levels, including namespace and bucket detail. A topology view maps front-end endpoints, nodes, and dependent services so you can correlate storage behavior with upstream applications.

Business value: Detect saturation and skew before they impact services, tie object storage latency and errors to user-facing symptoms, and plan capacity with evidence rather than guesswork.

A lot more capabilities came to Virtana Infrastructure Observability customers in this release. Read the Infrastructure Observability technical blog to find out more.

Service Observability: Faster triage, clearer reporting

Cisco Meraki Cloud-to-Cloud Connection

High-capacity, lightweight collection for Meraki brings scalable monitoring without on-prem agents, even during maintenance windows.

Business value: Gain dependable visibility for distributed sites and branch networks, helping teams prove or rule out the network during incidents.

Advanced Kubernetes monitoring for Service Observability

Cross-launch from Service Observability into Container Observability to drill into pod-level telemetry, including logs and traces.

Business value: Shorten mean time to resolution by moving from an alert to detailed Kubernetes evidence without changing tools.

The Virtana Service Observability team has been busy delivering even more advancements into the platform, so read the Service Observability technical blog to learn more.

Conclusion

Running AI in production demands evidence. You get cost truth from token tracking, efficiency truth from GPU power, thermals, and utilization, and operational truth from correlated signals across Kubernetes, storage, backups, and network. That gives you faster triage, fewer misdiagnoses, lower run costs, and reports that finance can stand behind.

To see these updates against your AI workloads or discuss deployment options, contact your Virtana sales representative or request a demo.

The Deepest and Broadest Observability Platform

Virtana helps teams keep critical services healthy by connecting performance, capacity, and cost signals across on-premises, cloud, and Kubernetes environments. Get a clear view of what is changing, what is constrained, and what is driving impact, so you can troubleshoot faster and plan with confidence. From day-to-day incident response to long-term infrastructure planning, Virtana supports the workflows teams rely on to reduce downtime, avoid resource waste, and keep service levels on track. Let’s get deeper

Learn More

Virtana Insight

Artificial Intelligence

April 15 2026Virtana Insight

What We Learned at Nutanix .NEXT 2026: From Platform Adoption to Operational Reality

Nutanix .NEXT 2026 made one thing clear: the conversation has shifted from “why Nutanix” to...

Virtana Platform

January 30 2026Meeta Lalwani

Virtana Platform Achieves TX-RAMP Level 2 Certification

Delivering Secure Full-Stack, GPU, and AI Observability for Public Sector Virtana announce...

AIFO

September 30 2025Paul Appleby

Building AI Infrastructure the Right Way: Why Observability Matters More Than Ever

When I wrote recently in Forbes that we’re racing toward an AI-everywhere future without th...

Driving AI Performance Visibility and ROI: What’s New in Virtana Platform