Storage Guide

Aegis Managed Pure Storage: Co-Managed FlashArray Operations

Pure Storage delivers one of the operationally simplest enterprise storage platforms available, but simple doesn't mean zero operations. This guide explains what co-managed Pure FlashArray operations actually covers and when it makes sense for disaggregated environments.

The focus is operational attention and coordination, not complexity management — handling the human judgment required for operational continuity while Pure's Evergreen model handles hardware proactively.

⏱ 18 min read Operations-focused · Cross-platform · Evergreen-aware

Key Takeaways

  • Pure Storage's Evergreen model handles hardware proactively, but Pure environments still require human judgment for operational continuity including upgrade scheduling, alert assessment, capacity management, and performance anomaly investigation.
  • Aegis Managed Pure provides purpose-built monitoring via LogicMonitor's native Pure FlashArray datasource, incident response coordination with Pure TAC and Evergreen, and lifecycle management for Purity OS upgrades and hardware refresh planning.
  • Cross-platform correlation between Pure FlashArray metrics and hypervisor data isolates performance problems to the correct layer quickly — per-volume IOPS from Pure combined with VM-to-volume mapping identifies noisy VMs without manual investigation across multiple tools.
  • NVMe-oF path health monitoring spans Pure initiator sessions and network fabric Ethernet paths, providing end-to-end visibility that neither vendor's native monitoring delivers alone.
  • Aegis Managed Pure fits disaggregated environments running hypervisor plus Pure FlashArray where unified operations management across both platforms through a single engagement delivers more value than managing Pure in isolation.

A different approach to Pure operations

Pure Storage delivers one of the operationally simplest enterprise storage platforms available. Evergreen provides non-disruptive controller upgrades. Pure1 delivers proactive health telemetry. Purity OS upgrades execute without downtime. Drive failures get handled proactively.

The question becomes: what does co-managed Pure actually cover?

The answer is operational attention and coordination, not complexity management.

Pure environments still require human judgment for operational continuity. Purity OS upgrades need scheduling around business requirements, testing against application workloads, and execution within coordinated change windows. Pure1 alerts require assessment, prioritization, and action. Capacity management stays proactive — Pure alerts on utilization thresholds, but someone must act before capacity exhaustion.

Volume and host management requires changes when compute nodes change — new hosts added, nodes decommissioned, workloads moved. Performance anomalies like noisy-neighbor VMs or unexpected IOPS spikes need per-volume investigation that crosses Pure's boundary into hypervisor territory. NVMe-oF path health spans Pure initiator sessions and the network fabric Ethernet path — neither vendor's monitoring spans the boundary.

The operational reality

Pure's Evergreen model handles hardware proactively. Aegis handles operational continuity, change coordination, cross-platform monitoring, and decisions that require engineering judgment.

What Aegis manages

Aegis Managed Pure Storage provides purpose-built monitoring, incident response, and lifecycle governance for Pure FlashArray environments. The service integrates with Pure's existing Evergreen support model rather than replacing it, focusing on operational coordination and cross-platform visibility that Pure's native tools don't provide.

The three core components — monitoring via Aegis PM, incident response via Aegis IR, and lifecycle management via Aegis LM — work together to provide comprehensive operational coverage while leveraging Pure's proactive hardware management through Evergreen.

Monitoring capabilities via Aegis PM

LogicMonitor's native Pure FlashArray datasource polls via Pure REST API v2, providing comprehensive visibility across array performance, capacity, health, and replication status.

Array-level performance monitoring covers read and write IOPS, latency measured in microseconds with alert thresholds calibrated to workload baseline rather than generic percentiles, throughput in GB/s, and queue depth for pending I/O operations. Sustained queue depth indicates storage pressure before latency spikes occur.

Capacity monitoring tracks array used capacity in both raw and effective terms, provisioned capacity versus physical capacity to monitor overprovision ratios, and data reduction ratios combining deduplication, compression, and thin provisioning. Reduction ratio degradation alerts when data mix changes. Per-volume capacity monitoring identifies volumes consuming disproportionate capacity, while snapshot space consumption tracking prevents snapshot accumulation from consuming capacity without volume-level alerting.

Health monitoring covers array overall health status, individual drive health including wear indicators, controller health tracking active versus passive states and failover conditions, and environmental factors like fans, temperature, and power supplies. Pure FlashArray NVRAM health monitoring is critical for write performance.

Replication monitoring tracks replication session status per protection group, replication lag, and remote array connectivity.

Per-volume metrics for disaggregated environments

Per-volume IOPS monitoring identifies noisy-neighbor VMs consuming disproportionate storage IOPS. When correlated with hypervisor VM data, IVI maps high-IOPS volumes to specific VMs for investigation. This cross-platform correlation requires custom workflow in Aegis PM, not native LogicMonitor functionality.

NVMe-oF path health monitoring

Custom monitoring tracks active NVMe-oF initiator sessions from Pure's perspective, correlates with network interface metrics for end-to-end path health, and alerts when session count drops below expected host count. This indicates path failure before storage errors appear in VMs.

Incident response coordination

Aegis IR provides IVI first call response for array performance degradation with diagnosis across Pure storage layer, network fabric path, and hypervisor host/VM layer for full stack investigation. Replication failures get diagnosed and remediated including connectivity issues and reseeding when required.

Capacity alerts trigger client notification, growth trajectory assessment, and coordinated response. Controller failover events require confirmation that secondary controller is active, Pure1 alert acknowledgment, and beginning Evergreen coordination for hardware events. Drive failure events typically trigger Pure1 alerts and Evergreen replacement coordination — IVI confirms, tracks replacement tickets, and validates post-replacement cluster rebuild completion.

Volume management changes including host add/remove operations, volume group changes, and snapshot schedule management fall under Aegis IR scope.

Pure TAC and Evergreen coordination

Pure's Evergreen model delivers distinctive support with proactive hardware monitoring and non-disruptive controller upgrades performed by Pure engineers. IVI coordinates scheduling, ensures upgrades align with workload requirements, and validates return to health.

Explicitly not owned by Aegis

Guest OS I/O issues inside VMs that manifest as storage performance problems remain client responsibility — IVI proves storage layer health but what runs above it stays with the client. Application-level data management including database backups and snapshot policies driven by application requirements are outside scope, though Pure snapshot schedules are managed. Pure1 license management and contract renewals are not included.

Lifecycle management via Aegis LM

Purity OS lifecycle management covers tracking Pure's regular release cadence for both major releases and maintenance releases. IVI tracks recommended upgrade versions via Pure's release notes and recommendation channels, plans quarterly upgrade schedules or follows Pure recommendations, coordinates change windows with clients, and executes non-disruptive upgrade procedures with post-upgrade performance and health validation.

Pre-upgrade procedures include reviewing Pure1 health, confirming no open critical alerts, validating NVMe-oF path health, and confirming host connectivity.

Hardware EOL and Evergreen planning leverages Pure's Evergreen model which includes controller and hardware technology refreshes as part of subscription. IVI tracks when refreshes are scheduled and coordinates planning around the environment. Pure1 provides controller upgrade recommendations that IVI acts upon.

CVE tracking monitors Pure Security Advisories and PSIRT notices with defined patching timelines.

Cross-platform intelligence

When a workload experiences performance problems, the symptom often appears in the application as slow response time, but the cause could be in the Pure FlashArray with high queue depth, latency spikes, or volumes hitting limits. It could be NVMe-oF path issues like interface errors, path failures, or MTU mismatches. The hypervisor might have CPU saturation, scheduling delays, or live migration load. Or the guest OS could have applications consuming excessive I/O or runaway processes.

None of these four monitoring domains — Pure, network fabric, hypervisor, or guest OS — has native visibility into the others. Without Aegis, teams manually investigate each layer, often spending hours on the wrong layer.

Aegis PM's cross-platform correlation pulls from Pure via LogicMonitor, from network fabric via API, from hypervisor via management API, and from compute via management interface to isolate problems to the correct layer quickly. Per-volume IOPS from Pure combined with VM-to-volume mapping from the hypervisor identifies a noisy VM. Network interface error correlation with Pure session counts identifies a path failure.

The correlation advantage

This operational intelligence spans boundaries that no single tool provides, delivering faster problem resolution in disaggregated environments where storage and compute are managed separately.

Who this serves

Aegis Managed Pure fits organizations running disaggregated architecture with hypervisor plus Pure FlashArray who want unified operations management across both platforms through a single Aegis engagement. It serves organizations with Pure FlashArray who want operational monitoring, change coordination, and lifecycle management handled without building internal Pure expertise.

Organizations that find Purity OS upgrade coordination and Evergreen management more time-consuming than expected benefit from the service. Those with compliance requirements where storage change documentation, firmware currency, and incident records require audit trails find value in the structured approach.

Not a fit for

Organizations that want to self-manage Pure using Pure1 directly and only need break-fix escalation support, or those whose Pure environment runs applications requiring application-consistent snapshot management at the application layer.

Key decision points

Evaluating co-managed Pure operations requires answering four key questions that determine fit and scope.

First, are you running Pure FlashArray as external storage for a hypervisor in a disaggregated architecture? If yes, Aegis Managed Pure extends other Aegis managed services naturally through a single managed services engagement covering multiple platforms.

Second, do you have internal expertise to manage Purity OS upgrades and coordinate Evergreen hardware events? If yes, consider a monitoring-only engagement with Aegis PM for Pure while the client handles lifecycle events. If no, full Aegis scope including lifecycle management for Pure provides more value.

Third, is cross-platform performance correlation between Pure and hypervisor a current pain point? If yes, this represents Aegis PM's core value for disaggregated environments. If no, it may be lower priority but remains relevant for future incident prevention.

Fourth, are you under Evergreen support with Pure? If yes, IVI coordinates Evergreen events effectively. If not under Evergreen, evaluate whether support contract renewal should be part of the engagement scope, as IVI should advise on Pure support options as part of planning.

Related Resources

FAQs

Frequently Asked Questions

How does Aegis Managed Pure work with Pure's existing Evergreen support?

Aegis coordinates with Pure's Evergreen model rather than replacing it. Pure handles hardware proactively through Evergreen, while Aegis provides operational coordination, change management, and cross-platform monitoring that Pure's native tools don't cover.

What's the difference between monitoring Pure directly via Pure1 and using Aegis PM?

Pure1 provides excellent array-level monitoring, but Aegis PM adds cross-platform correlation with hypervisor and network data. This correlation isolates performance problems to the correct layer quickly, especially valuable in disaggregated environments.

Does Aegis handle Pure hardware replacement?

No. Pure's Evergreen model includes hardware replacement as a support deliverable performed by Pure engineers. Aegis provides operational coordination, monitoring, and validation of return to health after Pure completes hardware work.

Can we use Aegis Managed Pure without other Aegis services?

Yes, but the highest value comes from unified management across storage and compute platforms. Organizations running Pure with hypervisor infrastructure benefit most from combined Aegis services covering both platforms.

What Pure Storage models does Aegis support?

Aegis Managed Pure focuses on Pure FlashArray platforms. The service is designed around FlashArray's operational model, Purity OS lifecycle, and integration patterns common in disaggregated enterprise environments.

How does NVMe-oF path monitoring work across Pure and network infrastructure?

Aegis monitors active NVMe-oF initiator sessions from Pure's perspective and correlates with network interface metrics for end-to-end path health. This provides visibility that neither Pure nor network vendor monitoring delivers alone.

Ready to explore co-managed Pure Storage operations?

IVI's Aegis team works with organizations running Pure FlashArray in disaggregated environments to design operational coverage that complements Pure's Evergreen model while providing cross-platform visibility and coordination.

Start a Conversation