This GigaOm Research Reprint Expires: Jan 22, 2023

GigaOm Radar for Hyperconverged Infrastructure Solutions: Edge Deploymentsv2.0

1. Summary

Edge deployments are among the most dispersed HCI environments, with small clusters at many locations and usually without technical staff readily available. One area of differentiation is between “near edge” and “far edge.” The near edge is a data center that’s not on-premises, while with the far edge, the location typically depends on the function. Near-edge deployments could be in colocation or telco data centers spread around the country or around the world. Far-edge environments might be in trucks, retail stores, or oil exploration platforms. The available facilities vary greatly among these locations and so there is variation as well among solutions.

The usual data center HCI vendors are represented for near-edge environments, which require rack-mount servers and extensive resources. The difference is that there tend to be far more edge locations than on-premises data centers and visiting these locations for maintenance is usually impractical, so hands-off management of large numbers of HCI clusters is essential.

The far edge requires hardware that does not expect the niceties of a data center; power, cooling, and space may all be limited. Far-edge clusters are also more likely to run exclusively custom applications, making Kubernetes a central component either alongside VMs or possibly without any VM-based applications. At the far edge, hands-off management may not be enough; these locations may require hands-off deployment for clusters never visited by IT staff. Finally, far-edge locations may only have intermittent connectivity and must be able to operate for extended periods in disconnected mode. For example, a fishing boat on a two-week expedition might only have internet connectivity when in port.

Whether your environment consists of 500 clusters in 500 telco PoPs, or 10,000 delivery trucks, edge deployments scale out. Policy-based management of large numbers of clusters is vital. Automation of every aspect of operations and application updates is essential.

How to Read this Report

This GigaOm report is one of a series of documents that helps IT organizations assess competing solutions in the context of well-defined features and criteria. For a fuller understanding consider reviewing the following reports:

Key Criteria report: A detailed market sector analysis that assesses the impact that key product features and criteria have on top-line solution characteristics—such as scalability, performance, and TCO—that drive purchase decisions.

GigaOm Radar report: A forward-looking analysis that plots the relative value and progression of vendor solutions along multiple axes based on strategy and execution. The Radar report includes a breakdown of each vendor’s offering in the sector.

Solution Profile: An in-depth vendor analysis that builds on the framework developed in the Key Criteria and Radar reports to assess a company’s engagement within a technology sector. This analysis includes forward-looking guidance around both strategy and product.

2. Market Categories and Deployment Types

For a better understanding of the market and vendor positioning (Table 1), we assess how well solutions for Edge HCI are positioned to serve specific market segments.

  • Near edge: In this category, we assess solutions on their ability to meet the needs of organizations deploying to large numbers of data center locations, such as telco locations, cellular network sites, or large retail branches
  • Far edge: Here offerings are assessed on their ability to be deployed in non-traditional locations for servers. These solutions will support smaller form-factor machines and will not require data center services such as server racks, air conditioning, and protected power.

In addition, we recognize three deployment models for solutions in this report: hardware appliances, software-only, and hybrid cloud.

  • Hardware appliance: These products have HCI software pre-installed on physical servers, with a validated collection of firmware and drivers. The limited hardware choice, compared to software-only HCI, provides a smaller set of components for the vendor to validate for updates. Both hardware and software support are provided by a single vendor.
  • Software-only: This approach allows customers to assemble their hardware configuration from a compatibility list and install the HCI software. The customer takes more responsibility for the compatibility of firmware and drivers with updates and receives hardware and software support from different vendors. Software-only HCI deployments may reuse existing servers or have servers from different vendors in the deployment.
  • Hybrid cloud: These products offer the same HCI platform or management tools as a cloud service with integration to on-premises deployment. Workload portability from on-premises to the cloud service will score higher than cloud-based management.

Table 1. Vendor Positioning

Market Segment

Deployment Models

Near Edge Far Edge Hardware Appliance Software-Only Hybrid Cloud
Cisco
DataCore
Dell
Hitachi Vantara
Hivecell
HPE
Microsoft
NodeWeaver
Nutanix
Scale Computing
StorMagic
Sunlight
VMware
3 Exceptional: Outstanding focus and execution
2 Capable: Good but with room for improvement
2 Limited: Lacking in execution and use cases
2 Not applicable or absent

3. Key Criteria Comparison

Building on the findings from the GigaOm report, “Key Criteria for Evaluating Hyperconverged Infrastructure,” Table 2 summarizes how each vendor included in this research performs in the areas we consider differentiating and critical in this sector. Table 3 follows this with insight into each product’s evaluation metrics—the top-line characteristics that define the impact each will have on the organization. The objective is to give the reader a snapshot of the technical capabilities of available solutions, define the perimeter of the market landscape, and gauge the potential impact on the business.

Table 2. Key Criteria Comparison

Key Criteria

Edge-Core-Cloud integration Analytics Extended Data Storage Capabilities Kubernetes Support Automation and Orchestration Advanced Data Protection Services
Cisco 2 2 2 2 2 2
DataCore 1 1 2 2 2 2
Dell 2 2 2 3 3 3
Hitachi Vantara 2 2 2 2 2 2
Hivecell 2 1 1 2 2 1
HPE 2 3 2 2 2 3
Microsoft 3 2 2 2 3 3
NodeWeaver 2 3 2 2 3 2
Nutanix 3 3 3 3 3 3
Scale Computing 2 2 0 1 2 2
StorMagic 1 1 1 2 2 2
Sunlight 2 2 1 1 1 0
VMware 2 2 2 3 2 3
3 Exceptional: Outstanding focus and execution
2 Capable: Good but with room for improvement
2 Limited: Lacking in execution and use cases
2 Not applicable or absent

Table 3. Evaluation Metrics Comparison

Evaluation Metrics

Scalability Flexibility Efficiency Solution and Partner Ecosystem
Cisco 3 3 3 2
DataCore 3 3 3 2
Dell 3 2 2 3
Hitachi Vantara 3 2 2 3
Hivecell 2 2 3 2
HPE 3 3 3 2
Microsoft 2 3 2 3
NodeWeaver 2 2 3 2
Nutanix 3 3 2 3
Scale Computing 3 3 3 2
StorMagic 2 2 3 2
Sunlight 3 2 3 2
VMware 3 3 2 3
3 Exceptional: Outstanding focus and execution
2 Capable: Good but with room for improvement
2 Limited: Lacking in execution and use cases
2 Not applicable or absent

By combining the information provided in the tables above, the reader can develop a clear understanding of the technical solutions available in the market.

4. GigaOm Radar

This report synthesizes the analysis of the key criteria and their impact on evaluation metrics to inform the GigaOm Radar graphic in Figure 1. The resulting chart is a forward-looking perspective on all the vendors in this report, based on their products’ technical capabilities and feature sets.

The GigaOm Radar plots vendor solutions across a series of concentric rings, with those set closer to the center judged to be of higher overall value. The chart characterizes each vendor on two axes—balancing Maturity versus Innovation, and Feature Play versus Platform Play—while displaying an arrow that projects each solution’s evolution over the coming 12 to 18 months.

Figure 1. GigaOm Radar for Edge HCI

As you can see in the Radar chart in Figure 1, Edge HCI is an emerging category with multiple approaches to solving problems. The well-known enterprise server vendors target the near-edge HCI with scaled down versions of their enterprise HCI. The solutions for near edge look similar to ROBO solutions, although usually with more locations than most ROBO deployments.

Products that target the far edge, such as Sunlight, Hivecell, and NodeWeaver, are more likely to be newer companies and focus on a mixture of Kubernetes and virtual machines to run the workloads. These companies are highly innovative and often cooperate with one another to bring complete solutions to market. For example, Hivecell has an option to run the Sunlight HCI software on Hivecell hardware. We expect to see more entrants in this sector as it becomes clear that HCI architectures can help organizations address some limitations of Kubernetes for edge applications.

The more established server vendors may have some products that haven’t been included in this report that target the far edge, not relying on their data center focused HCI to scale down to far-edge requirements. The data center HCI products still operate at the near edge where rack mount servers are viable and consistency with data center HCI platforms is beneficial.

Inside the GigaOm Radar

The GigaOm Radar weighs each vendor’s execution, roadmap, and ability to innovate to plot solutions along two axes, each set as opposing pairs. On the Y axis, Maturity recognizes solution stability, strength of ecosystem, and a conservative stance, while Innovation highlights technical innovation and a more aggressive approach. On the X axis, Feature Play connotes a narrow focus on niche or cutting-edge functionality, while Platform Play displays a broader platform focus and commitment to a comprehensive feature set.

The closer to center a solution sits, the better its execution and value, with top performers occupying the inner Leaders circle. The centermost circle is almost always empty, reserved for highly mature and consolidated markets that lack space for further innovation.

The GigaOm Radar offers a forward-looking assessment, plotting the current and projected position of each solution over a 12- to 18-month window. Arrows indicate travel based on strategy and pace of innovation, with vendors designated as Forward Movers, Fast Movers, or Outperformers based on their rate of progression.

Note that the Radar excludes vendor market share as a metric. The focus is on forward-looking analysis that emphasizes the value of innovation and differentiation over incumbent market position.

5. Vendor Insights

Cisco

Cisco’s strategy around HCI and, in general, around infrastructure management, is based on a holistic management layer provided by Cisco Intersight. In the last year, Cisco introduced several new features and enhancements that will be at the core of future product development, including support for Terraform, Kubernetes, and the Workload Orchestrator. Cisco has also committed to accelerating the release cycle of new versions of HyperFlex. Users will eventually be able to get an end-to-end monitoring and management tool that abstracts all infrastructure components, including select non-Cisco hardware, and provides a common user interface for provisioning, monitoring, and management of the entire infrastructure. For the moment, however, only Cisco hardware and HyperFlex are supported, and it may still be some years before we see wider hardware and software support.

Using a minimum of two nodes, the solution can use short-depth servers for locations that have communications racks rather than server racks. Fleet deployment, management, and automation are available, and additional cloud services, including a cloud witness for remote two-node clusters, are aimed at simplifying the infrastructure while keeping costs at bay. The existing HyperFlex platforms suit near-edge deployment, and Cisco has other hardware platforms more suited to the far edge..

HyperFlex uses a VSA model for software-defined storage, with support for both compute-only nodes and storage plus compute to enable scaling to suit different workloads. Both compute and storage nodes run IOVisor software, a layout-aware client for HyperFlex storage that allows direct retrieval of data by compute nodes. The VSA implements a log-structured file system, which allows instant storage snapshots without performance impact. Replication to another HyperFlex cluster can also be enabled in the storage layer; PowerShell commands are provided to automate DR failover. Customers can use either these HyperFlex functions or hypervisor-based capabilities, such as VMware Site Recovery Manager. Similarly, a number of backup vendors support HyperFlex storage integration for data protection, as well as hypervisor layer integration.

Users can create edge-core solutions to replicate data from the edge to the core for backup and disaster recovery purposes. And Cisco is investing heavily in the cloud to provide additional integrated services for its customers.

Last but not least, the solution supports both Cisco and third-party Kubernetes solutions on both its native and VMware ESXi hypervisors.

Strengths: The vision apparent in the roadmap around Intersight and HyperFlex is exciting. Cisco’s offering will become more flexible and will provide multiple solutions to customers.

Challenges: Cisco is showing strong acceleration, but execution of its roadmap will continue to be key to bring additional features and integrations to users. Support for smaller form-factor servers will help reach more edge use cases.

DataCore

Datacore SANsymphony is a software-defined storage (SDS) product that can be deployed as a virtual storage appliance (VSA) on a hypervisor or the public cloud. SANSymphony offers robust storage virtualization and is able to consume Fibre Channel, iSCSI, and local storage, presenting virtualized storage volumes to hypervisors or other consumers such as in-guest iSCSI initiators or physical servers. The storage virtualization functionality includes deduplication, compression, and thin provisioning, as well as automated tiering across the various consumed storage devices. Aggregating direct-attached storage from multiple nodes into redundant shared storage was a large initial value proposition for HCI, and SANsymphony expands on this capability by integrating with existing non-HCI storage. An existing SAN can be virtualized behind SANSymphony as an additional tier of storage.

SANsymphony HCI storage uses two-way mirroring for high availability, and it’s possible to establish a third mirror to retain high availability during maintenance or after a node failure. SANSymphony can take snapshots and replicate these to another site, or replicate every volume write to another SANSymphony cluster for Continuous Data Protection (CDP) and DVR-like point-in-time recovery. Replication can be integrated with VMware Site Recovery Manager (SRM) for failover automation and testing.

One unusual aspect of the DataCore VSA is that the SANsymphony software installs onto a Windows server rather than a Linux machine. As a result, SANsymphony can install in the parent partition on a HyperV server, removing the overhead of a VSA operating system. For large-scale deployment, both the initial deployment and ongoing operations can be automated through a REST API, PowerShell, or a CLI. Datacore Insight Services provides web-based analytics and alerting, with pre-failure alerting. Most of the automation and alerting is implemented by VAR partners who specialize in integrating products, including DataCore’s, for their customers.

Eliminating the requirement for a SAN to build a highly available virtualization platform is great, but HCI also simplifies and integrates the management of the virtualization, with Datacore leaving all of the management of the virtualization platform to either VMware or Microsoft and integrating only with the hypervisor storage management layer. DataCore reports that most SANsymphony customers have highly regulated or locked-down environments. These environments probably need segregation of duties more than a VM-centric HCI platform might easily allow. SANsymphony administration access does not provide or require any VM or network administrative access.

Strengths: SANsymphony is a simple yet complete solution that eases the path to hyperconvergence. The company has been making steady progress with new features and optimizations.

Challenges: The fleet management for large numbers of SANsymphony deployments at the edge is left to VARs rather than being a part of the Datacore platform.

Dell Technologies

Dell Technologies’ extensive product portfolio gives users plenty of choice in designing their infrastructures for hyperconverged solutions. One solution, VxRail, has been particularly well received, and boasts one of the largest ecosystems and numerous certifications for enterprise use cases. Developed in conjunction with VMware, VxRail takes full advantage of the entire VCF stack, with integrated tools to simplify deployments and automate management operations. As Dell is a traditional server vendor, VxRail targets near-edge locations with rack-mount server options.

Updates are key to VxRail’s value, eliminating the manual compatibility validation and testing that customers typically complete to avoid post-upgrade issues. The VxRail team validates a collection of firmware, drivers, and software updates against the VxRail hardware list and packages the validated collection for deployment. Customers can simply deploy the versioned collection of updates to one or more clusters. The deployment process is optimized to minimize the time that individual nodes spend in maintenance mode, which in turn minimizes disruption to VSAN for re-sync activity.

VxRail is not specifically designed for the edge, but users can select nodes from a number of models for various use cases, including ROBO and small-to-medium business (SMB). For small sites, particularly edge, Dell offers a single node deployment as a VxRail satellite node. Dell also provides its own managed service that users can access to create a cloud experience for infrastructures physically installed in their data centers. Kubernetes support is available through VMware Tanzu Kubernetes Grid. System lifecycle management is a key aspect of this solution and heavily contributes to improving its TCO.

VxRail includes a marketplace of solutions that integrate with a VxRail cluster, such as Dell RecoverPoint for VMs, as well as third-party applications that can be easily deployed. Dell’s hyperconverged solutions can be acquired via traditional CapEx models or new flexible subscriptions.

Strengths: VxRail is a well-integrated solution for large organizations, with an extensive partner ecosystem and a cloud-like experience, thanks to Dell Cloud. There are a number of models to choose from, including SMB and ROBO dedicated nodes.

Challenges: The solution is not optimized for edge use cases that involve several small sites characterized by very few VMs, space efficiency, and very low power consumption needs.

Hitachi Vantara

Hitachi Vantara’s compelling hyperconverged solution is based on VMware’s vSAN and VCF, with additional features to automate first deployments and Day 2 operations. This includes end-to-end system lifecycle management, fast provisioning, and seamless integration with existing Hitachi Vantara storage systems. These rack-server-based products suit near-edge deployment into telco data centers or branch office data closets.

For edge-deployment use cases, Hitachi’s Unified Compute Platform (UCP) HC can be paired with Lumada for IoT and Pentaho for analytics. The UCP edge is likely to be deployed in a manufacturing plant or some other OT deployment. Alternatively, both VMware Tanzu and RedHat OpenShift are supported for container orchestration on top of UCP, as is Hitachi’s own Kubernetes (HKS) platform. The Hitachi Content Platform (HCP) can also be licensed with UCP to provide S3-compatible object support with object-level replication between sites, potentially very useful for replicating backup images from ROBO and Edge UCP to central data centers.

Customers with a significant commitment to Hitachi will find UCP HC to be a fully featured HCI platform and may well deploy UCP HC at regional locations to complement UCP RS (Rack Scale) in larger data centers. The Lumada and Pentaho combination may encourage application owners to bring UCP into manufacturing and IoT deployments where Hitachi might not already have a footprint.

Hitachi Vantara doesn’t provide its own cloud services but because of the VMware VCF stack, users can take advantage of multiple solutions from major cloud providers based on the same stack. Kubernetes support is now available through VMware Tanzu Kubernetes Grid, the Hitachi Kubernetes Service, and RedHat OpenShift.

Hitachi Vantara is expanding its subscription program to include HCI solutions, but traditional purchasing options are available as well.

Strengths: This solution is an interesting value proposition with a solid stack and well-designed automation and management tools. The Hitachi Vantara UVP HC series complements Hitachi Lumada Edge Intelligence solutions.

Challenges: Large, distributed infrastructures for edge use cases can be expensive compared to other solutions dedicated exclusively to edge use cases.

Hivecell

Hivecell is a hardware appliance plus software platform purpose-built for edge computing, particularly for edge locations with little infrastructure and possibly no local staff. These edge locations generate significant amounts of transient data, and the edge applications respond to this data in near real time and return smaller amounts of persistently valuable data to a central location, either on-premises or the public cloud. The small form-factor hardware appliances have a unique stacking design that the company calls a hive, rather than a rack model. A single power cable and a single network cable connect to the hive and are shared by the appliances in the stack through connectors on the top and bottom of each appliance. Appliances have built-in batteries for power protection and WiFi as a backup to the wired network, so lifting a node off the stack does not break the cluster.

The nodes do not require special power or cooling as they are designed to run in closets or in vehicles rather than needing a data center. This unusual hardware design reduces training and skill requirements around on-site operations—a courier with no prior training could add a node to a stack or replace a failed node. Another unusual aspect is that the appliances and central management service (Hive Control) are delivered only as a service billed monthly. Customers can increase or decrease appliance number, or even change model of appliance as required, with no capital expenditure.

The central control platform, Hive Control, is used to onboard devices through hardware ID integration with appliance fulfillment—there is no need to stage the appliance before it is sent to its end location. Hive Control is then used for the ongoing management and monitoring of the deployed appliances.

Hivecell expects that many customers will want to follow DevOps and infrastructure-as-code methodologies, integrating the infrastructure configuration with their application build and automating both with CI/CD pipelines. Applications can be run in VMs, enabled by a partnership with StorMagic and using KVM as a hypervisor, or in containers, or in a combination of both. The VMs use appliances based on Intel CPUs; containers can also use ARM-based nodes with far smaller power requirements. The ARM nodes include NVIDIA CUDA cores to enable higher performance, machine learning use cases. A single stack can have a mix of Intel and ARM nodes, possibly to mix VMs and containers, although individual applications are usually tied to a particular CPU architecture.

Strengths: The custom hardware platform ideally suits locations that are never visited by IT staff yet require reliable infrastructure. An all-inclusive monthly cost model is innovative.

Challenges: The platform is foreign to enterprise IT organizations and different from the organization’s existing platforms, thus requiring new skills to operate.

HPE

HPE SimpliVity is one of the most mature and feature-rich platforms on the market. It supports both VMware ESXi and Microsoft Hyper-V, and provides a clear, simple, and cost-effective migration path for existing HPE Proliant customers who want to move to hyperconvergence.

The SimpliVity platform has always been focused on efficiency, with always-on global deduplication and compression enabling efficient data protection and replication for disaster recovery. A single-node deployment at a near-edge location can have data protection and recovery by replication, or a cluster can scale out to meet data center requirements. One design consideration is that a SimpliVity cluster must contain uniform hardware—upgrading to newer hardware requires a new cluster be built and the VMs migrated using the efficient replication.

One of the main factors differentiating HPE SimpliVity is its core storage architecture, which is designed to optimize the data footprint. This allowed HPE to build a solid set of data protection services—including snapshots, backups, and remote replication for disaster recovery—that are particularly bandwidth-conscious, an important feature for remote site data protection. In addition, HPE InfoSight, the AI-based analytic platform now available for most HPE products, offers a complete view of the entire stack and helps users with daily operations, system tuning, reporting, as well as support and capacity planning. Initial deployment is streamlined as well, so end users can get the system up and running quickly. Daily operations are also simple, with an easy-to-use interface and good integration with the hypervisors. HPE SimpliVity supports Kubernetes through its CSI plug-in and virtualization platforms like VMware Tanzu.

Users can choose from a broad range of preconfigured appliances, including high-capacity and workload-specific nodes, as in the case of VDI. HPE SimpliVity is part of the HPE GreenLake program, which provides added flexibility and a cloud-like purchasing experience.

Strengths: HPE SimpliVity is a robust platform that is easy to use and has the solid performance and capacity optimization associated with top-notch analytics.

Challenges: SimpliVity does not scale to large numbers of clusters within a federation; although customers can deploy multiple federations, each is a separate management domain.

Microsoft

Microsoft’s strategy for Azure Stack HCI has evolved radically in the last year, and it is now practically an extension of Microsoft Azure Cloud. Users can download and install a specific OS designed for HCI and opt for the pay-as-you-go model based on the resources used, starting at one core per day. Deployment is simplified, thanks to wizards and automation tools aimed at speeding up the provisioning of infrastructures of any size. System management is delivered through a management console available on Azure. Support is available via Azure Cloud as well, further simplifying the entire process while creating a consistent user experience across different services and environments.

Microsoft also improved the core of Azure Stack HCI. Its distributed storage layer (Storage Spaces Direct, or S2D) has been greatly optimized for better performance and lower resource consumption, while native backup and disaster recovery services are integrated in the stack. Infrastructure lifecycle management, including firmware upgrades, is possible via select hardware vendors.

Azure Stack HCI is the only HCI platform from a major public cloud provider and its unique value stems from its ability to bridge from on-premises to Azure. Bringing Azure services to Azure Stack HCI is a priority for Microsoft. ARC and ARM both already support Azure Stack HCI for unified management and infrastructure-as-code workload deployment respectively, along with Azure Monitor Insights for workload analytics. Azure Backup and Site Recovery allow data protection and disaster recovery from Azure Stack HCI, integrated with Azure. The Azure Kubernetes Service is available with a local control plane to enable intermittently connected clusters, and Azure Virtual Desktop is in preview. Microsoft is adding services like Azure SQL, functions, and app services, which will enable more cloud-native application deployments on-premises.

Azure is a public cloud that is designed to accommodate existing on-premises applications, and Azure Stack HCI accommodates those applications that can’t be moved to the public cloud. As Microsoft adds more services to Azure Stack HCI, building more cloud-native capabilities around these applications, on-premises will become more practical. Azure Stack HCI is attractive to customers who have made a significant commitment to Microsoft and Azure. Its value to them will increase as more Azure services become available on-premises.

Microsoft doesn’t offer a complete fleet management solution for physical hardware yet, but it is quite clear that Azure Stack HCI can support infrastructures of any size for SMB, ROBO, and edge use cases.

Strengths: Azure Stack HCI acts as an extension of Azure Cloud, improving the overall user experience and purchasing model while also providing a seamless hybrid cloud user experience.

Challenges: Even though Microsoft has an enticing roadmap for Azure Stack HCI, hardware fleet management is not yet available, making it harder to manage large, distributed infrastructures.

NodeWeaver

NodeWeaver is solely focused on delivering an edge platform for applications. Scalability is both out and down, including not only many nodes and clusters but also low-resource nodes and small clusters. The NodeWeaver platform supports all those nodes and clusters by automating every task possible, which minimizes the need for human intervention. In addition, NodeWeaver uses minimal supporting infrastructure. Clusters are configured through DNSOps, where DNS entries provide information for nodes to bootstrap and join the cluster. Using DNS alone is simpler than using PXE, which requires DHCP and TFTP in addition to DNS. Within each node, an autonomic engine manages mapping the physical hardware of each node to the policies attached to the cluster. Hardware-independent cluster policies allow much simpler upgrades between generations of hardware, or any other requirement for non-uniform cluster hardware. Nodes within a cluster do not need to have the same CPU, RAM, or storage configuration, and they can range from as small as 2 CPU cores and 8GB of RAM to multiple CPU sockets and hundreds of GB of RAM. Policy-based management allows the autonomic engine to reconfigure the cluster when hardware fails or new hardware is deployed, rather than requiring manual reconfiguration. The autonomic engine is also responsible for ongoing optimization, automated updates, and pre-failure detection.

As an edge solution, NodeWeaver automation integrates into a customer’s wider application platform. Both an API and a CLI are available for management, along with a variety of SDKs for the API and plugins to support declarative tools such as Ansible and Terraform. NodeWeaver provides its own Kubernetes distribution, or customers can choose another distribution and integrate it with cluster storage through the CSI. Virtual machines can be operated in a fully virtualized KVM for compatibility with existing VMs. Alternatively, NodeWeaver uses paravirtualized KVM for newer VMs and particularly to support containers and Kubernetes.

The NodeWeaver storage cluster stores compressed data in chunks, as objects with metadata to assemble the chunks into a storage object. Nodes might have different sizes and types of storage devices, and the storage engine lays chunks out to get the most consistent application performance. Snapshots are simply point-in-time copies of the metadata, with no impact on performance, and they can be replicated to S3-compatible object storage for data protection. Once the NodeWeaver software is pre-installed on a client’s choice of hardware, the on-site deployment can be completed by untrained staff— the pizza maker can deploy the cluster in a pizzeria..

The choice of NodeWeaver as an edge HCI platform is likely to be driven by the application(s) to be deployed at the edge, rather than selecting it to be an all-purpose platform for all locations and forcing applications to fit to the platform.

Strengths: A very light and efficient platform, NodeWeaver scales to large numbers of clusters without requiring large supporting infrastructure.

Challenges: The very different control mechanism (DNSOps) may take some time for customers to embrace.

Nutanix

Nutanix is a global market leader in enterprise HCIs, with a take on edge infrastructures that is quite different from the competition.

Customers can start with a minimal 1-node configuration, which can be a good solution for ROBO use cases in which high availability is not a requirement. However, the resources needed to run the infrastructure make for a relatively expensive solution when compared with products specifically designed for edge deployments. Fortunately, users have plenty of choices, including Nutanix’s Acropolis Hypervisor (AHV) instead of VMware ESXi and Microsoft Hyper-V, to mitigate costs and simplify the entire stack.

Nutanix’s management and automation tools are well designed and scalable, but the real differentiator comes from the additional features the company is building into its platform-as-a-service offer. In fact, Nutanix Karbon tremendously simplifies the development and deployment of custom applications for edge use cases in industrial scenarios, with integrated analytics and AI frameworks that can be leveraged both on-premises and in the cloud. And the availability of a native Kubernetes environment for this platform gives additional options to users considering future Kubernetes deployments at the edge. The solution also offers strong system lifecycle management functionality to ease maintenance and upgrades for the entire stack (hardware and software).

Another enabling technology is Calm, which provides automated self-service application deployment and lifecycle management. Calm is now available as a SaaS product, providing a portal for hybrid-cloud automation. Flow Security Central adds security operations, monitoring, and compliance reporting, while Beam supplies hybrid-cloud cost governance, rounding out the hybrid-cloud portfolio to simplify operations beyond on-premises HCI.

Nutanix software is available for major hardware and cloud providers on a subscription basis. Its channel and technology partner ecosystems are very extensive and provide additional assurance about the overall flexibility of the solution.

Strengths: Nutanix is an end-to-end solution for building edge-to-cloud infrastructures with integrated and portable services. It has a strong ecosystem and partner network.

Challenges: The platform cost may be a significant issue for small sites.

Scale Computing

Scale Computing offers a complete end-to-end solution based on its own virtualization stack, which is designed for small, medium, and distributed enterprises with a particular focus on edge use cases. Even though its ecosystem of technology partners is still limited, in the last couple of years the company has signed important agreements with key partners, such as IBM, Lenovo, Google Cloud, Acronis, Leostream, Parallels, Milestone, Intel, and others. It is noteworthy that the company is often praised by its large user base for the quality of its support, and it has a very friendly approach with channel partners as well.

When it comes to edge computing deployments, Scale Computing offers one of the most cost-effective solutions on the market. The Scale Computing HE150 is based on Intel NUC compute nodes connected to a clever networking solution on the back end that doesn’t require external switches. This brings the cost and the number of components down to a bare minimum, making it ideal for unattended sites. Two HE150 appliances, based on Intel NUC hardware, can fit on a single shelf under the counter of a retail store and provide a highly available platform for multiple applications, such as PoS and signage. Dozens of these self-healing clusters can be managed centrally from a web console without sending technicians to the site. For more industrial deployments, Scale Computing partners with Lenovo for environment-tolerant servers.

The company is also working on fleet management to provide a complete view of the entire infrastructure and automate operations on large-scale deployments. This SaaS solution can be leveraged as well by MSPs. Disaster recovery on Google Cloud is an option that allows users to minimize upfront infrastructure investments.

Finally, ease of deployment and ease of use are other areas in which Scale Computing has invested heavily. New clusters can be installed in minutes and day-to-day operations are simplified, thanks to clean, well-designed dashboards. And the latest hardware appliances have improved VDI capabilities, due to GPU support.

Strengths: This is an end-to-end solution that is easy to deploy and use. It is cost-effective, with tools designed to manage large edge infrastructures.

Challenges: The scalability of single clusters is not comparable with what HCI market leaders provide. Even though the solution ecosystem is growing and all major use cases are covered, options are still limited.

StorMagic

StorMagic is one of the few solutions designed specifically for edge and SMB use cases. With more than 1,300 customers across all industries, and partnerships with the most important hardware manufacturers and cloud providers, StorMagic has built a simple yet very efficient product with strong security features and additional cloud services to ease deployment while improving the user experience.

Usually deployed with VMware ESXi, StorMagic SvSAN also supports the Microsoft Hyper-V and KVM hypervisors. It can run on practically any x86 hardware, with minimal resource needs, in cost-effective two-node configurations. Leading server manufacturers such as HPE and Lenovo also offer servers with SvSAN preinstalled, which simplifies the acquisition process. The management platform enables users to manage thousands of sites from a single console, and the cloud-based witness service further eases deployment of large, distributed infrastructures.

SvSAN usually has a two-node, synchronous mirrored configuration for high availability, with a shared witness to prevent split-brain during network failures. The shared witness, which can support up to 1,000 SvSAN clusters, is usually a virtual machine located in a central data center. A third mirror is supported, to maintain high availability during maintenance or after an initial failure. Additional compute-only nodes can be added, which consume the HCI storage over iSCSI. SvSAN does not have data reduction features. In small edge locations, the compute resources required for data efficiency are more costly than the small additional storage capacity. SvSAN can use an SSD as a cache to accelerate I/O in a hybrid storage configuration, where hard drives provide cost-effective capacity, and the intelligent use of an SSD cache reduces the need for VSA RAM to provide performance.

StorMagic has invested heavily in security over time, with a specific product for encryption key management (SvKMS) that complements the HCI solution. For customers with enterprise key management already in place, SvSAN can also work with third-party KMS solutions.

Users can choose between perpetual licenses with support or subscriptions based on capacity, with additional options for improved security, performance, and other services.

Strengths: Designed for edge-computing deployments, with a lightweight and resource-savvy architecture, this solution is easy to manage at scale and very cost effective. Data encryption and integrated key management are also strong.

Challenges: Data protection, such as backup and disaster recovery, requires external products.

Sunlight

Sunlight has two core products, the NexVisor HCI stack, and the Sunlight Infrastructure Manager (SIM) management platform. Both are software and are licensed together on a monthly subscription basis with all features included—a very simple licensing model. NexVisor is designed to have very low overhead while delivering near-native performance for modern storage and networking hardware. The low overhead reflects Sunlight’s philosophy that more hardware is not always the best solution; using what you have efficiently is better. As an edge-focused product, SIM supports thousands of clusters under a single console. The SIM platform includes an application marketplace from which third-party applications can be deployed to NexVisor clusters.

Sunlight expects to see a variety of processor architectures at the edge; currently both ARM and x86 are supported, with RISC-V on the roadmap. A common feature of the smaller system hardware that is deployed at the edge is that I/O capability tends to dominate over raw CPU power. If more CPU power is required for AI, it’s often implemented via GPUs rather than CPUs, Sunlight supports NVIDIA Jetson as a hardware platform, and it has optimized NexVisor for the massively parallel I/O that these systems can deliver, to handle the massive amount of data being generated at the edge that must be processed at the edge rather than forwarded to the cloud.

One of Sunlight’s unusual characteristics is how much of its technology has been developed in-house. The NexVisor stack is unique to Sunlight, although it does derive from Xen and so has good support. Sunlight cluster nodes also use a proprietary network protocol, rather than using TCP/IP like most HCI products. Using a specialized protocol for node-to-node communication reduces the required resources as well as the attack surface and risk from zero-day vulnerabilities in commonly used libraries. The node that is elected cluster leader does use TCP/IP to talk to SIM, with a single IP address per cluster to allow access through the firewall to SIM.

Sunlight is a software product, so it partners with hardware vendors for supported hardware configurations. The vendors tend to have specialized hardware for retail and manufacturing or industrial environments. High-performance platforms will include NVMe storage and Intel Optane, where NexVisor’s highly efficient parallel I/O can take advantage of the full power of the hardware. Though the Sunlight platform can run on data center servers, its greatest strength is delivering excellent performance with a small footprint.

Strengths: Sunlight offers turnkey HCI with hardware partners at edge scale. The platform is designed from the start for modern CPUs with strong I/O capabilities.

Challenges: Retaining support for legacy x86-based VMs may slow feature development in ARM, RISC-V, and Kubernetes-based capabilities.

VMware

VMware Cloud Foundation (VCF) is the market leader in enterprise virtualization. Already at the base of many HCI solutions from third-party vendors, VMware has, in the last few years, expanded its reach to edge use cases as well. In fact, it’s possible to run a VCF cluster in a three-node configuration, making the solution a viable option for edge deployments and scenarios where it is necessary to run a small infrastructure in a remote location. Near-edge deployments might be on rack-server hardware, while far-edge deployments could be on smaller form-factor machines from vendors specializing in industrial premises computers. VMware also ported its ESXi hypervisor to ARM CPUs, with an experimental version of the product available for free download, hinting at future support for tiny, power-efficient edge infrastructures for IoT and other advanced use cases. VMware provides several licensing options to meet the needs of different organizations.

The main advantage of VMware VCF at the edge is its ecosystem. In fact, the stack is totally integrated, and tools like vCenter and vRealize Suite for automation and orchestration enable users to manage the hardware and software lifecycle of large, distributed infrastructures with the same tools already in place for data center operations. VMware’s edge strategy includes more than the HCI platforms, with software-defined WAN (SD-WAN) and virtual desktops.

VMware has made significant investments in the cloud, and the platform is now available from all major cloud providers, enabling organizations of all sizes to take advantage of these services. VMware also has a compelling and cost-optimized DRaaS option for its customers.

Last but not least, VMware is quickly becoming a credible and strong player in the Kubernetes realm, thanks to the native integration and the Tanzu portfolio.

Strengths: VMware is a market leader with solid solutions for organizations of all sizes. VCF is now available on-premises and in the cloud, with additional cloud services planned for the future, for greater flexibility.

Challenges: VCF and vSAN may be expensive for deployments where non-traditional infrastructure approaches are taken at the edge to optimize cost savings.

6. Analyst’s Take

If your edge locations can accommodate rack-mount servers, your HCI needs are similar to a ROBO use case with a massive number of offices. If your edge locations don’t look like an office, you probably need a solution that supports servers that don’t look like servers.

The near-edge locations where you deploy servers from Cisco, Dell, Hitachi, HPE, or Scale Computing are probably places that engineers—yours or the server vendor’s—can visit but don’t want to. Racking and cabling will be done on-site, but ongoing operations will be remote. The applications you deploy can be a mix of VMs and containers on Kubernetes clusters as the HCI platform is still a general purpose compute service.

Far-edge locations, such as delivery trucks, oil wells, and wind farms, are usually much harder to visit. Deployment of Hivecell clusters, possibly with Sunlight running on them, doesn’t require an IT specialist; a diesel mechanic can install the boxes during a service visit.

We expect that over time, the far edge will be dominated by low-power processors and solid-state storage, with no moving parts to fail. Applications will always be king at the edge, so choosing an edge HCI platform starts with the application.

7. About Alastair Cooke

Alastair Cooke

Alastair Cooke is an independent industry analyst who has been writing about the challenges and changes in the IT industry for over ten years. IT infrastructure, virtualization, end-user compute, and public cloud technologies are all focus areas. Alastair started his thirty-year career as a hands-on technical consultant for global companies such as Glaxo Welcome and VMware, as well as smaller New Zealand-based companies. In the second half of his career, Alastair has been a technical trainer and content creator, making videos and writing about enterprise IT topics. As a trainer, Alastair taught courses for VMware, HPe, Nutanix, and AWS. He is a host of Build Day Live, doing hands-on video education with enterprise IT vendors such as HPe, Dell, Pure Storage, Oracle, Supermicro, Cohesity, and NetApp.

8. About GigaOm

GigaOm provides technical, operational, and business advice for IT’s strategic digital enterprise and business initiatives. Enterprise business leaders, CIOs, and technology organizations partner with GigaOm for practical, actionable, strategic, and visionary advice for modernizing and transforming their business. GigaOm’s advice empowers enterprises to successfully compete in an increasingly complicated business atmosphere that requires a solid understanding of constantly changing customer demands.

GigaOm works directly with enterprises both inside and outside of the IT organization to apply proven research and methodologies designed to avoid pitfalls and roadblocks while balancing risk and innovation. Research methodologies include but are not limited to adoption and benchmarking surveys, use cases, interviews, ROI/TCO, market landscapes, strategic trends, and technical benchmarks. Our analysts possess 20+ years of experience advising a spectrum of clients from early adopters to mainstream enterprises.

GigaOm’s perspective is that of the unbiased enterprise practitioner. Through this perspective, GigaOm connects with engaged and loyal subscribers on a deep and meaningful level.

9. Copyright

© Knowingly, Inc. 2021 "GigaOm Radar for Hyperconverged Infrastructure Solutions: Edge Deployments" is a trademark of Knowingly, Inc. For permission to reproduce this report, please contact sales@gigaom.com.