Sanbolic Team BlogThe most highly available blog you will ever read

Melio5 offers new features to simplify and enhance Desktop and Application Virtualization with Citrix XenDesktop 7

Nordic blogsite http://www.ervik.as highlights new features of Melio5, which simplifies and enhances desktop and application virtualization with Citrix XenDesktop 7: http://www.ervik.as/news/storage-virtualization/sanbolic/4077-citrix-xendesktop-7-and-sanbolic-melio5

Experience for yourself why over 700 customers globally have chosen Sanbolic over more costly and complex alternatives.
Deploy a free Trial of our full featured Melio software

Citrix XenDesktop 7 and Sanbolic Melio5 Accelerate VDI Adoption!

With the upcoming releases of Citrix XenDesktop 7 and Sanbolic Melio5, the two companies are on course to continue leading the charge in the adoption of VDI in the enterprise. By combining the advanced technologies found at the core of both products, customers of all sizes will be able to deploy desktop and application virtualization faster, easier and with lower cost than ever before!

Built on the new FlexCast architecture, Citrix XenDesktop 7 enables automated delivery of desktops and applications from a unified management platform for a seamless user experience. Offering various tools designed to speed up application migration and mobilization as well as enhanced security to protect intellectual property and private data in hosted-shared (RDS) Windows Server and Windows desktop OS-based VDI environments, XenDesktop 7 allows Windows desktops and applications to be delivered as a service easily and securely to any device over any network in order to maximize productivity in today’s mobile world.

A Citrix-Ready partner and industry leader in software-enabled storage and distributed data management, Sanbolic’s Melio5 data management platform adds new enterprise-class features and capabilities including RAID 0, 1, 10, 5, 50, 6 and 60, storage live migration, QoS, synchronous and asynchronous replication (for DR across geographically distributed sites), snapshots, optimized data placement in mixed (hybrid) storage infrastructures and support for on-premise and public cloud VDI deployments.

Also new to Melio is the ability for customers to use commodity server and storage hardware to create server-side, scale-out storage architectures that provide comprehensive application, data and storage management capabilities, enhanced fault-tolerance and performance levels reaching 650,000 IOPs+ per server, with performance scaling linearly as additional nodes are incorporated into the system. The resulting architecture offers a highly scalable and available storage solution for XenDesktop 7 deployments at a fraction of the cost of external storage arrays.

As with previous versions of Melio, Melio5 continues to enable and extend the capabilities of XenDesktop by enabling server and desktop OS image high availability and load-balancing, simplifying image management and maintenance, and significantly reducing storage capacity requirements. Built-in support for application clustering offers Citrix administrators a simple, cost-effective solution for protecting XenDesktop databases during SQL Server failures without the costs or complexities associated with failover clustering, SQL replication or SQL mirroring. And Melio’s new Latency Targeted Allocator™ (LTA™) feature lets customers mix their legacy storage devices with newer, faster storage devices such as SSD and Flash to maximize virtual desktop performance for a rich user desktop experience while enhancing capacity utilization.

Using Melio5, customers can start XenDesktop 7 pilots quickly and easily with lower cost and transition into production seamlessly with the flexibility to scale deployments dynamically without downtime! The result – Citrix customers realize the fastest and greatest ROI on their desktop and application virtualization projects.

Together, Citrix XenDesktop 7 and Sanbolic Melio5 make it easier than ever for customers to realize all the benefits of desktop and application virtualization to ensure the highest levels of productivity in an increasingly mobile world.

To learn more, visit Sanbolic at booth #515 during Citrix Summit/Synergy (May 21st – May 24th) in Los Angeles, CA. While you’re there, be sure to register for an opportunity to win a new iPad Mini.

See you at the show!

Experience for yourself why over 700 customers globally have chosen Sanbolic over more costly and complex alternatives.
Deploy a free Trial of our full featured Melio software

See VDI at its Best at Citrix Synergy 2013!

Event: Citrix Synergy 2013

Date: May 22nd – 24th

Place: Los Angeles, CA

Highlights: VDI with Citrix XenDesktop and Sanbolic Melio5

What you’ll see: The most scalable, available, performance-oriented VDI solution on the market!

What you’ll learn: How Sanbolic Melio5 enables and enhances Citrix VDI deployments

What you want to do: Visit booth #515 during the show to see and learn how easy it is to deploy a Citrix VDI solution with Sanbolic Melio5 to significantly reduce the burden of desktop and application administration while assuring your users a high-quality desktop experience.

Synopsis: Addressing all three tiers of IT infrastructure (application, data, storage), Sanbolic’s Melio5 data management software enables organizations of all sizes to realize the benefits of desktop virtualization faster, easier and with lower upfront and on-going cost than ever before using commodity server and storage hardware to avoid vendor lock-in and the high cost of external storage arrays.

See you at the show!

Experience for yourself why over 700 customers globally have chosen Sanbolic over more costly and complex alternatives.
Deploy a free Trial of our full featured Melio software

Software-Defined Storage: Focus on Benefits, Not Buzzwords

“Software-Defined Storage” is the flavor of the month for vendors hawking storage gear. While it can be a useful way of thinking about storage, it’s not something you can buy, but a set of capabilities. We recommend not getting hung up on formal definitions of “software-defined storage.” Instead, evaluate products on whether they can deliver the key benefits it promises. Those include cost savings, scalability, flexibility, and insuring agility by eliminating single points of failure.

Birth of a Buzzword

VMware popularized “Software-defined Storage” as part of their Software-Defined Data Center vision, which is the next step in their product evolution and still far from reaching the market. Over the past decade VMware has, of course, built a leadership position in server virtualization, providing a software layer to abstract the logical server from the physical hardware it runs on. This makes it much easier to deploy servers when needed, to share hardware with other applications, and to move workloads for load balancing or availability. The economics are compelling and well over half of x86 servers are now virtualized.

However, the expensive storage and networking gear attached to these virtual servers is typically still allocated to individual as if they were physical devices. To make matters worse these storage arrays typically lack flexibility, portability, or scalability, and remain one of the most costly aspects for corporate datacenters.

Customers found that these inflexible resources stood in the way of continued cost savings and the ability to dynamically provision IT resources. Several years ago VMware started publicly musing about extending the abstraction layer from servers to networking and storage, calling it their Software Defined Data Center vision. Through product announcements and acquisition they’ve begun fleshing out this strategy. Many competitors have jumped on the bandwagon, rebranding their point solutions as “Software-Defined Storage.”

The Public Cloud: Already There

Public cloud players like Google and Facebook have already built many of these “Software-Defined Storage” capabilities but only for their own use. The formula is familiar: Proprietary software aggregates commodity hardware into a storage pool and provides some level of data protection and resiliency. Google, for example, developed its Google File System a decade ago. Facebook recently replaced NetApp storage with commodity hardware and its own Haystack file system for its image library. This architecture also efficiently uses flash storage devices to accelerate metadata reads and writes, while using low cost SATA drives for longer-term storage. The bad news is that most customers lack the engineering talent to build and maintain this virtual infrastructure for themselves, and it is limited to very specific workload profiles.

That’s unfortunate because this model can bring similar, or greater, benefits on the storage front than on the server front. They include:

1)     Cost reduction through more efficient use of storage and lower cost hardware

2)     A “fluid” architecture that increases flexibility and reduces deployment time. By abstracting application data from storage hardware, it becomes much easier to scale out workloads, move them among physical devices, and ensure high availability.

Everyday customers need the scalability and performance benefits of these “public-cloud like” storage infrastructures, along with storage services such as data protection and service level management. Whether or not they’re called “Software-Defined Storage,” these solutions will also need to be agnostic across hardware and hypervisor platforms, and be compatible with legacy storage infrastructure.

The Public Cloud for the Rest of Us

Sanbolic has been working on the challenge of distributed computing and storage since 2000. Our Melio clustered file system and volume management technology delivers the benefits of “Software-Defined Storage” to hundreds of enterprise and government customers globally. We are also working with the leading Flash vendors to enable high availability and scale out utilizing flash within in servers and storage arrays, while also enabling the performance of Flash storage across all application workloads.

Sanbolic’s Melio5 is the only software-enabling platform to provide storage and data management capability across legacy SAN as well as DAS and server side flash storage. Hardware and hypervisor agnostic, Melio unifies customers infrastructure and can run on on-premise and in the Amazon and Rackspace cloud infrastructure and provide SAN/NAS scalability for thousands of servers and storage devices and Exabytes of data.

When you hear “Software-Defined Storage” we encourage you to think beyond the jargon and focus on the benefits it claims to provide. We then invite you to consider Melio5 to deliver the public-cloud levels of cost reduction and scalability – not to mention the data protection – you need in an enterprise-proven solution that is shipping today

Follow me on twitter @mvmsan

Related posts:

Getting Pitched on Storage? Ask ‘Em These Questions

SSD and Server-side Flash Scale Out: What to Look For

Solving Capacity Challenges with SSD and Server Side Flash

Web 2.0 infrastructures ready to move to the corporate data center

Why SSD and Flash Need More Brains to Reach the Corporate Data Center

Webinar OnDemand: Change the way you look at server-side SSD and Flash Storage with Sanbolic and Fusion IO

Experience for yourself why OVER 700 customers globally have chosen Sanbolic over more costly and complex alternatives. Melio for FREE

Experience for yourself why over 700 customers globally have chosen Sanbolic over more costly and complex alternatives.
Deploy a free Trial of our full featured Melio software

Getting Pitched on Storage? Ask ‘Em These Questions

Five Tough Questions for Your Storage Vendor

Just about every storage professional is in a vise these days. They need to manage more and more data and make it available more and more quickly for less and less money.

In response, everyone from the smallest angel-funded startup to the industry giants claim they can help. They’re all talking scaling to hundreds of thousands of IOPS and thousand of nodes with their own twist on caching data in server-side flash memory to boost performance.

And, of course, no press release is cool unless it makes a reference to software-defined storage, whatever that is.

It is true that we’re on the brink of massive, structural changes as intelligent software running on low-cost commodity servers and storage replaces proprietary, high-priced offerings. This will bring the kind of incredibly low-cost, highly reliable systems Google and Amazon have built to the corporate data center. But as with any massive industry shift, there’s also a massive amount of hype.

While the marketing fluff settles down, here are five tough questions to ask the next sales rep who clogs your voice mail.

  1. Is your product shipping? We’re seeing a lot of breathless “announcements” that, when you get to the last paragraph, admit their “solution” won’t ship for weeks, months or even a year or more. Promises are great, but you have problems to solve now. Until you can get your hands on something and test it, much less buy it, it’s irrelevant.
  1. Is it enterprise proven? The commodity hard drives, flash storage and servers underlying most of this “new-age” storage are fairly well proven. Not so the intelligent software that runs atop them. Can a vendor provide reference examples of how their software is actually working at enterprise levels of users, data, applications, and transactions?
  1. Is there a single point of failure? These days, if only takes a little bit of downtime for your impatient customers or business partners to look elsewhere. Especially when you’re using commodity hardware, the question isn’t whether something will fail, but when. Can this cool new technology instantly balance loads among servers to maintain performance if one goes down? Can it instantly rebuild data if one or more arrays fail, without trashing performance?
  1. Is the hardware truly commodity? Some vendors “bake” some of their software into their hardware in the form of custom ASICs. They may claim this improves performance. But it also forces you to buy that vendor’s pricey hardware every time you need more capacity. It also requires replacing all that gear if you ever want to change vendors. Ask if you can choose your own server and storage hardware without sacrificing performance, scalability, redundancy or other important features.

  1. Does it deliver a lower total cost of ownership? Even if you’re trying to shift spending from capital to operational budgets, the lowest hardware sticker won’t mean much if you get eaten up with long-term management costs.

As you can guess, we think we have some good answers to these questions and encourage you to call and grill us on them. But we’re also curious to hear what outrageous claims you’re hearing from storage vendors, and the tough questions you’d like to ask them.

Follow me on twitter @mvmsan

Related posts:

SSD and Server-side Flash Scale Out: What to Look For

Solving Capacity Challenges with SSD and Server Side Flash

Web 2.0 infrastructures ready to move to the corporate data center

Why SSD and Flash Need More Brains to Reach the Corporate Data Center

Webinar OnDemand: Change the way you look at server-side SSD and Flash Storage with Sanbolic and Fusion IO

Experience for yourself why OVER 700 customers globally have chosen Sanbolic over more costly and complex alternatives. Melio for FREE

Experience for yourself why over 700 customers globally have chosen Sanbolic over more costly and complex alternatives.
Deploy a free Trial of our full featured Melio software

How to upload files (i.e., event logs, Procmon logs, Perfmon logs, memory dumps, etc.) to Sanbolic’s FTP Site

Topic: How to upload files to Sanbolic’s ftp site at ftp.sanbolic.com.

Overview: This tech brief explains how to create a user account on Sanbolic’s ftp site to upload files that require analysis by Sanbolic.

Briefing: Prior to uploading files such as event logs, Procmon logs, Perfmong logs or memory dumps to Sanbolic’s FTP site, you must create an account on our FTP site. This tech brief walks through the process of creating a new account on Sanbolic’s FTP site.

To create a new account on Sanbolic’s FTP site, perform the following steps:

1. Log into Sanbolic’s clients system at http://clients.sanbolic.com using the account you used when you first registered on Sanbolic’s website. If you do not have an account, you’ll need to register an account with Sanbolic before uploading files to Sanbolic’s FTP site.

2. At the top of the page, click the “File Manager” tab

3. In the “File Manager” page, enter a username and password, along with the CAPTCHA prompt and click “OK” to create your account.

4. Once your account is created, use the FTP client within Windows or within File Explorer or a third-party FTP client such as FileZilla to FTP to ftp.sanbolic.com.

5. When prompted for your user credentials, enter the credentials for the account you created in step #3.

6. In the root directory, create a new folder labeled with your name or company name.

7. Upload your file(s) to the new folder you created in step #6.

8. Notify Sanbolic technical support once the file(s) have been successfully uploaded to the FTP site.

Experience for yourself why over 700 customers globally have chosen Sanbolic over more costly and complex alternatives.
Deploy a free Trial of our full featured Melio software

Disrupting IT Infrastructure Business Models

Disrupting IT Infrastructure Business Models

Overall IT infrastructure spending has slowed over the past year, and EMC is projecting only 3% annual market growth going forward-see EMC and VMware Strategic Forum 2013. But underneath this overall trend there are several segments of the infrastructure that are experiencing strong secular growth, in part by disrupting existing technology business models. These segments include:

1. Public Cloud Computing

2. Flash Storage

3. Software defined storage.

There is an interesting connection between these segments which is worth exploring. The common link is distributed computing and storage architectures. And distributed compute and storage systems are likely to be very disruptive to legacy infrastructure going forward, not just in the cloud, but in the enterprise.

Public Cloud Computing

The growth of advertising supported web services businesses such as Google and Facebook, and SaaS offerings like Salesforce, is not news to anyone, and the three of them alone have created $350 Billion is market value over the past decade.  Bain & Company estimates that the cloud computing market will grow by 20-30% over the next five years.

The Amazon Web Services Infrastructure as a Service story is less well known since Amazon doesn’t break out financials.  But a recent Bernstein Research report estimates that they have been growing on the order of 100% per year, are 5 or 10 times as large as their nearest competitor, and have an implied market capitalization of $24 billion. Right now AWS runs a lot of small businesses and small corporate projects together with some big customers like Netflix. It allows them to rent infrastructure by the hour to run their existing applications without rewriting them, and frequently lowers its price.  The question is: how quickly can AWS solve the issues of application availability and easy portability of applications and data between legacy on-premise infrastructure and the Amazon Cloud, in order to allow it to capture more of the enterprise market?  They are a very agile organization. VMware and Microsoft, the on-premise virtualization and server application incumbents, are VERY concerned about their customers moving to Amazon.

The hardware vendor incumbents in the on-premise data center are very concerned as well. Look around a public cloud data center.  You may see a lot of servers from Quanta. A lot of Fusion-io cards. Not a lot of NetApp or EMC storage arrays.  When workloads shift from on-premise to cloud data centers, the total available market for the branded server and storage vendors declines. Public cloud datacenters typically have better utilization rates than on-premise deployments, reducing the amount of hardware required. And the hardware that is used is less likely to be from a branded vendor.

Software Defined Storage and Flash

But the branded IT hardware vendors face an even bigger problem.  The cloud data centers use a different architecture than on-premise infrastructure. And that architecture is starting to find its way back to the enterprise.

Since they buy a LOT of hardware each year, the public cloud companies pay a lot of attention to pricing and supplier cost position. Facebook standardizes around 5 server architectures so they can put large volumes of each out to bid.  Quanta sold 1.2 million servers in 2012, and expects to grow 50% this year.  85% of those sales will be direct to a handful of public cloud companies. These companies have also noticed that server vendors operate on margins of less than 20%, disk drive makers operate at 30%, while storage vendors operate at 60%. There is a lot of expensive software and services packaged with the hardware in a storage array.  And…their hardware is not very flexible and doesn’t scale well.

So the public cloud data centers build their own massive storage systems out of commodity servers and storage drives with proprietary storage software to provide storage services. They learned to incorporate high performance, but expensive, Flash storage to handle workloads like file system metadata, while putting sequential content data on cheap SATA drives.  Fusion-io, the PCIe flash leader, has grown from nothing to more than $1.5 billion in market capitalization, and Facebook alone accounted for more than a third of their revenue in 2012. The public cloud storage systems have to be big, they have to be fast, and they have the flexibility to occasionally present old data or lose it entirely (would you miss the last million results on your search?) But they work really well. And they cost a lot less than enterprise infrastructure.

These distributed compute and storage systems run on commodity hardware, abstract the data from the hardware, and provide availability, resiliency, and storage services in software.  Most of the public cloud companies have a lot of really smart engineers and PhDs to write and maintain these systems. But what if this type of distributed storage solution was available in an integrated off-the-shelf software package for the enterprise and could use both legacy SAN and commodity storage? Software that not just abstracted and pooled data, and provided a basic level or resiliency, but one that provide enterprise storage services like multiple RAID levels, snapshots, remote replication, and granular quality of service? Software that incorporated flash storage into a highly available primary storage tier in hybrid flash/HDD volumes, while intelligently placing data within the volome based on file system access profile?  Software that scales out to thousands of nodes in shared data clusters, to Exabytes of storage, and to more than 500,000 IOPS per node? What if the software data abstraction layer made it easy to distribute workloads across large virtual server clusters running on any on-premise hypervisor, or to cloud platforms like AWS or Rackspace?

This type of software could dramatically reduce the cost of enterprise IT infrastructure by enabling commodity hardware, improving availability, and providing the flexibility to assign workloads to on-premise or cloud infrastructure based on both workload requirements  and cost/capacity utilization. It would provide public cloud economics on commodity hardware with enterprise reliability.

Well, you can wait for EMC or NetApp to cannibalize their existing high margin business models to deliver this type of solution. And maybe they will.  Or you can try Melio5 today.

Experience for yourself why over 700 customers globally have chosen Sanbolic over more costly and complex alternatives.
Deploy a free Trial of our full featured Melio software

Scaling Performance

The capacity-resilience problem

Computing performance is derived from standard CPU, RAM and storage modules – HDDs, SSDs. If more processing power is needed for a computing task then more CPUs are added. If greater storage is required then more storage devices are attached. The main question arises – how are these CPU, RAM and storage modules put together in order to maximize performance and reliability.

Vertical Scaling

The expansion can be done by adding these resources to one system – a process called vertical scaling. In this case all components are tightly interconnected which is good for performance because it enables fast data paths between internal components with relatively simple protocols. However, it is not good for reliability as tight interconnections imply significant mutual reliance between components. Failure of one internal device can easily bring the entire system down. For this reason the bigger the system the more reliable components are required. Besides increased cost, there is the problem that the level of reliability has an upper limit induced by physical limitations and the closer you get to this limit, the higher the price per component gets. In the end this architecture reaches an unacceptable price/performance ratio. The shared components, like system bus and bridges, become bottlenecks for performance. Still, the poor price/performance ratio is compensated for by the simplicity of management – after all, this is just one computer and all business applications are straightforward to deploy. Another benefit is the lack of partitioning between resources. This means that all components are available for use by all other components which allows for maximum resource utilization, whatever the business applications require. Further growth, however, brings the need to remove the bottleneck of common system bus between devices. The answer lies in the breaking of the architecture symmetry by partitioning the system components. An example is the NUMA architecture where each CPU has a preferred memory device which is ‘closer’ to it while the rest of the memory is slower to access. The non-symmetrical approach, however, leaves the resilience problem untouched.

Horizontal Scaling

In order to grow the overall power beyond the limit of a single system while keeping high resilience, partitioning of computing resources to smaller units is required. This allows each of them to fail without bringing down any other. This is achieved by replacing the tight interconnections of system bus and bridges with loose connections that can handle failure of connected devices.

Scaling via One Type of Nodes

The simplest way is to use small computers – nodes, interconnected with a LAN/WAN connection. The benefit is that conventional hardware can be used to increase the total bandwidth of buses between CPU, RAM, IO and storage devices at the cost of resource partitioning. The overall computing task must be partitioned in order to utilize the CPU devices of all nodes. To enable high availability of computing tasks, they must be moved between computers in case of failure, however most business applications are not designed to be moved between computers. The solution to this problem is virtualization of computer hardware on which the business application is installed and moving the entire virtual machine between nodes. However, this approach creates further problems by introducing additional partitioning of computing tasks on virtual machine boundaries. The virtualization itself imposes overhead to both storage and computing devices because the system software is installed and run multiple times on one physical machine. The increased amount of VMs and their overhead demands the appropriate increase in storage. Because the storage devices are split between different computers the processed data needs to be partitioned as well within the internal storage of each computer. This data partitioning has these major problems:

  1. The growth of data needs to happen gradually on all nodes’ chunks so that it does not overflow onto one before the others
  2. The CPU/storage ratio is determined by all computers and to change it one needs to upgrade all nodes
  3. The failure of a node would lose the data on its storage devices, potentially rendering the overall system nonoperational. To avoid that, every piece of data needs to be placed on at least two nodes, changing the CPU/storage device ratio.
  4. As such a system is non-standard it requires a complicated proprietary data management system that would be incompatible with most of the existing software – it would be a closed system. This would not allow easy deployment of new software if the need arises.

Obviously, the above problems mean that power and resilience come at the very high cost of developing proprietary data management software and doubling or tripling the amount of hardware in order to avoid data loss in case of any single or double failure of nodes.

Scaling via Two Types of Nodes

Instead of designing and implementing a very complex closed system that solves the above problems, a simple solution is to use two types of systems – computing nodes and storage nodes (disk arrays). All storage devices (HDD, SSD) would be attached to storage nodes only. In this way the ratio between overall CPU and storage devices in the computing cluster can be precisely controlled by adding the appropriate type of nodes as demand arises. Because the storage nodes are freed from computation their internal architecture can be optimized for the specific storage requirements – for example, using battery-backed RAM for write-back caching of RAID 1, 5, 6 and 10 dramatically boosts performance in some cases. A storage node can be designed for a certain amount of storage devices and their respective throughput of data through RAM to link can be guaranteed. Provided that any computing node can access any storage node, the reliability requirements to computing nodes become more relaxed as their failure does not make data inaccessible. In turn they can be optimized for the business applications they would run. A link that is optimized for mass storage can be used to connect every storage node to every computing node and present it as local SCSI device (SAN). The storage nodes can be configured as one large shared, fault-tolerant volume so failure of a storage node does not lose data. In this way the storage is aggregated again via the storage network. And the most important benefit is that any application can run on the shared storage just as it would on local storage. Data availability is guaranteed if at least one computing node and a sufficient amount of storage nodes (depending on RAID configuration) are available. The only missing link is a shared volume manager that can create shared volumes out of shared storage and a shared file system that can mount safely shared volumes. Melio provides both services. Its architecture is designed so that it does not create artificial partitioning or unavailability. The main principle is that if the hardware allows operation the software should provide it. For example, if two computing nodes A and B are accessing a storage node S, the failure of B should not cause A to fail operations. Melio FS fully guarantees this principle.

Scaling Method Comparison

Method Max CPUs Max Storage Devices Compute partition Storage partition Spatial Boundary Data Change Rate Transaction Visibility & Durability
Vertical Tens Tens No No Room (m) Fast Strict
Horiz. 2 node types Thousands Thousands Yes No (w.Melio) Campus (km) Fast Strict
Horiz. 1 node type Millions Millions Yes Yes Global Slow Loose

Storage nodes aggregate HDD and SSD drives so they allow for much higher transfer rates than when storage is scattered among many nodes. The latter case has the benefit of being able to replicate data with high demand to many nodes to increase read rates.

Application-level Clustering

Despite the fact that storage is not partitioned in SAN the computing resources still are. The straightforward approach is again virtualization of hardware on which to install conventional applications but this brings the problems described above. Melio addresses the problems of clustering in a more efficient way than virtualization. It provides a framework (AppCluster) that controls conventional business applications so that they do not move between servers but attach and detach data files from the shared storage, making sure their respective services are always available. In this way the overhead of VMs is spared and the freed computing and storage power can be utilized by business tasks.

SAN vs. NAS

NAS has been used traditionally to aggregate and share storage. Its major benefit is that it is simple to manage the storage devices because they are all concentrated in one big computer. All RAID configurations are possible, all devices can be aggregated in one big volume to avoid partitioning, backup and security are centralized. However the NAS approach to storage is in essence vertical scaling applied to storage only and as such it suffers from all the problems described before. Its margin of scaling is limited and the NAS server presents a single point of failure. SAN on the other hand is horizontal scaling applied to storage and computing nodes. Data partitioning is removed with the use of shared volume manager and shared file system. Every FS request in NAS is performed by two file systems – at the NAS client and at the NAS server. This overhead of using two file systems instead of one is significant and requires the NAS server machine to be sufficiently powerful and dedicated to file serving only. The same applies for its fail-over partner(s), wasting at least two systems that could perform business tasks otherwise. Melio FS is designed so that each server processes its own FS requests and interacts only with the previous owners of the locks it acquires. This approach guarantees that one node is not consuming resources of another, potentially creating bottleneck in it. In this way the computing nodes can grow horizontally without the need to grow the NAS server vertically at the same time.

Metadata controller vs. Fully Symmetrical FS

The fastest way to implement shared FS is to start with a NAS server that would perform all metadata operations and would grant leased locks to the other nodes that allow them direct access to user data. In this way it can keep all the lock state consistent and perform complex operations. This approach, despite being simple, leaves the vertical scaling for metadata with all its problems and adds to NAS the horizontal scaling for user data. To avoid a single point of failure the metadata role can be moved to backup server. However this process loses the state of all open files, because the NAS connection operates on the file level and the server knows the state of every file opened by clients. This surely leads to interruption of all clients operation just as it would in the case of a plain NAS server, no matter how fast the fail-over process is. All operations would fail and all data cached at the clients would be lost by the unexpected closure of files. Melio FS uses distributed locking protocol between servers that guarantees that the owner of a specific lock is its server as well. This means that whichever machines fail, the remaining machines are unaffected because there is no interruption of their ownership of shared resources. Furthermore they would experience just a delay in their operation only if they request locks that were owned by a failed machine.

Persisting lock state NAS vs. Fully Symmetrical FS

Using NAS server cluster on top of a SAN with metadata controller allows hiding the high availability deficiencies of metadata controller approach within the NAS implementation. To avoid NAS client interruption during metadata node fail-over the following is required:

  1. The state of all open files within clients must be persisted on shared storage in order to be able to recover it on the new metadata controller.
  2. No user data write caching is allowed on server in order to avoid data loss when it fails.
  3. To support caching of metadata on server, clients must replay metadata operations to the new metadata controller node on fail over.

Point one requires that every file open or lock operation be written to the shared storage prior completing the client’s request, inserting the time for at least one storage write operation in their latency. Point three requires that clients consume memory and CPU power to keep the changes that the metadata server has not flushed to storage yet. Therefore all three points have negative performance impact. Points one and three slow the failover process by requiring that the new metadata server restores the state of all clients’ open files and held locks from the shared storage and then replaying all metadata operations from all clients. This means that on failover the new metadata controller has to perform all write work, cached by its predecessor. This creates two contradictions:

  1. Between cache size and fail-over time: the more metadata changes are cached the better write performance and poorer fail-over speed.
  2. Between number of clients and fail-over times: the more clients are served – the greater amount of handles, locks and work on fail over and hence greater delay in restoring operation for all of them.

In a fully symmetrical FS the process of server failure is most natural. No owned locks are lost to recover and no handles are closed, no operations are repeated. User data and Meta data can be cached as much as RAM quantity allows for each individual server. The FS is much simpler because the main difference from non-shared FS is the shared lock protocol, all other operations being essentially the same. It supports lock caching and lock-ahead behaviors to minimize communication between nodes. The fail-over process in Melio requires nothing. The failed node is awaited for a defined time and then is declared dead. Even the recovery of its locks does not happen immediately but is postponed until they are actually required. The simplicity of the shared file system approach is far greater than a mix of technologies that are not designed from ground up for the particular purpose but are patched with one another along the way.

Soft SAN

To deploy SAN architecture onto existing set of hardware, Melio provides the ability to represent existing NAS servers as computing and storage nodes merged together. This enables the creation of a shared volume out of the internal storage already present in existing servers, while using them as computing nodes. As the need arises for more storage nodes, computing nodes or storage-and-computing nodes can be added, allowing risk-free gradual  scaling of performance.

Conclusion

Today there are many solutions for performance scalability and each of them has made different compromises based on different priorities. Common one is the desire to reuse existing technologies in order to keep development costs low. The main principle behind development of Melio, however, was to fully utilize the SAN potential in the simplest and most efficient way of aggregating its storage into one shared volume and presenting it to servers via shared file system. By designing and implementing it from the ground up we were able to make it as simple as possible without sacrificing capabilities of hardware.

Experience for yourself why over 700 customers globally have chosen Sanbolic over more costly and complex alternatives.
Deploy a free Trial of our full featured Melio software

How to Cluster SQL Servers for DB High Availability in the AWS™ Cloud

Topic: How to cluster SQL Servers for database high availability in the Amazon® AWS™ cloud.

Overview: This tech brief explains how to create a shared volume using Sanbolic Melio to enable SQL Server clustering for database high availability in the AWS public cloud.

Briefing: Since application clustering is not currently available in the public cloud, organizations interested in leveraging the economics of the cloud are finding it difficult, if not impossible, to achieve high availability and scale for their business-critical workloads. Sanbolic Melio allows AWS customers to overcome this challenge and cluster SQL Servers in the public cloud infrastructure for database high availability in order to maximize up-time for enterprise workloads dependent on SQL Server. » Read more…

Experience for yourself why over 700 customers globally have chosen Sanbolic over more costly and complex alternatives.
Deploy a free Trial of our full featured Melio software

SSD and Server-side Flash Scale Out: What to Look For

Server-side Flash Scale Out: What to Look For

Everyone and their brother seems to be promising “SSD and flash high availability and scale out” these days. But what exactly is it, why do you want it, and what features should you look for?

“Scale-out,” of course means expanding the amount of work your servers can do. The workload can be either block- or file-based, ranging from application serving to Web serving to database access. Such scale-out is only effective, of course, if it can be done as inexpensively as possible and as quickly as possible utilizing all the hardware and software platforms customers must support.

“Flash” refers to the high-speed solid-state memory deployed within servers, as opposed to the solid-state disks in storage arrays. Using this fast flash storage to create very low-cost, highly scalable and available server infrastructures is an attractive proposition for customers that need to quickly handle many more customers and much more data without extra cost or staff.

But not all the “solutions” that claim to deliver server-side flash are created equal. Some key questions to ask:

Does it create a single point of failure? High performance and scalability are not worth much if the failure of a single server can bring down an entire application or (in this virtualized era) multiple applications. Look for features such as dynamic and symmetrical clustering, load balancing, RAID and a shared file system that allows another server to instantly take over if one fails.

Does it create silos of applications or data? The same architectural constraints that make each server an island to itself can also make it harder for applications or users to share data. In today’s networked economy, such silos can cripple productivity and efficiency. The same shared file system, clustering and load balancing that eliminate single points of failure also allow any server or application to access vital data.

Does it span Flash, SSD and spinning disk? In addition to flash, most customers are also using SSDs as a storage cache layer, and low-cost high capacity JBOD (just a bunch of disks) with traditional HDD to store the vast majority of their data. To cost-effectively scale capacity across block and file storage while benefiting from the low latency and fast performance of SSD and Flash, look for solutions that support all those media types, and can intelligently route data to the most cost-effective media.

Does it provide appropriate data protection? If you’re going to run important applications on your agile infrastructure, you’ll need to protect the data. How well, and how easily, does your server-side scale out solution provide data protection if one of more drives fail? Look for solutions that support features such as snapshot and RAID.

How easy is it to manage? Using your existing server administrators to manage your storage can save you 40-50 percent on your staff costs. Look for centralized management capabilities that give server administrators the ability to monitor and manage your Flash, SSD and JBOD for maximum efficiency.

How efficiently does it use storage and server resources? The whole idea of virtualization and scale-out is to do more with less. Look for solutions that use a shared file system to allow all servers to access the same version of data without the performance and capacity hit of replication.

The flood of announcements about server-side caching and scale out won’t end anytime soon. Let us know what questions you have about what to look for, and if you want to stay informed about the intelligent software layer we’re adding to JBOD, SSDs and Flash.

Follow me on twitter @mvmsan

Related posts:

Solving Capacity Challenges with SSD and Server Side Flash

Web 2.0 infrastructures ready to move to the corporate data center

Why SSD and Flash Need More Brains to Reach the Corporate Data Center

Webinar OnDemand: Change the way you look at server-side SSD and Flash Storage with Sanbolic and Fusion IO

Experience for yourself why OVER 700 customers globally have chosen Sanbolic over more costly and complex alternatives. Melio for FREE

Experience for yourself why over 700 customers globally have chosen Sanbolic over more costly and complex alternatives.
Deploy a free Trial of our full featured Melio software
Rss Feed Tweeter button Facebook button Technorati button Reddit button Myspace button Linkedin button Webonews button Delicious button Digg button Flickr button Stumbleupon button Newsvine button Youtube button