A Beginner’s Guide to Self-Healing Storage

When thinking of a traditional filing system, what comes to mind? Perhaps a categorical or hierarchical set of descriptors used to organize and store information until it’s ready for use. For data stored on disks or in the cloud, a similar structure is applied but on a broader scale.

Not only is the process of organizing, storing, and managing unstructured data important, but so is the need to ensure the durability of that data when the time comes to retrieve it. While there are bound to be setbacks such as disk failures and server crashes, the key to a reliable system is its ability to combat these issues, or “self-heal,” without impacting the data or user productivity.

Using a hierarchical namespace to organize, store, and manage unstructured data is a common strategy used by applications and end users alike. The software layer that manages this namespace—on top of the physical storage—is conventionally known as a file system. There are many file systems in wide use today, with some of the more popular options being UFS, ext4, NTFS, XFS, HFS+, and VxFS.

A typical file system needs to provide basic functionality (illustrated in the short sketch after this list), including the ability to:

  • Store, modify, and retrieve data for a specified file, or a portion of it
  • List the contents of a folder, including the various attributes of its children, such as size, modification time-stamps, access control lists (ACLs), etc.
  • Manage the free space of the physical device
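
As a rough illustration of how an application exercises these basics, the short sketch below uses only Python's standard library; the path is hypothetical and the snippet is meant purely as an example of the three bullets above.

    import os
    import shutil

    path = "/tmp/example/notes.txt"          # hypothetical file
    folder = os.path.dirname(path)
    os.makedirs(folder, exist_ok=True)

    # Store and retrieve data for a specified file
    with open(path, "w") as f:
        f.write("hello, file system")
    with open(path) as f:
        data = f.read()

    # List the folder's children along with a few of their attributes
    for name in os.listdir(folder):
        st = os.stat(os.path.join(folder, name))
        print(name, st.st_size, st.st_mtime)

    # Query the free space of the underlying device
    print("free bytes:", shutil.disk_usage(folder).free)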

A single file system operation may involve modification not only of the data but also of information about the data itself, i.e., the metadata. While performing these operations, a good file system needs to ensure:

  • Durability of data, from the time it was written until the time (should it come) that it is permanently deleted
  • Consistency of metadata
  • Good performance and scalability, in terms of service times and throughput

While this may seem obvious, the fact is that software and hardware can and do crash, whether partially or totally, temporarily or permanently—making these requirements more than simply an afterthought.

In a conventional file system, a file is represented internally by an inode. An inode is not the data itself but is, in essence, a list of the disk blocks that contain the actual file data. In addition, it contains a file’s attributes, including modification time-stamp, size, and ACL. Similarly, a folder’s inode contains a list of its children and their inode numbers:

[Figure: an inode holding a file’s attributes and its list of data blocks; a folder’s inode lists child names and their inode numbers]
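
To make the idea concrete, here is a minimal, purely illustrative sketch of what an inode-like record might hold; the field names are invented for this example and do not reflect any particular file system's on-disk layout.

    from dataclasses import dataclass, field
    from typing import Dict, List

    @dataclass
    class Inode:
        inode_no: int
        size: int = 0
        mtime: float = 0.0                                     # modification time-stamp
        acl: List[str] = field(default_factory=list)           # access control list
        data_blocks: List[int] = field(default_factory=list)   # disk blocks holding the file data

    @dataclass
    class DirInode(Inode):
        # A folder's inode maps each child's name to its inode number
        children: Dict[str, int] = field(default_factory=dict)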

Performing a single file system operation, such as creating a file, involves multiple metadata update operations; for example, creating a new inode and its directory entry in the parent folder:

[Figure: the metadata updates involved in creating a new file, i.e., a new inode plus a directory entry in the parent folder]

The danger in any multi-step operation is that a failure may occur at any point—the process may crash, the disk may lose the update, etc. These failures become apparent when the operating system does not show the created file, refuses to create a file with the same name, or exhibits a resource leak.
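
A toy sequence makes this failure window visible. Everything below is illustrative only: an in-memory list stands in for the disk, and the two appends stand in for the two metadata writes.

    # Toy model of the two metadata updates behind "create a file".
    persisted = []   # stands in for what has actually reached the disk

    def create_file(parent, name, inode_no):
        # Step 1: write the new file's inode
        persisted.append(("inode", inode_no))
        # <-- a crash at this point leaves an inode on disk that no directory
        #     entry references: a resource leak, and the file is invisible

        # Step 2: write the directory entry in the parent folder
        persisted.append(("dirent", parent, name, inode_no))
        # Only once both updates have landed is the operation fully consistent.

    create_file("/home/user", "notes.txt", 42)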

There are two standard approaches for dealing with such failures:

  1. Use transactional mechanisms to update multiple metadata objects. These typically adhere to ACID guarantees (Atomicity, Consistency, Isolation, Durability). In a transactional system, general failures arising out of a process or system crash are usually handled cleanly—the entire transaction is rolled back or rolled forward, using redo/undo transaction logging (a minimal sketch of this idea follows the list).
  2. Use ordered updates. Here, multiple updates are ordered in such a way that, at any point, a partial list of updates is safe (from an overall system behavior perspective). Periodically, though, these incomplete or partial updates need to be cleansed in order to free up space on the physical device. For more detail on ordered updates, read the seminal paper, Soft Updates, from the ACM Transactions on Computer Systems, Vol. 18, No. 2, May 2000.
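
To illustrate the first approach, the toy journal below groups the two metadata writes into one redo record, so that recovery after a crash can replay a committed operation in full or discard a partial one. This is only a sketch of write-ahead logging under invented names, not any real file system's format.

    import json

    journal = []   # stands in for an on-disk redo log

    def apply_ops(ops):
        # A real system would update inodes and directory entries in place here.
        for op in ops:
            pass

    def create_file_transactional(parent, name, inode_no):
        ops = [["write_inode", inode_no],
               ["write_dirent", parent, name, inode_no]]
        journal.append(json.dumps(ops))   # 1. log the full intent (redo record)
        journal.append("COMMIT")          # 2. commit marker; a crash before this means rollback
        apply_ops(ops)                    # 3. apply the in-place metadata updates

    def recover():
        # After a crash, replay only fully committed records and drop partial ones.
        for record, marker in zip(journal[::2], journal[1::2]):
            if marker == "COMMIT":
                apply_ops(json.loads(record))

    create_file_transactional("/home/user", "notes.txt", 42)

The second approach avoids the journal entirely by ordering the writes so that any prefix of them is safe, at the cost of the periodic cleanup described above.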

Traditionally, file systems have deployed offline utilities such as fsck (File System Consistency Check) or chkdsk (Check Disk) to fix such metadata inconsistencies and restore sanity. Because these tools run offline, they imply downtime or an outage for the file system. Depending on the circumstances, this may lead to an extended outage—adversely impacting the productivity of end users and creating frustration for IT admins.

For a cloud-scale file system, which is designed to be functional around the clock and accessible to millions of users across the globe, challenges like these must be kept to a minimum, if not eliminated entirely.

How Druva Uniquely Leverages AWS Storage

To combat these issues, Druva products make use of a custom file system. The key features of the Druva cloud file system are:

  • Source-side data deduplication (a.k.a. dedupe)
  • Continuous data protection
  • Compressed and encrypted data storage, both in-transit and at-rest
  • Policy-based data retention

The Druva cloud file system addresses two critical concerns regarding data reliability:

Durability

The Druva cloud file system is hosted on Amazon’s public cloud, utilizing AWS S3 for data storage. S3 is the industry leader, designed to provide 99.999999999% durability of objects and the ability to sustain the concurrent loss of data in two facilities.

Druva’s cloud file system also uses the AWS DynamoDB service to manage its metadata. Amazon DynamoDB synchronously replicates data across three facilities within an AWS Region, ensuring durability for the file system’s metadata as well.
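
This split between bulk data in S3 and metadata in DynamoDB can be pictured with a small boto3 sketch. The bucket name, table name, and item schema below are invented for illustration and are not Druva's actual layout.

    import boto3

    s3 = boto3.client("s3")
    table = boto3.resource("dynamodb").Table("file-metadata")   # hypothetical table name

    def store_block(block_id: str, payload: bytes, file_id: str, offset: int):
        # Bulk data goes to S3, which provides the durability figures cited above
        s3.put_object(Bucket="backup-blocks", Key=block_id, Body=payload)   # hypothetical bucket
        # Metadata (which block belongs to which file, and where) goes to DynamoDB
        table.put_item(Item={"file_id": file_id, "offset": offset, "block_id": block_id})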

When inSync is hosted on-premises, Druva’s cloud file system uses the local file system to store data and an embedded BerkeleyDB database engine to manage metadata. Data—and database—durability remains a top priority, so the system makes use of the reliability mechanisms of the underlying disk subsystem, i.e., RAID storage. Redundancy may also be achieved via the dual-destination backup feature in inSync.

Availability

Druva’s inSync cloud service is hosted on Amazon Elastic Compute Cloud (EC2) instances, is accessible over the WAN, and serves millions of devices and their backups across the globe. Operating at this scale means that extended outages for cleaning up inconsistencies anywhere in the system are simply not acceptable, and high availability is essential. Failover needs to be seamless, even when individual EC2 instances fail.

On-premises, Druva inSync runs inside customers’ data centers and allows for high availability via the dual-destination backup feature mentioned earlier. Availability is no less of a priority for on-premises deployments, as tens of thousands of devices are backed up regularly to Druva inSync.

Self-Healing Storage

Like any other file system, the Druva cloud file system may face crashes, in the form of process failures, network disconnects, etc. In addition, database entries can be lost due to disk corruption or other failures. Even well-intentioned anti-virus software can wreak havoc if it is misconfigured.

At these large scales, bringing down services to regularly detect and correct inconsistencies is simply not feasible. It is crucial for inSync that the Druva cloud file system continues to serve both backup and restore requests, despite any possible storage inconsistencies. After all, the last thing anyone wants to see is a restore failure!

To achieve this, a restore is simulated for the latest snapshot of each device as a regular inSync maintenance procedure. If an inconsistency is detected during the simulated restore, it is purged, ensuring that the snapshot remains restorable, even though it may be missing a few files. This guarantees that if a restore is attempted for the snapshot, it won’t fail due to metadata inconsistencies of any kind.

inSync then forces a full backup to confirm that the subsequent snapshot will be clean and fully restorable. In this way, Druva storage ensures restorable snapshots for all mobile devices or laptops being backed up.
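
Conceptually, that maintenance pass looks something like the sketch below. The structures and helper callbacks are hypothetical stand-ins for internal operations, not Druva's actual code.

    def latest_snapshot(device):
        # Hypothetical helper: the most recent snapshot for a device.
        return device["snapshots"][-1]

    def simulate_restore(snapshot):
        # Walk the snapshot's metadata exactly as a real restore would, without
        # writing anything, and collect the entries that cannot be resolved.
        return [entry for entry in snapshot["entries"] if entry.get("broken")]

    def maintenance_pass(devices, purge, schedule_full_backup):
        for device in devices:
            snapshot = latest_snapshot(device)
            bad = simulate_restore(snapshot)
            if bad:
                purge(snapshot, bad)            # drop unreadable entries; the snapshot stays restorable
                schedule_full_backup(device)    # the next snapshot is then clean and complete

    maintenance_pass(
        [{"snapshots": [{"entries": [{"name": "a.txt"}, {"name": "b.txt", "broken": True}]}]}],
        purge=lambda snapshot, bad: print("purging", [e["name"] for e in bad]),
        schedule_full_backup=lambda device: print("full backup scheduled"),
    )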

There are other possible inconsistencies which may not impact the restore process but may prevent compaction or incremental backups of the device. To detect and fix them, the Druva cloud file system has its own fsck functionality, with the ability to detect, report, and fix inconsistencies.
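
At its core, such a check cross-references metadata: every allocated block should be referenced by some file, and every reference should point at a block that exists. The minimal sketch below flags both kinds of mismatch, using invented structures rather than the actual storage format.

    def check_consistency(files, allocated_blocks):
        # files: mapping of file id -> list of block ids it references (illustrative)
        referenced = {b for blocks in files.values() for b in blocks}

        leaked = allocated_blocks - referenced    # allocated but unreachable: safe to reclaim
        dangling = referenced - allocated_blocks  # referenced but missing: report and repair

        return {"leaked": leaked, "dangling": dangling}

    report = check_consistency({"file-1": ["b1", "b2"], "file-2": ["b3", "b9"]},
                               {"b1", "b2", "b3", "b4"})
    # report == {"leaked": {"b4"}, "dangling": {"b9"}}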

Both of these mechanisms run in the background during off-peak hours, as a regular, scheduled maintenance procedure—allowing for minimal impact to end users.

Given the scale at which Druva storage operates, it would be almost impossible to manually detect and fix metadata inconsistencies. Making the process automated and self-healing is the only way Druva’s serviceability could scale at these levels and continue to provide the data durability our customers expect.

Interested in learning more? Sign up for a personal demo and discover how Druva can help your enterprise.

Shekhar Deshkar

Shekhar Deshkar leads Druva’s storage engineering team as Chief Architect. He has been associated with Druva for more than four years. Prior to Druva, he worked at Marvell Inc. and Symantec Corporation (formerly Veritas). Shekhar’s primary area of focus has been file systems and related storage technologies, including caching, transactional systems, snapshotting, clustering, distributed file system protocols, and flash/SSDs. Shekhar enjoys taking challenges head-on in areas of concurrency, scalability, and performance of distributed storage systems. Shekhar loves it most when his work simplifies day-to-day life for Druva customers.
