HBase Accelerated Writes in HDInsight, Failure Recovery

Question

HBase Accelerated Writes in HDInsight, Failure Recovery

prathap sagar 0

I'm trying to understand the durability and performance trade-offs of enabling HBase Accelerated Writes in HDInsight.

Specifically

What happens if the local disk copy fails or becomes corrupted before the asynchronous sync to ADLS Gen2 occurs?
- Is there a possibility of data loss?
- Does HDInsight rely solely on local WAL, or is any redundancy maintained?
- How does recovery work in this scenario?
How does the write-latency improvement compare against the potential durability or recovery risk?
- What kind of performance gains should be expected?
- Is accelerated write recommended only for certain types of workloads?

Anoop Sam John 0 Reputation points Microsoft Employee

2025-11-28T03:59:40.8666667+00:00

In Accelerated write clusters, HBase WAL will be written to the premium managed disks attached to RegionSever's worker node VMs. WAL data as such wont be kept other places like ADL gen2 or so. But there is replication of (default 3) the same data. HDFS will replicate to 3 places. Means it will be written to 3 disks attached to different 3 VMs. If all of these 3 disks get data corruption/loss, then only data loss.

3 answers

Your answer

Anoop Sam John 0 Reputation points Microsoft Employee

2025-11-28T03:59:40.8666667+00:00

In Accelerated write clusters, HBase WAL will be written to the premium managed disks attached to RegionSever's worker node VMs. WAL data as such wont be kept other places like ADL gen2 or so. But there is replication of (default 3) the same data. HDFS will replicate to 3 places. Means it will be written to 3 disks attached to different 3 VMs. If all of these 3 disks get data corruption/loss, then only data loss.

Answer 1

Deleted

This answer has been deleted due to a violation of our Code of Conduct. The answer was manually reported or identified through automated detection before action was taken. Please refer to our Code of Conduct for more information.

Comments have been turned off. Learn more

Answer 2

Hi @prathap sagar

Thank you for contacting to Microsoft QA, below are the few detailed steps to mitigate the reported steps -

You are exploring how to enable HBase Accelerated Writes in Azure HDInsight and have questions about durability, performance trade-offs, and recovery. Let’s walk through each point:

What happens if the local disk copy fails before syncing to ADLS Gen2?

If the local Write Ahead Log (WAL) gets corrupted or fails before syncing to ADLS Gen2, that data could be lost—unless it’s replicated. HDInsight helps reduce this risk by keeping three copies of the WAL on different managed disks. So, unless all three copies fail at the same time, the chance of losing data is very low.

Is there a possibility of data loss?

Yes, but it’s rare. Data loss would only occur if all three WAL copies fail simultaneously, which is unlikely because HDInsight automatically replicates WAL data across multiple disks.

Does HDInsight rely only on local WAL?

No. It adds redundancy by replicating the WAL across three disks connected to different VMs, which improves durability.

How does recovery work?

If there’s a failure, HBase uses the WAL to replay recent updates that weren’t flushed to disk. Thanks to replication, the system can recover using these WAL copies.

How does write-latency improvement compare to durability risk?

Accelerated Writes can make writes 4–10 times faster by using premium SSD-managed disks instead of writing directly to cloud storage. This is great for performance, but it does introduce some durability risk if all WAL copies fail. It’s best suited for write-heavy workloads where speed is critical.

What kind of performance gains should you expect?

Expect a big boost—writes can be 4–10 times faster, which is especially helpful for applications that handle a lot of writes.

Is Accelerated Writes recommended for all workloads?

Not necessarily. It’s most useful for write-intensive workloads. If your app is mostly read-heavy, you probably don’t need it.

Additional Tips:

Make sure your cluster has at least three worker nodes for durability and recovery.
Always flush and disable HBase tables before making changes to avoid losing WAL data.

References:

Anonymous

2025-12-05T03:45:33.4766667+00:00

Hi @prathap sagar

I hope you had a chance to review the information shared earlier, and I hope this information has been helpful! If you still have questions, please let us know what is needed in the comments so the question can be answered.
Anonymous

2025-12-08T06:47:34.14+00:00

Hi @prathap sagar

If this solution helped resolve your issue, please consider clicking ‘Accept Answer’ or giving it an upvote to help others find it easily.

Answer 3

Enabling HBase Accelerated Writes in HDInsight offers significant performance improvements by writing the Write Ahead Log (WAL) to premium SSD-managed disks instead of cloud storage. However, there are important considerations regarding durability and recovery:

What happens if the local disk copy fails or becomes corrupted before the asynchronous sync to ADLS Gen2 occurs?
- Possibility of Data Loss: If the local disk copy fails or becomes corrupted before the asynchronous sync to Azure Data Lake Storage (ADLS) Gen2 occurs, there is a risk of data loss. The WAL serves as a commit log, and if updates are only present in the local WAL and not yet synced, they could be lost if the disk fails.
- Redundancy Maintenance: HDInsight does not rely solely on the local WAL for durability. The Accelerated Writes feature is designed to improve write performance, but it is crucial to ensure that data is eventually synced to a more durable storage solution like ADLS Gen2 to maintain redundancy.
- Recovery Process: In the event of a failure, recovery would typically involve replaying the WAL to restore the data to the last committed state, but this is contingent on the WAL being intact and not corrupted.
How does the write-latency improvement compare against the potential durability or recovery risk?
- Performance Gains: Users can expect significant improvements in write-latency due to the use of premium SSD-managed disks, which provide excellent I/O performance. This is particularly beneficial for write-intensive workloads, as it helps mitigate the bottleneck caused by writing to cloud storage.
- Workload Recommendations: Accelerated Writes are generally recommended for workloads that are write-intensive and can tolerate some risk of data loss in the event of a local disk failure. However, for critical data where durability is paramount, it is advisable to ensure that data is regularly synced to ADLS Gen2 or implement additional redundancy measures.

In summary, while Accelerated Writes can enhance performance, careful consideration should be given to the potential risks of data loss and the importance of syncing to durable storage solutions for critical applications.

References:

Share via

HBase Accelerated Writes in HDInsight, Failure Recovery

3 answers

Your answer