Consistency Conundrum: The Challenge of Keeping Data Aligned

Maintaining consistency is crucial to ensure a unified view of the data, which is essential for the correct functioning of distributed applications.

Ammar Husain

Jan. 15, 25 · Analysis

Likes (0)

Comment

Save

2.1K Views

A system may store and replicate its data across different nodes to fulfill its scaling, fault tolerance, load balancing, or partitioning needs. This causes data synchronization issues, read-write conflicts, causality problems, or out-of-order updates. These issues arise due to concurrent updates on copies of the same data, network latency or network partition between nodes, node or process crashes, and clock synchronization, to name a few.

Due to these issues, the application may read stale or incorrect data. Non-repeatable reads may occur, and own writes may not be read, either! The solution to these common problems of a distributed system is to maintain consistency, i.e., keep the data aligned.

Moreover, maintaining consistency is crucial for ensuring that all nodes have a unified view of the data, which is essential for the correct functioning of distributed applications.

Definition

As per Merriam-Webster, consistency is defined as "agreement or harmony of parts or features to one another or a whole." While this definition may seem intuitive, with every context, consistency has a different meaning.

When used in the context of transactions (ACID), it refers to the database being in a perceived good state. The good state definition is application-specific.
In CAP theorem, it refers to linearizability, i.e., even if the system maintains multiple data copies, they appear as if there is a single copy — simply strong or atomic consistency.
While managing large clusters, consistent core refers to a centralized approach for managing cluster membership and cluster metadata.
With a replicated system, it denotes the replica states will converge to a common state sometime in the future, thus achieving eventual consistency.
While rebalancing, consistent hashing refers to an approach for an even distribution of workload across partitions.

An article isn’t enough to cover all these variations. Thus, the main focus of this article will be on managing the consistency in distributed systems only.

Types of Consistency

Consistency requirements vary from system to system. Thus, it is imperative to understand the various types of it, along with associated trade-offs.

Strong Consistency or Linearizability

A system that applies an update immediately to each copy of data it maintains is deemed as strongly consistent. Thus, all reads, even from different nodes, will always return the latest update.

Figure 1: Strong Consistency (simplified view) where all updates are written immediately to every node

Figure 2: Sequential Consistency (copy to different nodes omitted for brevity) — *Figure 2:* *Sequential Consistency (copy to different nodes omitted for brevity)*

Figure 3: Eventual Consistency — *Figure 3*: *Eventual Consistency*

Figure 4: Weak Consistency — *Figure 3*: *Eventual Consistency*

Data consistency Data (computing) Consistency model

Published at DZone with permission of Ammar Husain. See the original article here.

Opinions expressed by DZone contributors are their own.

Consistency Conundrum: The Challenge of Keeping Data Aligned

Maintaining consistency is crucial to ensure a unified view of the data, which is essential for the correct functioning of distributed applications.

Definition

Types of Consistency

Strong Consistency or Linearizability

Sequential Consistency

Eventual Consistency

Weak Consistency

Causal Consistency

References and Further Reading

Future Trends

Hybrid Consistency

Context or Situation-Aware Consistency

Blockchain and Distributed Ledger

Edge, Serverless, and Quantum Computing

Autonomous System

Partner Resources

Related

Trending

Consistency Conundrum: The Challenge of Keeping Data Aligned

Maintaining consistency is crucial to ensure a unified view of the data, which is essential for the correct functioning of distributed applications.

Definition

Types of Consistency

Strong Consistency or Linearizability

Sequential Consistency

Eventual Consistency

Weak Consistency

Causal Consistency

References and Further Reading

Future Trends

Hybrid Consistency

Context or Situation-Aware Consistency

Blockchain and Distributed Ledger

Edge, Serverless, and Quantum Computing

Autonomous System

Related

Partner Resources