Database Caching Mechanisms and Cache Consistency Issues
Description
Database caching mechanisms refer to the technology of storing frequently accessed data in high-speed storage media (such as memory) to accelerate data read operations. The cache consistency problem is the problem of keeping the data in the cache synchronized with the authoritative data in the underlying database (typically on disk), so that applications never read stale or incorrect data. This issue is particularly prominent in distributed systems and concurrent access environments.
Knowledge Point Explanation
1. Why is Caching Needed?
- Performance Bottleneck: The primary data of a database is usually stored on disk, and disk I/O (input/output) speed is much slower than memory access speed. When a large number of requests directly access the database, disk I/O can become a system bottleneck, causing response delays.
- Role of Cache: Storing hotspot data (frequently accessed data) in memory allows subsequent requests to read data directly from memory, avoiding time-consuming disk I/O, thereby greatly improving read performance and data throughput.
2. Common Cache Patterns
There are four classic patterns that handle read/write logic between the application, the cache, and the database.
Pattern One: Cache-Aside (Lazy Loading)
This is the most commonly used pattern. The cache and the database are "separate"; the application is directly responsible for interacting with both.
- Read Process:
- The application receives a read request.
- The application first attempts to read the data from the cache.
- Cache Hit: If the data exists in the cache, it is returned directly.
- Cache Miss: If the data is not in the cache, the application queries the database for the data.
- After obtaining the data from the database, the application writes it to the cache for use by subsequent requests.
- Returns the data.
- Write Process:
- The application receives a request to update data.
- The application directly updates the database.
- After a successful update, invalidates the corresponding data in the cache (deletes the key from the cache).
- Advantages: Simple to implement; the cache only contains data actually requested by the application.
- Disadvantages: Between the database update and the cache invalidation there is a brief window in which read requests can still return stale cached data (dirty data), and concurrent access makes this window harder to handle (see Section 3). A minimal sketch of the pattern follows this list.
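A minimal Cache-Aside sketch in Python, assuming a redis-py client and hypothetical `db.query_user` / `db.update_user` helpers (the key naming and TTL are illustrative, not prescribed):

```python
import json

import redis  # assumes the redis-py client; any cache with get/set/delete works

r = redis.Redis()
TTL_SECONDS = 300  # illustrative expiration; tune for your workload


def read_user(user_id, db):
    """Cache-Aside read: try the cache first, fall back to the database."""
    key = f"user:{user_id}"
    cached = r.get(key)
    if cached is not None:                           # cache hit
        return json.loads(cached)
    row = db.query_user(user_id)                     # cache miss: query the DB
    if row is not None:
        r.set(key, json.dumps(row), ex=TTL_SECONDS)  # populate for later reads
    return row


def update_user(user_id, fields, db):
    """Cache-Aside write: update the database first, then invalidate the cache."""
    db.update_user(user_id, fields)   # 1. update the database
    r.delete(f"user:{user_id}")       # 2. delete (not overwrite) the cached key
```

Note that the write path deletes the key instead of overwriting it; the next read repopulates the cache, which avoids writing a value that another in-flight update could immediately make stale.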
Pattern Two: Read-Through
This pattern uses the cache as the primary data access point. The application interacts only with the cache, not directly with the database.
- Read Process:
- The application requests data from the cache.
- If it's a cache hit, the data is returned directly.
- If it's a cache miss, the cache component itself is responsible for loading the data from the database, storing it in the cache, and then returning it to the application. The application is unaware of this process.
- Write Process:
- The application updates data. (Write strategy is usually combined with Write-Through or Write-Behind.)
- Advantages: Cleaner application code, since cache logic is encapsulated within the cache layer (see the sketch below).
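A sketch of a Read-Through wrapper, assuming a redis-py-style store and a caller-supplied `loader` function that queries the database (both names are illustrative):

```python
import json


class ReadThroughCache:
    """The application calls get(); the cache layer loads misses itself."""

    def __init__(self, store, loader, ttl=300):
        self.store = store    # e.g., a redis-py client
        self.loader = loader  # function that reads the value from the database
        self.ttl = ttl

    def get(self, key):
        cached = self.store.get(key)
        if cached is not None:        # hit: serve straight from the cache
            return json.loads(cached)
        value = self.loader(key)      # miss: the cache itself queries the DB
        if value is not None:
            self.store.set(key, json.dumps(value), ex=self.ttl)
        return value
```

The application only ever calls `cache.get(...)` and never queries the database directly for reads, which is exactly what distinguishes Read-Through from Cache-Aside.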
Pattern Three: Write-Through
- Write Process:
- The application writes data to the cache.
- The cache component synchronously writes the data to the database. The write operation is considered complete only after both the database and cache updates succeed.
- Advantages: Ensures strong consistency between the cache and the database.
- Disadvantages: Higher write latency, because each write must complete synchronously in both the cache and the database before it is acknowledged (see the sketch below).
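A Write-Through sketch under the same assumptions (redis-py-style store, hypothetical `writer` callback that persists to the database):

```python
import json


class WriteThroughCache:
    """put() returns only after BOTH the database and the cache are updated."""

    def __init__(self, store, writer, ttl=300):
        self.store = store    # e.g., a redis-py client
        self.writer = writer  # function that synchronously persists to the DB

    def put(self, key, value):
        self.writer(key, value)                              # synchronous DB write
        self.store.set(key, json.dumps(value), ex=self.ttl)  # then the cache
```

Writing the database before the cache is a deliberate ordering in this sketch: if the database write fails, the cache is left untouched.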
Pattern Four: Write-Behind (Write-Back)
- Write Process:
- The application writes data to the cache.
- The cache immediately acknowledges the write as successful but does not immediately write to the database.
- The cache asynchronously flushes the data to the database in batches, after a delay or according to some other policy.
- Advantages: Extremely high write performance, reducing write pressure on the database.
- Disadvantages: Risk of data loss (if the cache crashes, data not yet persisted is lost), and it can only guarantee eventual consistency. A sketch follows.
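A Write-Behind sketch, assuming an in-process dict as the cache and a caller-supplied `writer` that persists one batch (the flush interval is illustrative):

```python
import queue
import threading
import time


class WriteBehindCache:
    """Acknowledges writes immediately; persists them to the DB in batches."""

    def __init__(self, writer, flush_interval=1.0):
        self.writer = writer          # function that writes one batch to the DB
        self.store = {}               # in-process cache, for the sketch only
        self.pending = queue.Queue()  # writes awaiting persistence
        self.flush_interval = flush_interval
        threading.Thread(target=self._flush_loop, daemon=True).start()

    def put(self, key, value):
        self.store[key] = value         # acknowledged as soon as this returns
        self.pending.put((key, value))  # persistence is deferred (loss risk!)

    def _flush_loop(self):
        while True:
            time.sleep(self.flush_interval)
            batch = []
            while not self.pending.empty():
                batch.append(self.pending.get())
            if batch:
                self.writer(batch)      # one batched database write
```

Everything still sitting in `pending` is lost if the process crashes before the next flush, which is the data-loss risk noted above.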
3. Cache Consistency Issues and Their Solutions
The core problem is how to ensure that the data in the cache remains consistent with the data in the database after an update.
Problem Scenarios:
- Update Database First, Then Delete Cache: This is the recommended practice for the Cache-Aside pattern. However, if the database update succeeds but the cache deletion fails, subsequent read requests will get dirty data.
- Delete Cache First, Then Update Database: In high-concurrency scenarios, the following classic problem can occur:
- Thread A deletes the cache.
- Thread B finds the cache empty and reads the old value from the database.
- Thread B writes the old value to the cache.
- Thread A updates the database with the new value.
Result: The cache contains the old data written by Thread B, causing long-term data inconsistency.
Solutions:
- Delayed Double Deletion Strategy:
- First, delete the cache.
- Then, update the database.
- Sleep for a period of time (e.g., 500 ms, determined based on business read latency), then delete the cache again.
- Purpose: To clear any old data that might have been written to the cache by other threads during the "update database" time window.
- Disadvantage: The sleep time is difficult to set precisely, and the extra wait reduces throughput. A sketch of the strategy follows.
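The steps above might look like the following sketch; the delay constant is illustrative and must exceed the worst-case read-plus-write-back latency of a concurrent reader, and `db` / `cache` are hypothetical handles:

```python
import time

DELETE_DELAY_SECONDS = 0.5  # must cover a concurrent reader's read + write-back


def update_with_double_delete(key, new_value, db, cache):
    cache.delete(key)                 # 1. first deletion
    db.update(key, new_value)         # 2. update the database
    time.sleep(DELETE_DELAY_SECONDS)  # 3. wait out the stale write-back window
    cache.delete(key)                 # 4. second deletion clears stale entries
```

In practice the second deletion is often handed to a delayed task queue so the request thread does not block on the sleep.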
- Subscribe to Database Logs (e.g., MySQL Binlog):
- This is a more elegant and reliable solution.
- Process:
- The application updates the database.
- The database records its changes in the Binlog.
- An independent middleware (e.g., Canal, Debezium) subscribes to and parses the Binlog.
- This middleware sends delete commands to the cache based on the parsed data change information.
- Advantages:
- Decouples cache update logic from business code.
- Guarantees eventual consistency, because every database update is recorded in the Binlog and is therefore captured and processed by the middleware. A sketch of such a consumer follows.
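A sketch of such a consumer, assuming Debezium streams MySQL changes into a Kafka topic using its default JSON envelope (the topic name, table layout, and cache key scheme are all illustrative):

```python
import json

import redis
from kafka import KafkaConsumer  # kafka-python

r = redis.Redis()
consumer = KafkaConsumer(
    "mysql.shop.users",  # hypothetical Debezium topic for the `users` table
    bootstrap_servers="localhost:9092",
    value_deserializer=lambda m: json.loads(m.decode("utf-8")),
)

for message in consumer:
    payload = (message.value or {}).get("payload") or {}
    # Debezium's envelope carries the row image in "after" ("before" on delete).
    row = payload.get("after") or payload.get("before") or {}
    if "id" in row:
        # Invalidate rather than update, matching the Cache-Aside write path.
        r.delete(f"user:{row['id']}")
```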
4. Other Key Cache Problems
- Cache Penetration: Querying for data that simply does not exist in the database. This causes every request to miss the cache and hit the database directly, potentially overwhelming it.
- Solutions: Cache a null value for non-existent keys (with a short expiration time); use a Bloom filter to quickly determine if data certainly does not exist. Both defenses appear in the sketch below.
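A combined sketch of both defenses; `bloom.might_contain` stands in for a real Bloom-filter library and is hypothetical, as are the `db` and `cache` handles:

```python
import json

NULL_SENTINEL = "__NULL__"  # marks keys known to be absent from the database
NULL_TTL = 60               # short TTL so newly inserted rows show up soon


def read_user_safe(user_id, db, cache, bloom):
    if not bloom.might_contain(user_id):  # Bloom filter says definitely absent:
        return None                       # skip both the cache and the database
    key = f"user:{user_id}"
    cached = cache.get(key)
    if cached is not None:
        return None if cached == NULL_SENTINEL else json.loads(cached)
    row = db.query_user(user_id)
    if row is None:
        cache.set(key, NULL_SENTINEL, ex=NULL_TTL)  # cache the miss, briefly
        return None
    cache.set(key, json.dumps(row), ex=300)
    return row
```

The short TTL on the null marker matters: without it, a key that later gets inserted into the database would keep returning None from the cache.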
- Cache Breakdown: The moment a hot key expires in the cache, a large number of concurrent requests access it simultaneously, all bypassing the cache and hitting the database.
- Solutions: Use a mutex lock (Mutex Key) so that only one thread rebuilds the cache entry while the others wait, as sketched below.
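A mutex sketch using Redis's atomic `SET ... NX EX` as the lock (the key names, TTLs, and `db_loader` callback are illustrative):

```python
import json
import time


def get_hot_key(cache, key, db_loader, ttl=300):
    while True:
        cached = cache.get(key)
        if cached is not None:
            return json.loads(cached)
        # SET with nx=True is an atomic "set if absent": one caller wins the lock.
        if cache.set(f"lock:{key}", "1", nx=True, ex=10):
            try:
                value = db_loader(key)   # only the lock holder hits the database
                cache.set(key, json.dumps(value), ex=ttl)
                return value
            finally:
                cache.delete(f"lock:{key}")
        time.sleep(0.05)  # losers wait briefly, then re-check the cache
```

The 10-second expiry on the lock key guards against a crashed lock holder blocking all rebuilds forever.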
- Cache Avalanche: A large number of cache keys expire at the same moment, or the cache service itself fails, causing all requests to flood the database.
- Solutions: Set random expiration times (jitter) for different keys to avoid simultaneous expiration, as sketched below; build a highly available cache cluster (e.g., Redis Sentinel or Cluster mode).
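Adding jitter to expiration times is nearly a one-liner; the base TTL and jitter range below are illustrative:

```python
import random

BASE_TTL = 3600  # one hour


def set_with_jitter(cache, key, value):
    # Spread expirations over an extra 0-10 minutes so keys don't all die at once.
    cache.set(key, value, ex=BASE_TTL + random.randint(0, 600))
```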
Summary
Database caching is a core technology for improving system performance, but its central challenge lies in maintaining cache consistency. Cache-Aside is the most commonly used pattern in practice. Combining it with "update database first, then delete cache" and ensuring eventual consistency through subscribing to database logs (like Binlog) is a robust architectural choice. At the same time, it is essential to properly handle cache penetration, breakdown, and avalanche to build an efficient and reliable caching system.