Let’s understand the term Distributed Transactions :- A distributed transaction is a database transaction in which two or more network hosts are involved. Usually, hosts provide transactional resources, while the transaction manager is responsible for creating and managing a global transaction that encompasses all operations against such resources. Distributed transactions, as any other transactions, must have all four ACID (atomicity, consistency, isolation, durability) properties, where atomicity guarantees all-or-nothing outcomes for the unit of work (operations bundle).
On very high level- A transaction involving several physical systems, machines, or computers.
Let’s explore this via an example.
In India we have Zepto 10 minutes grocery delivery app.
We will look at business side to understand how this application making money and works.
Here we have two key points, to ensure that groceries are delivered within 10 minutes. Zepto should accept order only when.
- Grocery is available in micro store.
- Delivery partner is available.
Let’s see tech design of the above problem statement.
Only place an order if there are groceries available in the microstore and a delivery partner has been assigned.
The one thing that distributed transactions must have is atomicity, which is the most difficult to achieve here.
This signifies that either all or none of the stages are successful.
Thinking about -ve or edge cases, on very high level these should be two situations like below.
- Transaction is successful in store service database but got failed on delivery service database.
- Transaction is failed in store service database but got successful on delivery service database.
What we gonna think for above two situations - If each of them fails, we fall back on others. If not that will cause negative impact on Zepto/our business.
- Poor delivery partner experience.
- Manpower and time were lost since the micro store spent time packing and other activities.
These are two classic cases of distributed systems that we must develop in two phases.
We will split 2 phase commit flow into two parts.
- Prepare- Reserve the order
- Commit- Final Booking
Let’s draw this system into a diagram.
- Prepare-
- If both fail, the entire transaction fails, and the process is aborted.
- If we only succeed once, we will cancel the reservation and Abort, or the timer will cancel the reservation for us.
- If both succeeds, we will move ahead and commit the phase.
- Commit-
- If both succeeded, order is placed.
- If anyone fails, we will cancel the reservation and Abort
- If order service fails, timer will cancel the reservation automatically.
Now time to look at Advantage of this system
- Guaranteed ATMOIC transactions.
- Guaranteed isolation.
and grass is not always green so just we will look as slightly but not major down side of this system. bit slow.
chances for dead lock (and we know how to handle this so chill).
Note:- In case of n/w failure, we already discussed on re-try mechanism link is below.
Github 💡