We will be analyzing schedules of transactions in a system
that uses some form of timestamp-based concurrency control.
In each exercise, there are three transactions (T1, T2 and T3)
and three data items (X, Y and Z).
You should assume that:
The read and write timestamps of all of the data items are
initially 0.
The timestamps assigned to the transactions are:
TS(T1) = 100
TS(T2) = 200
TS(T3) = 300
These timestamps are consistent with the fact that, in the
schedules we’ll be analyzing below, T1 starts first, followed by T2,
followed by T3.
In giving your answers, you should fill in a table that begins as
follows:
request
response
state changes and/or explanation
r1(Y)
allowed
RTS(Y) = 100
The third column should include any changes to the state maintained
for the data item. If a request is denied or ignored, that column
should also include a brief explanation of why.
You do not need to restart a transaction that is rolled back,
which means that you can skip any requests by a transaction that come
after the point at which it is rolled back.
Exercise 1: commit bits; one version of each data item
In this exercise, we will assume that the DBMS is using
timestamp-based concurrency control with commit bits and
with only one version of each data item.
What is the purpose of maintaining commit bits?
To prevent dirty reads, and to thereby produce schedules that are
recoverable and cascadeless.
Note that the RTS doesn’t change in this case! The RTS is not
necessarily the timestamp of the most recent reader. Rather, it is
the timestamp of the reader that comes latest in the equivalent
serial schedule. These values may not be the same because reads
don’t conflict with each other and thus they don’t need to follow
the timestamp ordering.
w3(Y):
first test: TS(T3) ≥ RTS(Y), 300 ≥ 100, true
second test: TS(T3) ≥ WTS(Y), 300 ≥ 200, true, so allow
change: WTS(Y) = TS(T3) = 300
c(Y) remains true, and T3 becomes Y’s most recent writer.
Note that the WTS is always the timestamp of the most recent
writer, since writes are only allowed if they follow the timestamp
ordering – i.e., if the transaction requesting the write comes
later in the timestamp ordering than the writer of the
previous value.
r3(X):
test: TS(T3) ≥ WTS(X), 300 ≥ 200, true
second test: c(X) is false, so make T3 wait, which prevents
what would otherwise be a dirty read, because the writer of the
prior value of X (T2) has not yet committed.
If we weren’t using commit bits, this read would be allowed!
c2: T2 commits
c(X) is set to true because T2 is the writer of X’s current value.
Note that we do not set c(Y) to true. T2 did write Y,
but T3 overwrote that value, so T2 is not the writer of Y’s
current value.
If we weren’t using commit bits, we could ignore the commits,
since they aren’t relevant in the absence of commit bits
r3(X): now that c(X) is true, T3 is allowed to try again
and the read is allowed.
c3: T3 commits.
c(Y) is set to true because T3 is the writer of Y’s current value.
w1(Y):
first test: TS(T1) ≥ RTS(Y), 100 ≥ 100, true
second test: TS(T1) ≥ WTS(Y), 100 ≥ 300, false, so don’t allow
third test: c(Y) is true, so ignore the write.
Note that the write cannot be allowed, because the
current value of Y was written by T3, which comes after T1 in the
equivalent serial schedule based on the timestamps. The DBMS
ignores the request—treating T1’s write as if it occurred before
the write by T3 and was overwritten.
Note: If c(Y) were false in this case, T1 would be made to wait,
since the value that it wants to write may be needed if (1) the
writer of Y’s current value is rolled back and (2) there is
another transaction waiting to read Y when the rollback occurs.
If we weren’t using commit bits, we wouldn’t need the third
test to know that we should ignore the write.
r1(X):
test: TS(T1) ≥ WTS(X), 100 ≥ 200, false, so deny and roll back
Note that the WTS tells us that T2 wrote the current value of
X. T1 comes before T2 in the equivalent serial schedule, so it
should have read the original value of X instead of its current
value. Thus, this read cannot be allowed. The system rolls back
T1 and will later restart it with a larger timestamp.
Since T1 is the latest reader of Y (RTS(Y) == 100), we restore
RTS(Y) to its prior value of 0.
c1: T1 has already been rolled back, so this action doesn’t occur.
Exercise 2: no commit bits; multiple versions of each data item
In this exercise, we will assume that the DBMS is using
timestamp-based concurrency control without commit bits and
with multiple versions of each item.
What is the purpose of maintaining multiple versions?
To prevent rollbacks that stem from reading an item.
Consider the following schedule (which is slightly different than
the one from Exercise 1):
only test: TS(T1) ≥ RTS(Z(0)), 100 ≥ 300, false, so
deny and roll back.
In the equivalent serial schedule based on the timestamps,
T3 (and T2 as well!) should have read the version of Z
that T1 wants to write, but they didn’t, so T1’s write is
too late.
We review all of the versions of the data items, and
we see that RTS(X(0)) and RTS(Y(0)) both equal TS(T1).
Therefore, as part of rolling back T1, we restore the
previous RTS values of those versions.