It graphically illustrates that the effective time period of this version is wholly included in the effective time period of an episode of its parent object, client C903, that episode be
Trang 1date of Now().2So if it were those versions that were the parents
in a TRI relationship, this process would continually invalidate
temporal foreign keys (TFKs) by ending the assertion time of
the versions they refer to
Temporal Referential Integrity: The Basic
Diagram
Figure 11.1 is the basic diagram we will use in our discussion
of temporal referential integrity It consists of timelines for three
objects Besides policy P861, there is a timeline for client C882
and for client C903 The dotted-line vertical arrows represent
temporal foreign key (TFK) relationships from a child version
to a parent episode Parent episodes are underlined to
empha-size that those vertical arrows are not pointing to specific
vers-ions, but rather to entire episodes
The shaded rectangle on the left covers the effective time
period of version 2 of episode P861-A, which extends from July
2010 to May 2011 It graphically illustrates that the effective time
period of this version is wholly included in the effective
time period of an episode of its parent object, client C903, that
episode being C903-A It also graphically shows why a TRI
relationship is between a child version and a parent episode
No single version of C903-A could be a TRI parent to
P861-A(2), because no single version of C903-A covers [Jul 2010 –
May 2011], the effective time period for P861-A(2).3
The shaded rectangle on the right covers [Oct 2013 – 12/31/
9999] This is the effective time period of P861-C(8) In this case,
a single parent version effective time includes (i.e [fills-1]) that
child version, but that is merely happenstance For example,
suppose that we wanted to change client C882’s name from
“Smith” to “Jones”, effective May 2014 This would make the
effective time period of C882-C(4) [Sep 2013 – May 2014] But
if that happens, there would be no version of C882-C that could
2
This, of course, is a description of a basic temporal update transaction But a similar
description of the mechanics of non-basic temporal updates leads to the same
conclusion, that TFKs do not point to specific versions in a parent asserted version
table.
3 We use the notation X-{A, B, Z} to denote an episode of an object Thus, C882-B
denotes episode B of client C882 We use the notation E(n) to denote a version of
an episode Thus, P861-A(2) denotes version 2 of policy P861, included within episode
A Note, however, that it only happens to denote the second version of that episode.
For example, P861-C(8) denotes version 8 of that policy, but that version is the second
version of that episode, not the eighth one.
Trang 2be a TRI parent to P861-C(8) The new C882-C(5) goes into effect
on May 2014, so its effective time period does not cover the ear-lier clock ticks in P861-C(8) And C882-C(4) ends its effectivity on May 2014, so its effective time period does not cover the ongoing effectivity of P861-C(8), whose effective time period is, once again, [Oct 2013 – 12/31/9999]
As in the previous chapter, we assume for now that all relationships exist within current assertion time, and that all temporal transactions specify an assertion time of [Now() – 12/ 31/9999] We also assume that delete transactions against clients cascade down to the policies that they own, in accordance with the metadata declaration made in the Temporal Foreign Key metadata table, shown in Figure 8.4
We can read the somewhat schizophrenic history of policy P861 from this diagram.4 Think of a vertical line running from the top to the bottom of the diagram, and initially positioned
at January 2010 As time passes, this line moves to the right The history of P861 is recorded in the begin and end dates of its versions So as that line reaches each such date, there is a change in the state of P861
AsFigure 11.1shows, the policy was originally owned by cli-ent C882 The only episode of C882 whose effective time period included that of P861, at the time P861-A(1) was created, was C882-A And so that became the episode of client C882 that the policy pointed to
The next thing that happened was that, on July 2010, P861 changed hands At that time, ownership was transferred to client C903 The only episode of C903 that existed at that time was C903-A, and so that became the parent episode to P861, begin-ning on that date This change of ownership is recorded in ver-sion 2 of P861-A Note that C903-A became effective on April
2010, two months after P861-A did If episodes were the child managed objects in TRI relationships, then this relationship would be invalid But they are not C882-A is the parent to P861-A(1) C903-A is the parent to P861-A(2)
The third event in the life of P861 was a delete cascade issued against client C903 As of May 2011, C903 was no longer
a client Because C903 owned policy P861 at that point in time, the policy’s existence was terminated on that same date, May 2011
4 Schizophrenic in that the policy can’t make up its mind which client it belongs to.
As unlikely as such a policy history might be, in the real world, it will have to serve as
an example of how TRI relationships are managed.
Trang 3The next event in the life of this policy occurred in November
2011 It took place as part of the same event in which client C882
was reinstated On that date, a second episode of client C882
began, and a second episode of policy P861 began also, and was
designated as a policy owned by C882 After that, three changes
occurred to the policy between November 2011 and January
2013, but none of them changed the ownership of the policy
The fifth event in the life of the policy was that client C882
asked to terminate her relationship with our company as of
January 2013 Since she owned P861 at that time, and would still
own it on that termination date, the policy was terminated along
with the client
Four months later, on May 2013, policy P861 was reinstated
and assigned to client C902 So a third episode of the policy
was created, P861-C It was an open-ended episode, one with
an effective end date of 12/31/9999, and so the only owner that
could be assigned to it would be one with an open-ended
epi-sode that began on or before May 2013 Fortunately, client
C903 had such an episode, having been reinstated, after a
5-month absence, with episode C903-C
With this information as part of our production data, we
know, at any point in the history of policy P861, who its owner
was and when and for how long she had been the owner
For any claims submitted for medical services provided to either
C903 or C882, no matter how delayed the filing of those claims
may have been, we know exactly when each client was covered
by that policy and exactly when she was not covered by it—an
essential piece of information needed to pay claims correctly
And we don’t have to go digging in archival storage, or
histor-ical data warehouses, for that information—which, in a high
transaction volume claims processing system, is a very good
thing That historical data exists in the same table as data about
current policies and their current owners The service date on
the claim selects the correct version of the policy, and that
ver-sion points to its owner If its owner is not the person for whom
the claim is submitted, the claim is rejected
Foreign Keys and Temporal Foreign Keys
Before proceeding, let’s remind ourselves of the difference
between (i) foreign keys (FKs), the relationships they implement
and the constraints they impose, and (ii) temporal foreign keys
(TFKs), the relationships they implement and the constraints
they impose
Trang 4A foreign key is a column in a relational table whose job is to relate rows to other rows.5If the foreign key column is declared to the DBMS to be nullable, then any row in that table may or may not contain a value in its instance of that column But if it does con-tain a value, that value must match the value of the primary key of a row in the table declared as the target table for that foreign key For non-nullable foreign keys, of course, every row in the source table must contain a valid value in its foreign key column
In addition, once the FK relationship is declared to the DBMS, the DBMS is able to guarantee that the two managed objects—the child row and the parent row—accurately reflect the existence dependency between the objects they represent
It does so by enforcing the constraint expressed in the declara-tion, the constraint that if the child row’s FK points to a parent row, that parent row must have existed in its table at the time the child row was added to its table, and must continue to exist
in the parent table for as long as the child row exists in its table and continues to point to that same parent
This is a somewhat elaborate way of describing something that most of us already understand quite well, and that few of
us may think is worth describing quite so carefully—that foreign keys relate child rows to parent rows and that, in doing so, they reflect a relationship that exists in the real world We have gone
to this length in order to be very clear about both the semantics and the mechanics of foreign keys—semantics described in our talk about objects, and mechanics in our talk about managed objects—and to place the descriptions at a level of generality where the semantics and mechanics of TFKs can be seen as analogous to those of the more familiar FKs So if we use an
“X/Y” notation in which the “X” term is part of the referential integrity description and the “Y” term is part of the temporal ref-erential integrity description, we have a description which makes
it clear that temporal referential integrity really is temporalized referential integrity, that TRI is RI as it applies to temporal data That description is given in the following paragraph
Once the FK/TFK relationship is declared to the DBMS/AVF, the DBMS/AVF is able to guarantee that the two managed objects—the child row/version and the parent row/episode— accurately reflect the existence dependency between the objects they represent Each does so by enforcing the constraint expressed in the declaration, the constraint that if the FK/TFK in the child row/version points to a parent row/episode, that parent
5 We will assume that all primary and foreign keys consist of single columns, since the complications that arise with multi-column keys are irrelevant to this discussion.
Trang 5row/episode must have existed in its table/be currently asserted
and currently effective at the time the child row/version was
added to its table, and must continue to exist/be currently
asserted and currently effective in the parent table for as long as
the child row/version exists/is currently asserted and currently
effective in its table and continues to point to that same parent
TFKs: A Data Part and a Function Part
As a data element, a TFK is a column in an asserted version
table whose job is to relate child managed objects to parent
managed objects Of course, the same may be said of FKs The
difference is that the parent managed object of a FK is a
non-temporal row, while the parent managed object of a TFK is a
group of possibly many rows A TRI child table is an asserted
version table that contains a TFK A TRI parent table is an
asserted version table referenced by a TFK The FK reference is
a data value, and is unambiguous; but the TFK reference, as a
data value, is not unambiguous
So as a data element, all a TFK can do is designate the object
on which the object represented by its own row is existence
dependent There may be any number of versions representing
that object in the parent table, and those versions may be
grouped into any number of episodes scattered along the
asser-tion and effective time timelines So as a data value, a TFK
refer-ence is incomplete
For example, a TFK data value in a Policy table references all
the episodes in a Client table which represent the client on
which that policy is existence dependent, that being the client
whose oid matches the data value in the TFK To complete the
reference, we need to identify, from among those episodes, the
one episode which was in effect when the policy version went
into effect, and will remain in effect as long as that policy version
remains in effect
What is needed to complete the reference is a function We
will name this function fTRI It has the following syntax:
fTRI(PTN, TFK, [eff-beg-dt – eff-end-dt])
PTN is the name of the parent table which this TFK points to
Given the TFK and effective time period of a version in a TRI
child table, the AVF searches the parent table for an episode
whose versions have that oid as part of their primary key, and
whose effective time period fully includes the effective time
period designated by the function If there is such an episode,
it is the TRI parent episode of that version, and the fTRI function
Trang 6evaluates to True If there is no such episode, then the function evaluates to False, and that version will never be added to the database because if it were, it would violate TRI
If the AVF finds such an episode, in carrying out this function, it does not have to check further to insure that there is only one such episode If there were more than one, then those episodes would be
in TEI conflict across all their clock ticks which [intersect] The AVF does not allow TEI violations to occur, so if there is a TRI parent epi-sode for the TFK reference, there is only one of them
For example, the oid value in the TFK of P861-A(2) picks out client C903 Before the AVF added that version to the database,
it used the fTRI function to determine whether or not it was ref-erentially valid.6 That TRI validation check would look some-thing like this:
IF ISTRUE( fTRI(Client, C903, [Jul10 – 9999])) THEN {add the version}
ELSE {notify the calling program of a TRI error}
ENDIF
Together, the explicit and implicit parts of the TFK, its data ele-ment part and its function part, complete an unambiguous refer-ence from a TFK to the one episode which satisfies the TRI constraint on the relationship from that version to that episode Note that this description of a TFK is a semantic description, not
an implementation-level description The fTRI function is one component of a TFK Its representation here is obviously not source code that could be compiled or interpreted But however it is expressed, whether in the AVF or in some other framework based
on these concepts, it is a function; and without it, the columns of data we call TFKs are not TFKs Those columns of data are simply those components of TFKs which can be expressed as data
Temporal Transactions and Associative Tables
In a non-temporal database, an associative table, often infor-mally referred to as an xref table, implements a many-to-many relationship between two other tables Each of those other tables
6 This is a logical description of what the AVF does It does not imply that the AVF code makes a single function call to carry out its TRI checks, let alone that it calls a function named fTRI.
Trang 7is a parent to the xref table, which is thus RI dependent on both
of them
Each row in the xref table has two FKs, one to a parent row in
one table and one to a parent row in another table (or, possibly,
in the same table) As we already know, this dual RI dependency
means that a row cannot be inserted into the xref table unless
both its parent rows already exist in the database, and neither
parent row can be deleted as long as that xref row remains in
the database
TRI with Multiple TFKs
If a child version has two or more TFKs, the effective
timespan of an episode of each of the objects which those TFKs
reference must fully include the effective timespan of the
ver-sion If either of them did not, that would be a TRI violation
So consider an associative asserted version table, whose
vers-ions each contain two TFKs What of the Allen relatvers-ionships
between the two parent episodes related by any version in this
table? Are there any constraints on those parent episodes?
In fact, there are Those two effective timespans must
[inter-sect] If they did not [intersect], then there would be no clock
tick when both were in effect, and so no clock tick in which an
xref row, TRI dependent on both parents, could exist
Consider an example in which we have a customer episode
C773-B with an effective timespan from March 2013 until further
notice, which we will write as C773-B[Mar 2013 – 12/31/9999],
and also a salesperson episode S217-D[Sep 2013 – Dec 2013]
What can we say of the effective timespan of a version in an
asserted version associative table relating that customer episode
to that salesperson episode?7
First, that associative table version cannot have an effective
begin date prior to September 2013 because that would make
the start of its effective time period earlier than the start of
S217-D By the same token, that version cannot have an effective
end date after December 2013 because that would make the end
of its effective time period later than the end of S217-D
So knowing what we do of the two parent episodes, what is
the maximum effective timespan that would be valid for the
7 As a complete aside, we note that the in-line notations developed in Chapter 6 and
elsewhere in this book, for example the S217-D[Sep 2013 – Dec 2013] notation
developed in this chapter, might be the basis for a degree of automated semantic
interoperability between structured and semi-structured representations of temporal
data.
Trang 8child version? It is the later of the two parents’ begin dates, and the earlier of their end dates This gives a maximum effective timespan of the xref table child version of [Sep 2013 – Dec 2013], which happens to be the effective timespan of its parent salesperson episode This is because the salesperson episode occurs [during] the customer episode
Next, let’s consider an example that does not involve 12/31/
9999 Suppose that the effective timespans of our parent episodes are like this: C773-B[Mar 2013 – Jun 2013] and S217-D [Sep 2013 – Dec 2013] Using our earlier/later rule, the maximum effective timespan of the xref version happens to be the same as
it was in the previous case: [Sep 2013 – Dec 2013]
But this isn’t the end of the story In our first example, the two parent episodes [intersected], and the timespan during which they intersected was that widest timespan possible for the child version But in this second example, the parent episodes do not [intersect] C773-B ceases being in effect three months before S217-D begins to be in effect
An associative table version cannot have two non-intersecting TRI parents because there would then be no effective time clock ticks shared by the parents, and therefore no clock ticks in which both TRI relationships are satisfied
In summary: the effective timespan of an xref row must be fully included in the effective timespans of both of its parent episodes It follows that if there are no effective time clock ticks which those parent episodes have in common, no version which
is TRI dependent on both of them can exist in the database It also follows that if there are one or more clock ticks which those two parent episodes do have in common, the widest extent of the effective time period of the TRI dependent version is pre-cisely that set of [intersecting] clock ticks
Temporal Delete Options
The three options for standard delete transactions are (i) RESTRICT, (ii) SET NULL, and (iii) CASCADE As applied to tem-poral delete transactions, the RESTRICT option is straightfor-ward For example, suppose there is a RESTRICT option on deletes applied to the Client table, and suppose that the data-base is populated as shown inFigure 11.1 Episode C903-B could
be deleted in its entirety because no policies are dependent on it Episode C882-A could be deleted from the single clock tick January 2010, or from July 2010 through April 2011 because the resulting episode, removed from any of those months, will still
Trang 9satisfy the TRI relationship from P861-A(1) But an attempt to
remove client C903 from January 2011, for example, would be
restricted because a dependent child—P861-A(2)—is TRI
depen-dent on it during that month
As for the SET NULL option, its temporal form is not as
straightforward It means that if a temporal delete would violate
a TRI constraint, and the SET NULL option is in effect for that
table, then the TFK in the child row that would otherwise be
orphaned will be set to NULL In the last example just
men-tioned, if the delete option was SET NULL, episode C903-A
would be split into two episodes by removing it from January
2011 P-861A(2) would be split into three versions, with effective
time periods of [Jul 2010 – Jan 2011], [Jan 2011 – Feb 2011] and
[Feb 2011 – May 2011] The TFK in the middle of the three
ver-sions would then be set to NULL
But the temporal form of the CASCADE option is both
mechan-ically and semantmechan-ically even more complex than this As for its
semantics, a temporal delete cascade will attempt to remove both
the parent object, and all its dependent children, from the clock
ticks specified in the transaction For example, if we specified a
temporal delete cascade on client C882 for the effective time period
[Jul 2012 – Jan 2013], we would find that episode P861-B would
be subject to a {shorten backwards} transformation for those six
clock ticks This would remove P861-B(6) from current assertion
time, and would also shorten P861-B(5) by one clock tick But this
should cause no concern We already understand the mechanics
of temporal extent state transformations
Temporal Referential Integrity Applied to
Temporal Transactions
A Temporal Insert Transaction
Let’s assume that the Client and Policy tables are as shown in
Figure 11.1, and let’s begin by considering a temporal insert of
P861 which has a TFK of C903 In order to satisfy TRI constraints,
every clock tick in the effective time period specified on the
trans-action must already be occupied by C903 So there are only a
lim-ited number of effective time spans that can validly be specified by
a temporal insert transaction, in this situation They are:
(i) The three months of [Feb 2013 – May 2013], or the two
months of [Mar 2013 – May 2013] or the month of [Apr
2013 – May 2013], each of which will {lengthen P861-C
backwards}
Trang 10(ii) The two months of [Feb 2013 – Apr 2013], which will create a new episode between P861-B and P861-C
Let’s be sure we understand why these are the only possibilities To begin with, the existing episodes of C903, the parent object, cover the effective time clock ticks [Apr 2010 – May 2011], [Apr 2012 – Sep 2012] and [Feb 2013 – 12/31/9999]
So if all the clock ticks in a new version of P861 fall anywhere within any one of those three ranges, that version will satisfy TRI; and otherwise, it won’t
However, this is a temporal insert transaction, and therefore none of the clock ticks in the new version being created can already be occupied by another version of P861 This is the TEI constraint applied to temporal insert transactions This rules out [Feb 2010 – May 2011], [Nov 2011 – Jan 2013] and [May
2013 – 12/31/9999] So, eliminating these clock ticks that are already occupied by P861 from the clock ticks occupied by C903, we are left with only the three clock ticks of February, March and April 2013
A Temporal Update Transaction
By definition, temporal updates neither add a representation
of an object to a clock tick nor remove a representation of an object from a clock tick But they can still cause temporal refer-ential constraints to be violated They can do so by changing the TFK value in one or more clock ticks
For example, suppose a temporal update is submitted which specifies that in November and December of 2012, P861’s owning client should be C903 The transaction looks like this:
UPDATE Policy [P861, C903,, ] Nov 2012, Jan 2013
The problem is that there is no representation of C903 in either
of those two clock ticks The function fTRI(Client, C903, [Nov12 – Jan13]) will evaluate to False Therefore, the AVF will restrict this transaction because of TRI constraints This is the equivalent of working with a non-temporal table, and trying to change a FK value to point to a parent row that does not, at that time, exist
A Temporal Delete Transaction
A temporal delete withdraws its target object from one or more effective time clock ticks In the process, it may {erase}
an entire episode from current assertion time, or {split} an epi-sode in two, or {shorten} an epiepi-sode either forwards or back-wards, or do several of these things to one or more episodes with one and the same transaction