2
adapt them dynamically to changes [15, 27, 29]. The source of graph data could come
from relational, XML, or JSON that exist in the different databases.
The history of the database evolution has shown that new applications often require
new data models leading to extended infrastructure of DBMSs with new query lan-
guages over these new data models. One existing solution is the polyglot persistency
approach, which leverages numerous DBMSs to support different data models and
integrates them programmatically at the application layer. The biggest issue of poly-
glot persistency is that the combined DBMSs is neither declarative nor unified. It
leaves database application to procedurally join data among multiple data models and
manually transform among data model instances. Instead of putting the burden on
applications, it is more desirable to have a unified single DBMS, which hides the
complexity of multiple data models by providing declarative approach of querying
multi-model data instances and just-in-time data model transformation.
In this position paper, we advocate a multi-model database management system
(MMDBMS) that has the ability to incorporate any data model and allows users to
manipulate all data models declaratively. Users are able to explore the real power of
an MMDBMS by leveraging its ability to autonomously transform data from one data
model to another. MMDBMSs allow data providers and data consumers to look at the
same data using different models depending on their most effective view.
MMDBMSs accomplish these data model transformation autonomously on behalf of
users.
We argue that the design of a full-fledged MMDBMS requires a more powerful math-
ematical foundation. The last few decades have witnessed a tremendous success of
RDBMSs leveraging the relational algebra as theoretical foundation and therefore
limiting this foundation to relational data. We recognize the same data can be repre-
sented relationally, hierarchically, graphically and are thus queryable by SQL,
XQuery, Property-Graph Query Language respectively. Therefore, we feel the need of
having a new theoretical foundation to provide transparent data model and query lan-
guage transformations among those data models and languages. In other words,
MMDBMSs require a powerful mathematical foundation to reason about declarative
data model transformation among multiple data models. In this paper, we promote
category theory [5,14] shall be able to play the role of the new mathematical founda-
tion to reason declarative construction and transformations among various data mod-
els.
In addition, this paper describes a set of shared data infrastructure services. The
shared services not only include essential common data services, such as transaction,
recovery, security, high availability but also include integrating artificial intelligence
to provide “Just-In-Time” data model access and telemetry service to promote multi-
model situation awareness service [4].
Organization The remainder of this paper is structured as follows: Section 2 intro-
duces the preliminaries on categories and examples of model transformation. Section
3 presents category theory as the mathematical foundation for MMDBMS. Section 4
illustrates MMDBMS infrastructure services. Section 5 shows related work and sec-
tion 6 concludes the paper.
评论