
Efficient algorithms for reachability and path queries on temporal bipartite graphs
1.2.2 Tracing metabolic pathways
In biochemistry, cellular metabolism, which is a set of bio-
chemical reactions among metabolites, is crucial to maintain
the life of organisms [34]. Specifically, the metabolites serve
as reactants and regulate the circadian rhythm in a cell at dif-
ferent times [24]. If two metabolites participate in a reaction
simultaneously, they will give rise to a product, which may
become the substrate for another reaction. A metabolic path-
way is a linked series of such reactions, and its end product
is crucial for anabolism and catabolism [34]. In the cellu-
lar metabolism, metabolites and reactions form a temporal
bipartite graph, in which an edge indicates that a metabolite
participates in a specific reaction [41]. Accordingly, a path
in the temporal bipartite graph serves as a representation of
the metabolic pathway, delineating the journey from an initial
metabolite (e.g., glucose) to a pivotal end product (e.g., ATP)
in cellular metabolism. This modeling approach unveils the
sequence of reactions and intermediate compounds, provid-
ing essential insights into the intricate processes of energy
production within cells. Note that metabolic pathways in a
cell can also be dynamic, constantly changing in response to
various internal and external stimuli.
1.2.3 Modeling the flow of information
In online social networks, people usually post content
through online social media (e.g., Facebook and Instagram).
Here, an edge connects a user to a post, and the timestamp
indicates when the user interacts with or creates the content.
Note that when a user engages in discussions about topics
like new ideas, technologies, and products, she/he has a great
potential to disseminate the information to other groups of
people [27]. Thus, the proposed temporal bipartite reacha-
bility can be utilized to model the flow of information and
help us understand how information spreads in a network.
Specifically, reachability and path query can trace the flow
of a specific piece of information from its source (e.g., Alice,
who initially shares details about a new technology) to a key
influencer (e.g., Bob, a prominent tech blogger). This reveals
how the information cascades through various user interac-
tions and how it gains prominence in the network. Note that
social networks contain extensive data due to their expan-
sive user bases, with new interactions frequently emerging,
necessitating ongoing incremental maintenance.
1.2.4 Characterize trading relationships
In the literature, analyzing the trading information between
investors and stocks is useful for uncovering the statistical
properties of stock trading networks [22, 42]. N It’s notewor-
thy that the investor-stock trading network can be naturally
depicted as a temporal bipartite graph, where each tempo-
ral edge signifies a trading action between an investor and
a stock. In this context, a time-overlapping wedge indicates
that two investors are involved in trading the same stock at
a similar time, implying a congruent trading action for these
two investors. Performing reachability and path queries in
such networks can help in characterizing the trading rela-
tionships among investors and comprehending the evolving
dynamics of trading over time, which can provide valuable
insights for downstream tasks such as market sentiment ana-
lytics and risk management.
1.3 Applying existing techniques
To answer the temporal bipartite reachability and path
queries, one may consider projecting the temporal bipar-
tite graph into a temporal unipartite graph and extending
the existing techniques on temporal unipartite graphs [49,
55, 63] to solve the problems. Despite projection provides
a possible solution, size inflation and information loss are
two of its main drawbacks as evaluated in [35, 37]. In our
experiments, extending the state-of-the-art technique on tem-
poral unipartite graphs [55] is inefficient when answering
both single-pair and single-source reachability queries. Fur-
thermore, the existing algorithms [49, 55, 63] are hard to
answer the earliest-arrival path queries since the informa-
tion of one vertex layer is lost after projection. Even though
auxiliary information can be recorded for each edge in the
projected graph, it is obviously time-prohibitive to retrieve
paths by searching from these information. Alternatively, to
answer queries directly based on temporal bipartite graphs, a
straightforward BFS-based solution can be devised. How-
ever, the algorithm has a large search scope, making it
impractical to support online queries on large graphs.
1.4 Our approaches
In this paper, we focus on indexing-based approaches to
efficiently answer temporal bipartite reachability and path
queries. We first propose an index structure, namely TBP-
Index, which is based on the well-known notion of 2-hop
labeling [6, 8, 17] and can efficiently support all possible
single-pair reachability queries. To efficiently answer the
single-source reachability query and the earliest-arrival path
query, we further investigate how to extend the TBP-Index
with negligible extra costs such that the above two queries
can be answered without having to iterate over each vertex
or inspecting the original graph.
To efficiently compute the TBP-Index, we propose a
novel index construction algorithm, namely TBP-build
∗
,
which incorporates two optimization techniques, i.e., time-
priority-based traversal and temporal-based edge partition.
In brief, the time-priority-based traversal technique explores
the containment relationship among time intervals. Uti-
123
相关文档
评论