[Day 151] Reading more about taxi OD matrix prediction architectures + more Scottish dataset audio included

 Hello :)
Today is Day 151!


A quick summary of today:
  • my collaborator recorder more audio and the Scottish (Glaswegian) dataset is growing
  • read more papers related to OD (origin-destination) matrix prediction for taxis


The process of adding data to huggingface is faster now, so it is going great. We are up to ~18 mins.


Secondly, about the papers from today

First paper is GNN for Traffic Forecasting - A survey

Problems

Traffic flow

Traffic flow is the number of vehicles that pass through a spatial unit, such as a road segment or traffic sensor point in a given time slot. Three types are considered: road-level, region-level and station-level.

Traffic speed

This is the average speed of vehicles passing through a spatial unit in a given time slot. Traffic speed problems are categorized into road-level and region-level issues, including travel time and congestion predictions. Traffic congestion prediction aids in optimizing road conditions and vehicle flow. Challenges vary between urban roads and freeways, with urban roads posing more complexities due to intricate connection patterns, speed limit variations, and spatial dependencies.

Traffic demand

Refers to the potential demand for travel. An example of demand is the amount of requests sent through taxi hiring apps (whether all the demand is met is different).


Graph Neural Networks

GNNs are currently the SoTA in traffic forecasting. Variations include recurrent GNNs, convolutional GNNs, graph autoencoders, and spatiotemporal GNNs, the last being used in this case since traffic forecasting is a spatiotemporal problem.

Open data and Source codes

Open data

Three types - graph-related data(transportation network data), historical traffic data(historical traffic state records), and external data(weather and calendar data). Links to traffic sensor data, taxi data, ride-hailing data, bike data, and subway data used in surveyed studies.

Open source codes

Links available to repositories.

Challenges

Heterogeneous data

  • data quality
  • incomplete data
  • traffic anomalies data is difficult to collect to train DL models
  • data privacy

Multi-task performance

  • Multi-task frameworks in ITSs are needed to predict demand for multiple transportation modes simultaneously.
  • Knowledge adaptation helps transfer information from data-rich to data-sparse sources for demand prediction.
  • Challenges include data format incompatibilities and differing spatial or temporal patterns.
  • Most surveyed models are designed for single-task training, though some handle multiple tasks like traffic flow and speed prediction on the same road segment.
  • Multi-task forecasting is challenging in graph-based modeling due to different graph structures for different tasks (e.g., road-level vs. station-level problems).
  • Some GNN-based models have tackled multi-task prediction (e.g., taxi departure/arrival flow, region-flow, transition-flow, crowd flows, OD flows).
  • Existing attempts typically use the same graph with multiple outputs via feed forward layers.
  • Significant further development is needed for GNN-based multi-task prediction, especially for tasks requiring multiple graph structures.

Practical implementation

  • Data Bias: Existing GNN-based studies often use less than one year of data, introducing bias and limiting applicability to different times or places. Using longer traffic data requires updating underlying traffic infrastructures, increasing costs and complexity.
  • Computation Scalability: GNNs face scalability issues with large-scale traffic network graphs, often leading to consideration of only subsets of nodes and edges. Solutions like graph partitioning and parallel computing have been proposed, but they provide only marginal performance improvements over simpler models.
  • Infrastructure Changes: Real-world network graphs change with modifications in transportation infrastructure (e.g., new road segments, bus lines, points-of-interest). Static graph formulations are inadequate. Solutions include dynamic Laplacian matrix estimators and Data Adaptive Graph Generation (DAGG) modules to handle changing spatial dependencies.

Model interpretation

  • Interpreting "black-box" machine learning or deep learning models, including GNNs, is a major criticism in traffic forecasting.
  • Early-Phase Techniques: Techniques for explaining GNN predictions are still in the early stages of development and have not yet been applied to traffic forecasting.
  • Severity in Transportation: The lack of model interpretation is particularly problematic in transportation due to complex and heterogeneous traffic data, making it harder to design interpretable models compared to simpler data formats like images and text.
  • Current Efforts: Some efforts, such as incorporating state space models, have been made to improve model interpretation in traffic forecasting, but the issue remains largely unresolved, especially for GNN-based models.

Future directions

  • creating a centralised data depository
  • designing a transportation knowledge graph that leverages the traffic semantic information to improve forecasting performance
  • using other techniques
    • data augmentation
    • transfer learning
    • meta learning
    • Generative Adversarial Networks (GANs)
    • AutoML
    • Bayesian Networks

Applications in real-world ITS systems

Most GNN-based studies are based on simulations with historical traffic data, and not used in real-world applications to test their validity.


Second paper is Taxi origin and destination demand prediction based on deep learning: a review

Introduction

When it comes to graphs and taxi demand prediction - node-level predictions are for number of trips for that particular node(region), while edge-level predictions are for demand relationships between two nodes(regions).

Mathematical statistical methods

Statistical models are based on historical and time series data. The most widely applied once are HA, ARMA, MA, ARIMA, and the Kalman filtering model. However, such models lack in the ability to work with high dimensional, spatiotemporal data such as taxi data.

Traditional machine learning methods

Common ones are linear regression, SVM, decision trees, RF, artificial neural networks. In these models feature selections directly influences their accuracy, and while they improve upon the statistical methods, they do not effectively solve the nonlinear correlation of complex multidimensional data.

Deep learning background

CNN, RNN, GCN are commonly used for extracting temporal and spatial information. Some papers also use multi-task learning, residual networks and other methods to get better accuracy. To make demand and OD matrix predictions, historical regional taxi data with both temporal and spatial information is needed. Below, the review focuses on four areas: spatial topology construction, spatial-dependent modelling, time-dependent modelling, and other factors.

Spatial topology construction

Different DL methods require different spatial topologically structured data. CNNs process raster data, while GCNs - graph data.

Raster data

For CNNs, the area of interest is partitioned into non-overlapping n amount of grids of predetermined size. However, transportation data has spatiotemporal attributes and non-Euclidean structural characteristics which renders the grid structure powerless. In addition, if the raster data is small, the area of interest may be split and result in higher data volume that increases the difficulty of prediction. If the raster is too large, then it might be harder to extract demand features and we will end up with reduction in accuracy.

Graph

For GCNs, the travel demand data is transformed into images (non-Euclidean spatial data). There are two graph structures: static and dynamic graphs. Firstly, the graph is constructed with the OD pairs serving as the nodes, and the features of the nodes and edges are included in the prediction network model.

For GCNs, the travel demand data is transformed into images (non-Euclidean spatial data). There are two graph structures: static and dynamic graphs. Firstly, the graph is constructed with the OD pairs serving as the nodes, and the features of the nodes and edges are included in the prediction network model.

  • Static graph The model assumes that the graph structure is constant. To construct the OD matrix we can use methods like distance measures or Gaussian kernels. Otherwise we can construct a binary adjacency matrix using information about which nodes are connected. Studies have also looked into adding auxiliary features like connectivity maps, semantic function maps, weather, distance maps, traffic connectivity maps.
  • Dynamic graph There are two types: (i) nodes and edges continually change over time, and (ii) node and edge properties vary over time. Traditional graph representation learning frameworks generate static representations and overlook the dynamic nature of transportation data. Taxi demand data is spatiotemporal so dynamic graphs can better represent it's nature. There are two construction methods:
    • discrete-time dynamic graphs (DTDG): it defines a fixed length r, and updates the embedding at each t time unit. The result is a dynamic adjacency matrix or a sequence of multiple graphs. Each graph is like a snapshot at a particular time slot. DTDG relies on the length r. A loose r can result in missing useful information like trends, whereas a more tight r can lead to extra noise. Therefore the DTDG method suffers from loss of information due to the fixed (discrete) segmentation of the OD stream information.
    • continuous time dynamic graphs (CTDG): here the node representation is updated based on event data which can include type of event, the location, and its time. An example of an OD request can be described by a tuple (e, l ,t), where e is the type of OD request, l is the location, and t is the timestamp. This method can only capture the time dependence of finite time steps. Also, due to the spatiotemporal demand imbalance, the OD matrix can be very sparse in some regions.

Spatial dependency

Traditional CNNs work on Euclidean structured data and lack the ability to handle the opposite. Transportation networks have spatiotemporal attributes which are well-handled by GCNs.

Temporal dependency

Three main architectures used - RNNs, Transformer, and TCN (Temporal Convolutional Network)

Other factors

External characteristics

Taxi demand can be affected by various auxiliary factors like weather.

Model helper methods

Papers have used methods like attention, multi-task learning, and ResNet networks

Challenges

Challenge 1: Representation of dynamic correlations in OD flow

It is common for the relationship between two regions to change over time (peak vs non-peak hours). Static graphs cannot capture this, so research has looked into different ways of using dynamic graphs for that purpose. papers to read

Challenge 2: Spatial-temporal correlation

Spatiotemporal correlation means that each node can influence its neighbours at the next time slot. Spatiotemporal heterogeneity comes from the fact that the OD flow is different in cases like morning, evening, downtown, or city outskirts. At the time of writing, using two independent components to capture both spatial and temporal dependencies in a chained prediction often fails to capture this correlation and heterogeneity. papers to read

Challenge 3: Differentiation of different semantics of origin and destination

In complex and irregular transportation networks, passenger demand between OD pairs can be both geographically and semantically correlated, with directed and bidirectional relationships. Modeling demand separately for origins and destinations ignores the flow relationship between OD pairs and lacks practical application. Moreover, considering only the distance and flow information between grids, without distinguishing between origin and destination, overlooks the directionality of OD flows and the varying attraction relationships at different times. papers to read

Challenge 4: Time window selection

The predominant method for predicting OD flows is the discrete dynamic graph approach, which aggregates historical transactions into demand snapshots within fixed time windows. This method results in disconnected OD flows and lacks rigor due to the arbitrary choice of time granularity, which can introduce noise or overlook important information. Continuous-time dynamic graph methods offer a solution by maintaining dynamic state vectors for traffic nodes, but they face challenges in updating and maintaining representations for numerous continuous-time nodes. Recent advancements include using a spatiotemporal attention network and developing a continuous-time dynamic graph framework to improve prediction accuracy and capture complex time patterns. papers to read

Challenge 5: Data sparseness problem solving

The time sequence associated with each OD pair involves intricate spatial dependencies. Discrete dynamic graph-based prediction methods often result in information loss and a high frequency of zero values. Continuous-time dynamic graph-based prediction methods face challenges due to sparse data for certain OD pairs, which is exacerbated by the quadratic increase in predicted OD demand. papers to read

Going forward

Spatiotemporal dynamic correlation

  • An individual node is affected by interactions with its surroundings, and randomness of the network itself. Currently this is tackled using attention mechanisms, but this can be explored further

External information addition

  • External factors like holidays, weather, points of interest, large events, and traffic accidents can also significantly affect taxi demand. Introducing such auxiliary information is rare in current research.

Regional division

  • Current papers split the region of interest into a set of grids or traffic zones. But this method can lack in understanding all relationships between the data, so a more rigorous approach can be developed.

OD data sparseness and data overload

  • Data sparsity affects the model, and future research can look into how to handle such sparsity in the OD matrix.

Technical related issues

  • Most papers use RNNs, GCNs, a few use GATs and GAEs. Future research can look into how we can use GNNs for OD demand prediction problems better


On another note, tonight I have to stay up till 2am because I signed up to give a short ppt on text2chart (which I created some time ago) in a project group organised by Stanford AI professional certificate students. I uploaded the ppt pdf to the github repo. 


That is all for today!

See you tomorrow :)

Popular posts from this blog

[Day 198] Transactions Data Streaming Pipeline Porject [v1 completed]

[미리 공부] 기초 통계 복습 (Day 1는 1월2일)

[Day 61] Stanford CS224N (NLP with DL): Machine translation, seq2seq + a side CDCGAN mini project