Bibliographic Details
Title: |
Statistically Discriminative Sub-trajectory Mining |
Authors: |
Duy, Vo Nguyen Le, Sakuma, Takuto, Ishiyama, Taiju, Toda, Hiroki, Nishi, Kazuya, Karasuyama, Masayuki, Okubo, Yuta, Sunaga, Masayuki, Tabei, Yasuo, Takeuchi, Ichiro |
Publication Year: |
2019 |
Collection: |
Computer Science Statistics |
Subject Terms: |
Statistics - Machine Learning, Computer Science - Machine Learning |
More Details: |
We study the problem of discriminative sub-trajectory mining. Given two groups of trajectories, the goal of this problem is to extract moving patterns in the form of sub-trajectories which are more similar to sub-trajectories of one group and less similar to those of the other. We propose a new method called Statistically Discriminative Sub-trajectory Mining (SDSM) for this problem. An advantage of the SDSM method is that the statistical significance of the extracted sub-trajectories are properly controlled in the sense that the probability of finding a false positive sub-trajectory is smaller than a specified significance threshold alpha (e.g., 0.05), which is indispensable when the method is used in scientific or social studies under noisy environment. Finding such statistically discriminative sub-trajectories from massive trajectory dataset is both computationally and statistically challenging. In the SDSM method, we resolve the difficulties by introducing a tree representation among sub-trajectories and running an efficient permutation-based statistical inference method on the tree. To the best of our knowledge, SDSM is the first method that can efficiently extract statistically discriminative sub-trajectories from massive trajectory dataset. We illustrate the effectiveness and scalability of the SDSM method by applying it to a real-world dataset with 1,000,000 trajectories which contains 16,723,602,505 sub-trajectories. |
Document Type: |
Working Paper |
Access URL: |
http://arxiv.org/abs/1905.01788 |
Accession Number: |
edsarx.1905.01788 |
Database: |
arXiv |