When to Update Sequential Patterns in Stream Data?: Difference between revisions

From Simple Sci Wiki
Jump to navigation Jump to search
Created page with "Title: When to Update Sequential Patterns in Stream Data? Abstract: This research investigates the optimal time to update sequential patterns in stream data. It proposes a method to determine when to update sequential patterns by balancing the performance of increasingly updating algorithms and the difference between old and new patterns. Experiments on two data sets show that the optimal size of incremental windows to update is approximately 15-25% of the initial windo..."
 
No edit summary
 
(One intermediate revision by the same user not shown)
Line 1: Line 1:
Title: When to Update Sequential Patterns in Stream Data?
Title: When to Update Sequential Patterns in Stream Data?


Abstract: This research investigates the optimal time to update sequential patterns in stream data. It proposes a method to determine when to update sequential patterns by balancing the performance of increasingly updating algorithms and the difference between old and new patterns. Experiments on two data sets show that the optimal size of incremental windows to update is approximately 15-25% of the initial window size.
Abstract: This research investigates the optimal time to update sequential patterns in stream data. It proposes a method called TPD (Tradeoff between Performance and Difference) to determine when to update sequential patterns by making a tradeoff between the performance of increasingly updating algorithms and the difference of sequential patterns. The study uses experimental data from GSM alarm data sets to verify the proposed method.


Main Research Question: How can we determine the optimal time to update sequential patterns in stream data while maintaining the performance of increasingly updating algorithms?
Main Research Question: How can we determine the optimal time to update sequential patterns in stream data while balancing the performance of increasingly updating algorithms and the difference of sequential patterns?


Methodology: The study first defines a distance measure to determine the difference between old and new sequential patterns. It then proposes a method to decide when to update sequential patterns by considering the trade-off between the performance of increasingly updating algorithms and the difference in patterns.
Methodology: The study first defines a distance measure between old and new sequential patterns, which is proven to be a distance. It then proposes the TPD method to decide when to update sequential patterns by making a tradeoff between the performance of increasingly updating algorithms and the difference of sequential patterns.


Results: Experiments on two data sets demonstrate that as the size of incremental windows grows, the values of the speedup and the differences in patterns decrease and increase, respectively. The study shows that the optimal size of incremental windows to update is around 15-25% of the initial window size.
Results: Experiments on two GSM alarm data sets show that, as the size of incremental windows grows, the values of the speedup (performance) and the values of the difference (between old and new sequential patterns) decrease and increase respectively. The study also finds that the incremental ratio determined by the TPD method does not monotonically increase or decrease but changes in a range between 20 and 30 percentage for the IUS algorithm.


Implications: This research provides a practical solution to the problem of determining when to update sequential patterns in stream data. It helps data miners balance the need for updating patterns with the computational resources available. The proposed method can be applied to various domains, such as finance, healthcare, and e-commerce, where real-time data analysis is crucial.
Implications: The TPD method provides a practical solution to the problem of determining when to update sequential patterns in stream data. It helps balance the performance of increasingly updating algorithms and the difference of sequential patterns, ensuring that updates are made when necessary without overburdening resources. This can lead to more efficient data mining processes in real-time applications.


Link to Article: https://arxiv.org/abs/0203028v1
Link to Article: https://arxiv.org/abs/0203028v3
Authors:  
Authors:  
arXiv ID: 0203028v1
arXiv ID: 0203028v3


[[Category:Computer Science]]
[[Category:Computer Science]]
[[Category:Sequential]]
[[Category:Patterns]]
[[Category:Patterns]]
[[Category:Data]]
[[Category:Update]]
[[Category:Update]]
[[Category:Data]]
[[Category:Between]]
[[Category:Sequential]]
[[Category:Size]]

Latest revision as of 04:19, 24 December 2023

Title: When to Update Sequential Patterns in Stream Data?

Abstract: This research investigates the optimal time to update sequential patterns in stream data. It proposes a method called TPD (Tradeoff between Performance and Difference) to determine when to update sequential patterns by making a tradeoff between the performance of increasingly updating algorithms and the difference of sequential patterns. The study uses experimental data from GSM alarm data sets to verify the proposed method.

Main Research Question: How can we determine the optimal time to update sequential patterns in stream data while balancing the performance of increasingly updating algorithms and the difference of sequential patterns?

Methodology: The study first defines a distance measure between old and new sequential patterns, which is proven to be a distance. It then proposes the TPD method to decide when to update sequential patterns by making a tradeoff between the performance of increasingly updating algorithms and the difference of sequential patterns.

Results: Experiments on two GSM alarm data sets show that, as the size of incremental windows grows, the values of the speedup (performance) and the values of the difference (between old and new sequential patterns) decrease and increase respectively. The study also finds that the incremental ratio determined by the TPD method does not monotonically increase or decrease but changes in a range between 20 and 30 percentage for the IUS algorithm.

Implications: The TPD method provides a practical solution to the problem of determining when to update sequential patterns in stream data. It helps balance the performance of increasingly updating algorithms and the difference of sequential patterns, ensuring that updates are made when necessary without overburdening resources. This can lead to more efficient data mining processes in real-time applications.

Link to Article: https://arxiv.org/abs/0203028v3 Authors: arXiv ID: 0203028v3