The curse of dimensionality.
Dimensionality Reduction is reducing the “dimension” of the data set. In time series analysis, the dimension is the number of data points.
PAA
Reduce the number of points by taking the average value for each segment. Given points, we want segments → Reduced PAA → Reconstructed PAA.
Length of each segment = .
Example 1:
Limitations
APCA
An adaptive PAA; it is called adapative, because it allows segments to be of variable lengths.
Haar Discrete Wavelet Transform (DWT)
- Find DWT Coefficients
- Truncate DWT Coefficients
- Reconstruct DWT
- Truncate if padded
- Replace approximates with actual averages
If mismatch in desired segments and achieved segments:
- will have to merge segments
- we merge adjacent segments with least deviation in value
- merging is averaging
- repeat this process until desired segments = achieved segments
Example: