论文标题
repad:实时积极主动检测时间序列
RePAD: Real-time Proactive Anomaly Detection for Time Series
论文作者
论文摘要
在过去的十年中,在不同领域(例如网络监测,欺诈检测和入侵检测)中引入了许多异常检测方法。但是,他们需要了解数据模式,并且通常需要长时间的离线时间来构建目标数据的模型或网络。在没有人类干预和领域知识的情况下,为流媒体时间序列提供实时和主动的异常检测是非常有价值的,因为它大大减少了人类的努力,并在发生灾难性的损害,失败或其他有害事件之前可以进行适当的对策。但是,这个问题尚未得到很好的研究。为了解决这个问题,本文提出了repad,这是一种基于长期记忆(LSTM)的流动时间序列的实时主动的异常检测算法。 Repad利用短期历史数据点来预测和确定即将到来的数据点是否表明在不久的将来可能发生异常。通过动态调整检测阈值,随着时间的推移,Repad能够在时间序列中忍受较小的模式变化,并主动或按时检测异常。基于从NUMENTA异常基准收集的两个时间序列数据集的实验表明,Repad能够主动检测异常并实时提供早期警告,而无需人工干预和领域知识。
During the past decade, many anomaly detection approaches have been introduced in different fields such as network monitoring, fraud detection, and intrusion detection. However, they require understanding of data pattern and often need a long off-line period to build a model or network for the target data. Providing real-time and proactive anomaly detection for streaming time series without human intervention and domain knowledge is highly valuable since it greatly reduces human effort and enables appropriate countermeasures to be undertaken before a disastrous damage, failure, or other harmful event occurs. However, this issue has not been well studied yet. To address it, this paper proposes RePAD, which is a Real-time Proactive Anomaly Detection algorithm for streaming time series based on Long Short-Term Memory (LSTM). RePAD utilizes short-term historic data points to predict and determine whether or not the upcoming data point is a sign that an anomaly is likely to happen in the near future. By dynamically adjusting the detection threshold over time, RePAD is able to tolerate minor pattern change in time series and detect anomalies either proactively or on time. Experiments based on two time series datasets collected from the Numenta Anomaly Benchmark demonstrate that RePAD is able to proactively detect anomalies and provide early warnings in real time without human intervention and domain knowledge.