论文标题

通过定性更改对时间序列数据进行罕见互动建模:在重症监护单元中的结果预测应用

Modeling Rare Interactions in Time Series Data Through Qualitative Change: Application to Outcome Prediction in Intensive Care Units

论文作者

Ibrahim, Zina, Wu, Honghan, Dobson, Richard

论文摘要

许多研究领域的特征是大规模高度的时间序列数据泛滥。但是,使用可用于预测和决策的数据,当前滞后在我们揭示和量化了解释结果的真实互动的能力方面受到阻碍给定的一组变量的特定值(或值范围),而是由于这些值与随着时间的时间记录的正常状态的偏差,iv)需要解释模型的预测。在这里,尽管已经为结果预测制定了许多数据挖掘模型,但他们无法解释其预测。 我们提出了一个模型,用于发现与高度时间序列数据中产生结果的最高可能性相互作用的模型。变量之间的相互作用由关系图结构表示,该图形结构依赖于定性抽象来克服非均匀的采样,并捕获与随时间变化的变化和偏差相对应的相互作用的语义。使用这样的假设,即小型相互作用的类似模板是导致结果的原因(在医疗领域中很普遍),我们重新制定了发现任务,以从数据中检索最明显的模板。

Many areas of research are characterised by the deluge of large-scale highly-dimensional time-series data. However, using the data available for prediction and decision making is hampered by the current lag in our ability to uncover and quantify true interactions that explain the outcomes.We are interested in areas such as intensive care medicine, which are characterised by i) continuous monitoring of multivariate variables and non-uniform sampling of data streams, ii) the outcomes are generally governed by interactions between a small set of rare events, iii) these interactions are not necessarily definable by specific values (or value ranges) of a given group of variables, but rather, by the deviations of these values from the normal state recorded over time, iv) the need to explain the predictions made by the model. Here, while numerous data mining models have been formulated for outcome prediction, they are unable to explain their predictions. We present a model for uncovering interactions with the highest likelihood of generating the outcomes seen from highly-dimensional time series data. Interactions among variables are represented by a relational graph structure, which relies on qualitative abstractions to overcome non-uniform sampling and to capture the semantics of the interactions corresponding to the changes and deviations from normality of variables of interest over time. Using the assumption that similar templates of small interactions are responsible for the outcomes (as prevalent in the medical domains), we reformulate the discovery task to retrieve the most-likely templates from the data.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源