论文标题

时间序列工作流的云计算服务的语义

Semantic of Cloud Computing services for Time Series workflows

论文作者

Parra-Royón, Manuel, Baldan, Francisco, Atemezing, Ghislain, Benitez, J. M.

论文摘要

时间序列(TS)存在于许多知识,研究和工程领域。 TS的处理和分析对于从数据中提取知识并处理TS的建模是一项艰巨的任务,需要高度统计专业知识以及有关数据挖掘(DM)和机器学习(MACHICEL LEALLED(ML)方法的杰出知识),对TS的处理和分析至关重要。 TS的总体工作不仅限于几种技术的线性应用,而是由方法和测试的开放工作流程组成。这些工作流程主要是在编程语言上开发的,在包括云计算(CC)环境在内的不同系统上有效地执行和运行非常复杂。 CC的采用可以促进服务的集成和可移植性,从而使服务互联网技术(IT)工业化采用解决方案。 TS的工作流服务的定义和描述开辟了有关在CC环境中部署此类问题的复杂性的一组新可能性。从这个意义上讲,我们设计了一个基于语义建模(或词汇)的有效建议,该建议提供了为时间序列建模作为CC服务的工作流程的完整描述。我们的建议包括大量扩展的操作,适应时间序列分类,回归或聚类问题的任何工作流程,以及包括评估措施,信息,测试或机器学习算法等。

Time series (TS) are present in many fields of knowledge, research, and engineering. The processing and analysis of TS are essential in order to extract knowledge from the data and to tackle forecasting or predictive maintenance tasks among others The modeling of TS is a challenging task, requiring high statistical expertise as well as outstanding knowledge about the application of Data Mining(DM) and Machine Learning (ML) methods. The overall work with TS is not limited to the linear application of several techniques, but is composed of an open workflow of methods and tests. These workflow, developed mainly on programming languages, are complicated to execute and run effectively on different systems, including Cloud Computing (CC) environments. The adoption of CC can facilitate the integration and portability of services allowing to adopt solutions towards services Internet Technologies (IT) industrialization. The definition and description of workflow services for TS open up a new set of possibilities regarding the reduction of complexity in the deployment of this type of issues in CC environments. In this sense, we have designed an effective proposal based on semantic modeling (or vocabulary) that provides the full description of workflow for Time Series modeling as a CC service. Our proposal includes a broad spectrum of the most extended operations, accommodating any workflow applied to classification, regression, or clustering problems for Time Series, as well as including evaluation measures, information, tests, or machine learning algorithms among others.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源