Paper Title

Sigma Workbook: A Spreadsheet for Cloud Data Warehouses

Authors

James Gale, Max Seiden, Deepanshu Utkarsh, Jason Frantz, Rob Woollen, Çağatay Demiralp

Abstract

Cloud data warehouses (CDWs) bring large-scale data and compute power closer to users in enterprises. However, existing tools for analyzing data in CDWs are either limited in ad-hoc transformations or difficult for business users to use. Here we introduce Sigma Workbook, a new interactive system that enables business users to easily perform visual analysis of data in CDWs at scale. For this, Sigma Workbook provides an accessible spreadsheet-like interface for analysis through direct manipulation. Sigma Workbook dynamically constructs matching SQL queries from user interactions, building on the versatility and expressivity of SQL. Constructed queries are executed directly on CDWs, leveraging the superior characteristics of the new generation of CDWs, including scalability. We demonstrate Sigma Workbook through three real-life use cases (cohort analysis, sessionization, and data augmentation) and underline Workbook's ease of use, scalability, and expressivity.
