论文标题

contentwise印象:包含印象的工业数据集

ContentWise Impressions: An Industrial Dataset with Impressions Included

论文作者

Maurera, Fernando Benjamín Pérez, Dacrema, Maurizio Ferrari, Saule, Lorenzo, Scriminaci, Mario, Cremonesi, Paolo

论文摘要

在本文中,我们介绍了ContentWise Impressions数据集,隐含交互的集合以及来自顶级媒体服务的电影和电视连续剧的印象,该服务通过Internet提供了其媒体内容。该数据集与其他已经可用的多媒体建议数据集区别了,即印象的可用性,即向用户显示的建议,其大小和开源。与其他常用数据集相比,我们描述了数据收集过程,应用预处理,其特征和统计信息。我们还强调了几种可能的用例和研究问题,这些问题可以从开源数据集中的用户印象中受益。此外,我们发布了加载和拆分数据的软件工具,以及如何在几种常见建议算法中使用用户交互和印象的示例。

In this article, we introduce the ContentWise Impressions dataset, a collection of implicit interactions and impressions of movies and TV series from an Over-The-Top media service, which delivers its media contents over the Internet. The dataset is distinguished from other already available multimedia recommendation datasets by the availability of impressions, i.e., the recommendations shown to the user, its size, and by being open-source. We describe the data collection process, the preprocessing applied, its characteristics, and statistics when compared to other commonly used datasets. We also highlight several possible use cases and research questions that can benefit from the availability of user impressions in an open-source dataset. Furthermore, we release software tools to load and split the data, as well as examples of how to use both user interactions and impressions in several common recommendation algorithms.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源