推荐系统的用户相似性系数计算的改进模型

论文标题

The improved model of user similarity coefficients computation For recommendation systems

论文作者

Meleshko, Yelyzaveta, Drieiev, Oleksandr, Al-Oraiqat, Anas Mahmoud

论文摘要

本文的主题是计算推荐系统的用户相似性系数的模型。目的是开发建议系统的用户相似性系数计算的改进模型，以优化形成建议列表的时间。要解决的任务是：通过比较其相似性系数的时间来调查更改推荐系统用户偏好的可能性，以调查哪种分布函数描述了用户及时的相似性系数的变化。所使用的方法是：图理论，概率理论，放射性理论，算法理论。结论。在研究过程中，改进了推荐系统计算的用户相似性系数的模型。该模型与已知的模型不同，因为它考虑了单个用户的相似性系数的重新计算周期，以及系统或特定用户组的所有用户的相似性系数的平均重新计算期。已经开发了该软件，其中进行了一系列实验以测试开发方法的有效性。进行的实验表明，开发的方法一般会提高推荐系统的质量，而没有明显的精确性和系统回忆的显着波动。精度和召回可能会略微降低或增加，具体取决于传入数据集的特征。提出的解决方案的使用将增加用户的相似性系数的应用期限，以预测偏好，而无需重新计算，因此，它将缩短形成时间和推荐列表的发行时间多达2次。

The subject matter of the article is a model of calculating the user similarity coefficients of the recommendation systems. The goal is the development of the improved model of user similarity coefficients calculation for recommendation systems to optimize the time of forming recommendation lists. The tasks to be solved are: to investigate the probability of changing user preferences of a recommendation system by comparing their similarity coefficients in time, to investigate which distribution function describes the changes of similarity coefficients of users in time. The methods used are: graph theory, probability theory, radioactivity theory, algorithm theory. Conclusions. In the course of the researches, the model of user similarity coefficients calculating for the recommendation systems has been improved. The model differs from the known ones in that it takes into account the recalculation period of similarity coefficients for the individual user and average recalculation period of similarity coefficients for all users of the system or a specific group of users. The software has been developed, in which a series of experiments was conducted to test the effectiveness of the developed method. The conducted experiments showed that the developed method in general increases the quality of the recommendation system without significant fluctuations of Precision and Recall of the system. Precision and Recall can decrease slightly or increase, depending on the characteristics of the incoming data set. The use of the proposed solutions will increase the application period of the previously calculated similarity coefficients of users for the prediction of preferences without their recalculation and, accordingly, it will shorten the time of formation and issuance of recommendation lists up to 2 times.

下载PDF全文

下载文献需遵守相关版权规定

论文标题