论文标题
Graph Learning索引器:一个适合图形学习基准的贡献者友好和元数据富含的平台
Graph Learning Indexer: A Contributor-Friendly and Metadata-Rich Platform for Graph Learning Benchmarks
论文作者
论文摘要
建立开放和一般的基准是现代机器学习技术成功背后的关键推动力。随着机器学习被应用于更广泛的领域和任务,有必要建立更丰富,更多样化的基准,以更好地反映应用程序场景的现实。 Graph Learning是一个新兴的机器学习领域,迫切需要更多和更好的基准测试。为了满足需求,我们介绍了图形学习索引器(GLI),这是一个用于图形学习的基准策展平台。与现有的图形学习基准库相比,GLI突出了两个新型设计目标。首先,GLI旨在激励\ emph {数据集贡献者}。特别是,我们纳入了各种措施,以最大程度地减少贡献和维护数据集,提高贡献数据集的可用性,并鼓励归因于数据集的不同贡献者。其次,GLI旨在策划基准数据集的知识库,而不是简单的集合。我们使用多种元信息来源来增强具有\ emph {Rich特征}的基准数据集,以便可以轻松地选择并将其用于下游研究或开发。 GLI的源代码可在\ url {https://github.com/graph-learning-benchmarks/gli}中获得。
Establishing open and general benchmarks has been a critical driving force behind the success of modern machine learning techniques. As machine learning is being applied to broader domains and tasks, there is a need to establish richer and more diverse benchmarks to better reflect the reality of the application scenarios. Graph learning is an emerging field of machine learning that urgently needs more and better benchmarks. To accommodate the need, we introduce Graph Learning Indexer (GLI), a benchmark curation platform for graph learning. In comparison to existing graph learning benchmark libraries, GLI highlights two novel design objectives. First, GLI is designed to incentivize \emph{dataset contributors}. In particular, we incorporate various measures to minimize the effort of contributing and maintaining a dataset, increase the usability of the contributed dataset, as well as encourage attributions to different contributors of the dataset. Second, GLI is designed to curate a knowledge base, instead of a plain collection, of benchmark datasets. We use multiple sources of meta information to augment the benchmark datasets with \emph{rich characteristics}, so that they can be easily selected and used in downstream research or development. The source code of GLI is available at \url{https://github.com/Graph-Learning-Benchmarks/gli}.