论文标题
一个简单而敏捷的云基础架构,用于支持以网络安全为导向的机器学习工作流程
A Simple and Agile Cloud Infrastructure to Support Cybersecurity Oriented Machine Learning Workflows
论文作者
论文摘要
生成最新的机器学习数据集(ML)安全模型是一个独特的工程挑战,因为大量数据量,标签的复杂性和恒定的概念漂移使得难以生成有效的培训数据集。在这里,我们描述了一种简单,有弹性的云基础架构,用于生成ML培训和测试数据集,从而提高了我们的团队能够研究并保持生产的速度。
Generating up to date, well labeled datasets for machine learning (ML) security models is a unique engineering challenge, as large data volumes, complexity of labeling, and constant concept drift makes it difficult to generate effective training datasets. Here we describe a simple, resilient cloud infrastructure for generating ML training and testing datasets, that has enhanced the speed at which our team is able to research and keep in production a multitude of security ML models.