论文标题
用于长期天文观测数据档案的重新分布工具
A Redistribution Tool for Long-Term Archive of Astronomical Observation Data
论文作者
论文摘要
天文观察数据需要长期保存,并且观察数据的迅速积累使得有必要考虑长期存档存储的成本。除了基于低速磁盘的在线存储外,还可以使用基于光盘或基于磁带的离线存储来节省成本。但是,对于需要历史数据(尤其是时间域天文学)的天文学研究,数据访问技术的性能和能耗会引起问题,因为所需的数据(根据观察时间组织)可能位于多个存储设备之间。在这项研究中,我们设计并开发了一种称为Astrolayout的工具,以使用空间聚合重新分布观察数据。核心算法使用图形分区来根据原始观察数据统计和目标存储系统生成优化的数据放置。对于给定的观察数据,Astrolayout可以根据此放置的位置复制目标存储系统中的长期存档。效率评估表明,当响应时间域天文学研究中的数据访问请求时,Astrolayout可以减少激活的设备数量。除了提高数据访问技术的性能外,Astrolayout还可以减少存储系统功耗。为了增强适应性,它支持任何媒体的存储系统,包括光盘,磁带和硬盘。
Astronomical observation data require long-term preservation, and the rapid accumulation of observation data makes it necessary to consider the cost of long-term archive storage. In addition to low-speed disk-based online storage, optical disk or tape-based offline storage can be used to save costs. However, for astronomical research that requires historical data (particularly time-domain astronomy), the performance and energy consumption of data-accessing techniques cause problems because the requested data (which are organized according to observation time) may be located across multiple storage devices. In this study, we design and develop a tool referred to as AstroLayout to redistribute the observation data using spatial aggregation. The core algorithm uses graph partitioning to generate an optimized data placement according to the original observation data statistics and the target storage system. For the given observation data, AstroLayout can copy the long-term archive in the target storage system in accordance with this placement. An efficiency evaluation shows that AstroLayout can reduce the number of devices activated when responding to data-access requests in time-domain astronomy research. In addition to improving the performance of data-accessing techniques, AstroLayout can also reduce the storage systems power consumption. For enhanced adaptability, it supports storage systems of any media, including optical disks, tapes, and hard disks.