论文标题
在未知渠道统计下,解散的RF指纹提取的表示形式学习
Disentangled Representation Learning for RF Fingerprint Extraction under Unknown Channel Statistics
论文作者
论文摘要
应用于设备的射频指纹〜(RFF)的深度学习(DL),由于其非凡的分类性能,引起了物理层认证的极大关注。传统的DL-RFF技术通过采用最大似然估计〜(MLE)训练。尽管它们的可区分性最近已扩展到开放场景的未知设备,但它们仍然倾向于过度拟合培训数据集中嵌入的频道统计信息。这限制了他们的实际应用,因为收集足够的培训数据来捕获所有可能的无线渠道环境的特征是一个挑战。为了应对这一挑战,我们提出了一个DL表示的DL框架〜(DR)学习,该框架首先学会通过对抗性学习将信号分解为相关的组件和设备 - iRretrelevant组件。然后,它将数据集中的这两个部分混合起来,以进行隐式数据增强,这在RFF提取器学习上实施了强大的正则化,以避免在不收集未知渠道的其他数据的情况下过度拟合设备 - IRRERELERELERELERELERVENTION统计信息。实验验证了所提出的方法(称为基于DR的RFF),即使在未知的复杂传播环境下,即使在简单的直接直接线路线中收集了所有训练数据,也可以在未知的复杂传播环境(例如,分散多径褪色的渠道)上胜过传统方法,即使分散多径褪色通道(LOS)传播。
Deep learning (DL) applied to a device's radio-frequency fingerprint~(RFF) has attracted significant attention in physical-layer authentication due to its extraordinary classification performance. Conventional DL-RFF techniques are trained by adopting maximum likelihood estimation~(MLE). Although their discriminability has recently been extended to unknown devices in open-set scenarios, they still tend to overfit the channel statistics embedded in the training dataset. This restricts their practical applications as it is challenging to collect sufficient training data capturing the characteristics of all possible wireless channel environments. To address this challenge, we propose a DL framework of disentangled representation~(DR) learning that first learns to factor the signals into a device-relevant component and a device-irrelevant component via adversarial learning. Then, it shuffles these two parts within a dataset for implicit data augmentation, which imposes a strong regularization on RFF extractor learning to avoid the possible overfitting of device-irrelevant channel statistics, without collecting additional data from unknown channels. Experiments validate that the proposed approach, referred to as DR-based RFF, outperforms conventional methods in terms of generalizability to unknown devices even under unknown complicated propagation environments, e.g., dispersive multipath fading channels, even though all the training data are collected in a simple environment with dominated direct line-of-sight~(LoS) propagation paths.