论文标题

生物标志物引导的异质性分析遗传法规通过多元稀疏融合

Biomarker-guided heterogeneity analysis of genetic regulations via multivariate sparse fusion

论文作者

Zhang, Sanguo, Hu, Xiaonan, Luo, Ziye, Jiang, Yu, Sun, Yifan, Ma, Shuangge

论文摘要

异质性是许多复杂疾病的标志。有多种定义异质性的方法,其中已经提出了CNV(拷贝数变化)和甲基化的遗传法规(例如GES(基因表达))的异质性,但很少研究。遗传法规的异质性可以与疾病的严重程度,进展和其他特征有关,并且在生物学上很重要。但是,由于调节的两侧以及稀疏和弱信号的高维度,该分析可能非常具有挑战性。在本文中,我们考虑了受试者形成未知子组的方案,每个亚组具有独特的遗传调节关系。此外,这种异质性是由已知的生物标志物“指导”的。我们开发了MSF(多变量稀疏融合)方法,该方法创新地应用了惩罚的融合技术来同时确定每个亚组中亚组和调节关系的数量和结构。开发了有效的计算算法,并进行了广泛的模拟。使用TCGA数据对GE-CNV法规的异质性和GE-甲基化法规的异质性分析导致有趣的发现。

Heterogeneity is a hallmark of many complex diseases. There are multiple ways of defining heterogeneity, among which the heterogeneity in genetic regulations, for example GEs (gene expressions) by CNVs (copy number variations) and methylation, has been suggested but little investigated. Heterogeneity in genetic regulations can be linked with disease severity, progression, and other traits and is biologically important. However, the analysis can be very challenging with the high dimensionality of both sides of regulation as well as sparse and weak signals. In this article, we consider the scenario where subjects form unknown subgroups, and each subgroup has unique genetic regulation relationships. Further, such heterogeneity is "guided" by a known biomarker. We develop an MSF (Multivariate Sparse Fusion) approach, which innovatively applies the penalized fusion technique to simultaneously determine the number and structure of subgroups and regulation relationships within each subgroup. An effective computational algorithm is developed, and extensive simulations are conducted. The analysis of heterogeneity in the GE-CNV regulations in melanoma and GE-methylation regulations in stomach cancer using the TCGA data leads to interesting findings.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源