鸿源韬生物

ChIP-Seq生信分析报告

1.项目简介

1.1 样本信息

合同编号 RS202503001
实验技术 ChIP-Seq
靶蛋白 H3K27me3
物种名称 拟南芥
拉丁名 Arabidopsis thaliana
参考基因组 tair10
报告生成日期 2025年04月23日


1.2 实验原理及流程

染色质免疫共沉淀高通量测序(Chromatin Immunoprecipitation and high-throughput sequencing,ChIP-Seq)是一种用于分析蛋白质与DNA交互作用的研究方法。该技术将染色质免疫沉淀(ChIP)与高通量测序技术结合,可鉴定蛋白在全基因组上的结合位点。ChIP-Seq 可以鉴定任意具有DNA结合活性的蛋白在体内的结合位点,是研究表观遗传学调控和转录调控的经典方法。
表观遗传学调控的主要方式 DNA 甲基化修饰和组蛋白修饰。组蛋白氨基酸可以发生乙酰化、磷酸化和甲基化等修饰,这些修饰可影响组蛋白与DNA的亲和性从而改变染色质的疏松和凝集状态,或改变其他蛋白质与 DNA 结合的亲和性从而间接地影响基因表达。转录调控主要是通过启动子区的顺式作用元件和反式作用的蛋白因子(主要是转录因子)的互作来实现的。ChIP-Seq 技术可以动态的捕捉到生物体内转录因子在基因启动子区的结合情况及强度变化,因此是研究转录调控的有力武器。



1.3 实验流程


ChIP-Seq实验主要步骤有六步:
1)使用甲醛处理组织/细胞,使得DNA与蛋白质交联;
2)破碎细胞,收集细胞核;
3)通过超声打断DNA成200-1000bp的片段;
4)利用抗体将与目的蛋白相结合的DNA片段沉淀下来;
5)解交联释放DNA片段并提取DNA;
6)构建测序文库后进行高通量测序。
ChIP-Seq 文库构建如下图所示:


ChIP-Seq DNA 文库构建流程


进行高通量测序上机前必须要构建与之匹配的测序文库,测序文库的主要构建流程如下:
1)对IP下来的 DNA进行片段化与接头连接;
2)将DNA片段进行PCR扩增,富集带有接头的DNA片段·;
3)利用磁珠纯化回收目的大小片段,获得待测序文库;
4)利用琼脂糖电泳方法对所构建的文库进行质检;
5)利用Qubit4.0对文库进行定量,确定文库浓度是否适合上机;
6)文库质检合格后,把不同文库按照有效浓度及目标下机数据量的需求在测序仪上进行上机测序。

DNA文库构建

1.4 分析流程

获得测序原始数据(raw data)后,首先对原始数据进行过滤,获得高质量的测序数据(clean data),将测序数据(clean data)比对到项目物种的参考基因组上,对比对结果进行鉴定峰位点(peak calling),对peak关联基因进行注释以及富集分析, 有生物学重复时进行差异Peak、共识Peak分析以及motif分析。


ChIP-Seq 生物信息学分析流程



2. 数据质控

我们交付的原始数据为fastq(简称fq)格式文件的压缩包,文件名后缀通常为 “.fq.gz”。交付数据前我们会计算每个压缩文件的md5值。在您拿到数据之后,请您先校>验每个压缩文件的md5值,Linux下可以在数据目录使用“md5sum -c <*md5.txt>”命令进行校验,Windows下可使用hashmyfiles等校验工具,如发现压缩文件md5值与附在数据文件目录下的md5文档中的不一致则说明文件可能在传输的过程中被损坏。数据文件大小为文件占用磁盘空间的大小,文件的大小通常与磁盘格式、压缩比例等因素有关,与测序数据量(碱基数)的多少无对应关系,因此对应PE测序的 read1和read2两个文件大小也可能不相同。

将高通量测序得到的原始图像数据经过Base Calling 转化为序列数据,即FASTQ格式,得到最原始的测序数据文件。FASTQ 格式文件可记录所测读段(read)的碱基及其质量分数。FASTQ 格式以测序读段为单位进行存储,每条读段占 4 行,第一行是序列标识(read ID)以及相关的描述信息,以“@” 开头;第二行即为碱基序列,长度由测序策略决定;第三行以“+”开头,后面是序列标示符、描述信息,或者什么也不加; 第四行是测序质量值(phred),与第二行一一对应,phred值以ASCII码标记,对应的 ASCII 值减去33,即为第二行对应碱基的测序质量值,示例如下:

@HWI-ST1276:71:C1162ACXX:1:1101:1208:2458 1:N:0:CGATGT
NAAGAACACGTTCGGTCACCTCAGCACACTTGTGAATGTCATGGGATCCAT
+
#55???BBBBB?BA@DEEFFCFFHHFFCFFHHHHHHHFAE0ECFFD/AEHH

测序错误率用e表示, 平台测得数据的碱基质量值用Qphred表示,则有:Qphred=-10log10(e)。软件中碱基识别正确率与Phred分值之间的简明对应关系见下表:

Phred分值不正确的碱基识别碱基正确识别率Q-score
101/1090%Q10
201/10099%Q20
301/100099.9%Q30

测序Reads的错误率往往会随着测序接近尾声而升高,这是由测序过程中化学试剂的消耗造成共有的特征。



2.1 原始数据质控

ChIP-Seq实验基于第二代测序(NGS)平台完成,采用双端测序文库构建策略(插入片段~300 bp)。我们需要对原始测序数据进行质量评估与过滤,以确保后续分析的可靠性。首先,我们使用FastQC(version 0.12.1)(Andrews, 2010)对原始测序数据(raw data)进行全局质量分析,包括碱基质量分布(Phred score)、碱基组成平衡性(base content uniformity)、重复序列比例(duplication level)及GC含量偏差等指标,以全面评估测序质量。
我们使用Fastp(version 0.24.0)(Chen et al., 2018)对原始测序数据进行以下过滤操作。
接头序列去除:识别并切除双端reads中的接头序列;
低复杂度序列过滤:剔除含模糊碱基(N碱基占比≥10%)的reads;
动态质量修剪:通过滑动窗口法(5 bp窗口步长)评估局部序列质量,当窗口平均Phred score小于20时,执行3'端截断;
长度筛选:保留长度≥25 bp的paired-end reads,长度不足的reads及其匹配reads(R1/R2)均被排除。
原始和过滤后质控结果请详见result/1.qc文件夹,raw为原始数据质控结果,clean为过滤后质控结果。



图2.1 各个样本平均测序碱基质量分数,横坐标代表150 bp长度序列中各个位置,纵坐标为该位置平均的碱基质量值Q;盒形图中间的红线表示中位数(median value);黄色部分代表四分位距(25-75%);上下分割线代表 90%和 10%的上下临界值;蓝色的线代表碱基质量的平均值。


图2.2 各个样本碱基平衡性,图中四条线代表A T C G在每个位置平均含量。理论上,A和T应该相等,G和C应该相等,且4种碱基平行且接近分布。正常情况下四种碱基的出现频率应该是接近的,而且没有位置差异。因此好的样本中四条线应该平行且接近。当部分位置碱基的比例出现 bias 时,即四条线波动较大时可能存在测序数据或者文库污染。如果所有位置的碱基比例一致的表现出bias 时,即四条线平行但分开,往往代表文库有 bias (建库过程或本身特点),或者是测序中的系统误差。测序刚开始由于测序仪状态不稳定,在15bp之前很可能出现波动。


图2.3 各个样本每条序列GC含量,各个样本每条序列GC含量,横轴为序列GC含量百分比; 纵轴是每条序列GC含量对应的数量,蓝色的线是程序根据经验分布给出的理论值,红色是真实值,理想情况下两者接近,当红色的线出现双峰时,可能存在其他物种DNA污染。


图2.4 各个样本重复序列水平,测序深度越高,越容易产生一定程度的重复(duplication),这属于正常的现象。但如果duplication 的程度很高,就提示我们可能有 bias 的存在(如建库过程中由于 PCR 扩增引起的duplication)。横坐标为 reads 重复的次数,纵坐标为重复次数对应的 reads 占 unique reads 的比例,以unique reads 的总数作为 100%。这里,我们仅对文件前 2000000 个reads 进行统计:对长度小于75bp 的reads 将其截短为 50bp,用于统计重复。


2.2 过滤后数据质控

这里展示Fastp过滤后的数据质控结果,图片内容与上面raw data类似。

图2.5 各个样本平均测序碱基质量分数,横坐标代表150 bp长度序列中各个位置,纵坐标为该位置平均的碱基质量值Q;盒形图中间的红线表示中位数(median value);黄色部分代表四分位距(25-75%);上下分割线代表 90%和 10%的上下临界值;蓝色的线代表碱基质量的平均值。

图2.6 各个样本碱基平衡性,图中四条线代表A T C G在每个位置平均含量。理论上,A和T应该相等,G和C应该相等,且4种碱基平行且接近分布。正常情况下四种碱基的出现频率应该是接近的,而且没有位置差异。因此好的样本中四条线应该平行且接近。当部分位置碱基的比例出现 bias 时,即四条线在某些位置波动较大时,可能测序数据或者文库存在污染。当所有位置的碱基比例一致的表现出bias 时,即四条线平行但分开,往往代表文库有 bias (建库过程或本身特点),或者是测序中的系统误差。一般测序的时候,刚开始测序仪状态不稳定,在15bp之前很可能出现波动。

图2.7 各个样本每条序列GC含量,横轴为序列GC含量百分比; 纵轴是每条序列GC含量对应的数量,蓝色的线是程序根据经验分布给出的理论值,红色是真实值,理想情况下两者接近,当红色的线出现双峰时,可能存在其他物种DNA污染。

图2.8 各个样本重复序列水平,测序深度越高,越容易产生一定程度的重复(duplication),这属于正常的现象。但如果duplication 的程度很高,就提示我们可能有 bias 的存在(如建库过程中由于 PCR 扩增引起的duplication)。横坐标为 reads 重复的次数,纵坐标为重复次数对应的 reads 占 unique reads 的比例,以unique reads 的总数作为 100%。这里,我们仅对文件前 2000000 个reads 进行统计:对长度小于75bp 的reads 将其截短为 50bp,用于统计重复。



2.3 数据过滤结果统计

我们对数据过滤结果进行统计,如下表所示:

Sample Raw_Total_Reads Raw_Total_Bases Raw_Q20_Rate Raw_Q30_Rate Raw_GC_Content Clean_Total_Reads Clean_Total_Bases Clean_Q20_Rate Clean_Q30_Rate Clean_GC_Content
H3K27me3_Seedling_rep1_INPUT 14.64M 2195.60M 0.953 0.897 0.501 14.61M 1026.53M 0.986 0.954 0.423
H3K27me3_Seedling_rep1_IP 29.70M 4455.28M 0.963 0.913 0.471 29.60M 2833.20M 0.983 0.947 0.420
H3K27me3_Seedling_rep2_INPUT 15.56M 2334.41M 0.952 0.897 0.500 15.53M 1099.30M 0.987 0.956 0.422
H3K27me3_Seedling_rep2_IP 34.28M 5142.63M 0.962 0.913 0.468 34.15M 3312.78M 0.983 0.947 0.417
H3K27me3_Spm_rep1_INPUT 15.24M 2285.82M 0.930 0.876 0.483 15.20M 1300.96M 0.981 0.942 0.423
H3K27me3_Spm_rep1_IP 24.54M 3681.16M 0.946 0.897 0.443 24.47M 2643.71M 0.979 0.938 0.404
H3K27me3_Spm_rep2_INPUT 16.54M 2480.36M 0.957 0.900 0.472 16.50M 1450.92M 0.982 0.942 0.414
H3K27me3_Spm_rep2_IP 23.94M 3590.80M 0.963 0.913 0.461 23.88M 2497.34M 0.981 0.942 0.423

表 2.1数据过滤结果统计:
Sample:样品名称;
Raw_Total_Reads/Clean_Total_Reads:过滤前后样本总reads数量,单位为百万;
Raw_Total_Bases/Clean_Total_Bases:过滤前后样本总碱基数量,单位为百万;
Raw_Q20_Rate/Clean_Q20_Rate:过滤前后样本Q20碱基比例;
Raw_Q30_Rate/Clean_Q30_Rate:过滤前后样本Q30碱基比例;
Raw_GC_Content/Clean_GC_Content:过滤前后样本GC含量。







3. 比对参考基因组

我们将各样品过滤后的clean data的reads与参考基因组进行比对,获取Reads在参考基因组上的定位信息,这里使用的软件是Bowtie2(version 2.4.5)(Langmead B. et al., 2018)。来自一个DNA片段的多个拷贝,可能会锚定在多个read上,经过测序得到的这些reads就是PCR重复。PCR本身就是为了产生重复序列的。理论上来讲,不同的序列在进行PCR扩增时,扩增的倍数应该是相同的。但是由于聚合酶的偏好性,PCR扩增次数过多的情况下,会导致一些序列持续扩增,而另一些序列扩增到一定程度后便不再进行,也就是我们常说的PCR偏好性。因此,比对完成后我们使用软件Sambamba(version 1.0.1)(Tarasov A. et al., 2015)去除PCR重复,获取unique reads。

3.1 比对参考基因组情况

sample clean_reads PCR_dup PCR_dup % prop_map_reads prop_map % MAPQ30
H3K27me3_Spm_rep1_IP 24,471,390 6,461,266 26.400 23,046,712 94.180 20,399,978
H3K27me3_Seedling_rep2_INPUT 15,528,804 4,959,143 31.940 15,329,122 98.710 11,247,030
H3K27me3_Seedling_rep2_IP 34,151,238 12,569,857 36.810 33,229,690 97.300 27,345,650
H3K27me3_Seedling_rep1_IP 29,596,026 9,985,388 33.740 28,471,452 96.200 23,367,474
H3K27me3_Spm_rep2_INPUT 16,498,512 5,441,349 32.980 16,179,912 98.070 12,119,348
H3K27me3_Spm_rep2_IP 23,876,862 6,649,682 27.850 22,030,384 92.270 18,233,777
H3K27me3_Seedling_rep1_INPUT 14,605,950 4,409,460 30.190 14,459,770 99.000 10,579,696
H3K27me3_Spm_rep1_INPUT 15,199,356 5,052,727 33.240 14,940,288 98.300 10,724,899

表 3.1比对结果统计:
Sample:样品名称;
clean_reads:clean后reads总数;
PCR_dup:鉴定为PCR重复的reads数;
PCR_dup %:PCR重复reads百分比;
prop_map_reads:完美比对的reads总数,PE两端reads比对到同一条序列,且根据比对结果推断的插入片段大小符合设置的阈值;
prop_map %:完美比对reads百分比;
MAPQ30:MAPQ值大于30的reads数。




3.2 Reads富集情况

我们使用Deeptools(version 3.5.4)(Ramírez F. et al., 2016)软件对reads富集情况进行可视化,绘制信号在基因不同区域(Transcription Start Site,转录起始位点,TSS; Transcription End Site,转录终止位点,TES)的分布。对于可能富集在基因区域或者启动子区域的靶蛋白,其IP信号会富集在基因的TSS上游到TES区域,且显著高于INPUT信号。


图3.1 各样本reads富集情况。横坐标为相对基因位置,纵坐标为按照基因组大小RPGC标准化后reads富集分数。



图3.2 各样本reads富集热图。下方热图代表基因上下游Reads富集情况,每一行代表一个基因上下游区域reads富集程度。




3.3 比对可视化

软件比对所得结果为bam格式文件(位于report/result/2.map文件夹中),bam文件是压缩的⼆进制⽂件,无法直接作为文本打开查看。由于bam文件数据较大,我们将其转为较小的bw格式文件。客户可以结合物种参考基因组和注释文件使用IGV (Integrative Genomics Viewer) 浏览器对bam、bw、bed等文件进行可视化浏览。IGV浏览器使用方法可参考我们提供的使用说明文档IGV快速上手







4. 峰鉴定

我们使用MACS3(version 3.0.0)(Zhang Y. et al. 2008)进行peak鉴定,即找到靶蛋白富集的基因组区域。MACS(Model-based Analysis of ChIP-Seq)是一种基于统计学模型的算法,专用于从免疫沉淀测序数据中精准识别转录因子结合位点或组蛋白修饰富集区域(Peaks)。其核心原理是通过对比实验组(IP)与对照组(Input或非特异性抗体样本)的测序信号,构建动态背景噪声模型以区分特异性结合与非随机分布的背景事件。算法首先通过双滑动窗口扫描基因组,利用泊松分布或负二项分布评估局部富集信号的显著性(P-value),并结合片段长度推断(Fragment Size)优化覆盖深度分析;随后通过两步法(粗筛峰与精细调整)定位峰边界,并确定信号峰值中心(Summit)(如果是broad模式则无summit)。使用Phantompeakqualtools(version 1.2.2)(Landt SG. et al. 2012)进行基于phantom peak的数据质量评估,计算NSC和RSC。本节结果请详见位于report/result/3.peak文件夹中

4.1 Peak信息统计

sample Peak num FRIP NSC RSC Peak reads Total reads
H3K27me3_Seedling_rep1 10,305 0.468 1.122 0.925 13,863,472 29,596,026
H3K27me3_Seedling_rep2 12,192 0.462 1.115 0.978 15,783,447 34,151,238
H3K27me3_Spm_rep1 11,395 0.331 1.085 0.983 8,087,911 24,471,390
H3K27me3_Spm_rep2 12,638 0.325 1.119 0.973 7,760,576 23,876,862

表 4.1 Peak信息统计:
sample:样品名称;
Peak num:Peak数量;
FRiP(Fraction of Reads in Peaks)值表示映射到峰区的 reads 占总 reads 的比例,反映了 ChIP 实验的富集效果。较高的 FRiP 值表明实验成功地富集了目标区域的 DNA 片段,而较低的 FRiP 值可能表明富集效果差或背景噪声较高。
NSC(Normalized strand cross-correlation coefficent)衡量富集区域的标准化交叉相关性得分。NSC值越大表明富集效果越好,NSC值低于1.1表明较弱的富集,小于1表示无富集。NSC值稍微低于1.05,有较低的信噪比或很少的峰,这可能是生物学真实现象,比如有的因子在特定组织类型中只有很少的结合位点;也可能确实是数据质量差。
RSC(Relative strand cross-correlation coefficient)评估ChIP-Seq数据中的伪峰情况,RSC是片段长度相关值减去背景相关值除以phantom-peak相关值减去背景相关值。RSC的最小值可能是0,表示无信号;富集好的实验RSC值大于1;低于1表示质量低。
Peak reads:映射到峰区域的reads数;Total reads:样本总reads数。
Qv:MACS3 callpeak使用的qvalue阈值。




4.2 Call Peak结果

各个样本peak信息结果表部分内容如下,完整信息请查看report/result/3.peak/{样本名称}_peaks.tsv表格。“.broadPeak”或“.narrowPeak”文件为MACS3的输出文件,用于描述峰区域信息,可在IGV浏览器中打开。

显示前100行 (共10330行)
chr start end length pileup -log10(pvalue) fold_enrichment -log10(qvalue) name
1 9766 13394 3629 27.610 10.904 3.892 9.213 H3K27me3_Seedling_rep1_peak_1
1 14210 14635 426 13.910 3.995 2.732 2.628 H3K27me3_Seedling_rep1_peak_2
1 16711 16982 272 11.260 2.621 2.231 1.370 H3K27me3_Seedling_rep1_peak_3
1 17881 19394 1514 15.130 4.286 2.797 2.899 H3K27me3_Seedling_rep1_peak_4
1 40756 40987 232 17.210 4.009 2.549 2.646 H3K27me3_Seedling_rep1_peak_5
1 43409 44564 1156 12.750 3.116 2.432 1.820 H3K27me3_Seedling_rep1_peak_6
1 57872 58647 776 16.530 3.354 2.317 2.047 H3K27me3_Seedling_rep1_peak_7
1 78810 80660 1851 17.410 6.023 3.307 4.533 H3K27me3_Seedling_rep1_peak_8
1 83098 84837 1740 28.470 12.924 4.690 11.122 H3K27me3_Seedling_rep1_peak_9
1 86808 87131 324 13.400 2.521 2.092 1.291 H3K27me3_Seedling_rep1_peak_10
1 102645 103765 1121 14.840 4.357 2.841 2.964 H3K27me3_Seedling_rep1_peak_11
1 104636 105336 701 78.210 48.033 8.117 45.263 H3K27me3_Seedling_rep1_peak_12
1 113312 113844 533 15.940 4.444 2.794 3.061 H3K27me3_Seedling_rep1_peak_13
1 132412 132660 249 19.590 4.993 2.758 3.569 H3K27me3_Seedling_rep1_peak_14
1 138444 139505 1062 30.510 14.917 5.211 13.039 H3K27me3_Seedling_rep1_peak_15
1 148423 148971 549 15.750 3.661 2.490 2.327 H3K27me3_Seedling_rep1_peak_16
1 149405 149683 279 23.680 4.578 2.448 3.182 H3K27me3_Seedling_rep1_peak_17
1 151176 152039 864 13.030 2.575 2.144 1.337 H3K27me3_Seedling_rep1_peak_18
1 152864 154179 1316 24.150 10.199 4.222 8.510 H3K27me3_Seedling_rep1_peak_19
1 154944 155963 1020 26.460 8.401 3.363 6.812 H3K27me3_Seedling_rep1_peak_20
1 167051 167878 828 42.170 18.364 4.909 16.369 H3K27me3_Seedling_rep1_peak_21
1 172406 172777 372 14.100 3.869 2.668 2.513 H3K27me3_Seedling_rep1_peak_22
1 177320 177841 522 16.340 3.422 2.357 2.111 H3K27me3_Seedling_rep1_peak_23
1 209196 210541 1346 22.890 8.643 3.840 7.015 H3K27me3_Seedling_rep1_peak_24
1 211131 212086 956 15.020 4.007 2.682 2.643 H3K27me3_Seedling_rep1_peak_25
1 214155 214308 154 12.960 3.431 2.558 2.103 H3K27me3_Seedling_rep1_peak_26
1 223985 224123 139 13.410 2.663 2.168 1.416 H3K27me3_Seedling_rep1_peak_27
1 238461 238787 327 17.580 2.592 1.973 1.361 H3K27me3_Seedling_rep1_peak_28
1 244769 245995 1227 17.900 5.261 2.958 3.811 H3K27me3_Seedling_rep1_peak_29
1 246766 246954 189 14.350 3.495 2.503 2.176 H3K27me3_Seedling_rep1_peak_30
1 247395 247621 227 11.680 2.477 2.218 1.243 H3K27me3_Seedling_rep1_peak_31
1 257045 257509 465 41.400 14.172 3.930 12.325 H3K27me3_Seedling_rep1_peak_32
1 258066 258199 134 13.740 2.734 2.191 1.482 H3K27me3_Seedling_rep1_peak_33
1 310484 310917 434 14.110 3.689 2.591 2.357 H3K27me3_Seedling_rep1_peak_34
1 311506 312565 1060 24.740 4.288 2.276 2.913 H3K27me3_Seedling_rep1_peak_35
1 341231 341877 647 21.710 5.021 2.639 3.605 H3K27me3_Seedling_rep1_peak_36
1 358126 359557 1432 40.040 21.705 6.198 19.588 H3K27me3_Seedling_rep1_peak_37
1 365185 367640 2456 56.570 38.688 7.625 36.288 H3K27me3_Seedling_rep1_peak_38
1 370185 372367 2183 22.670 9.399 3.827 7.794 H3K27me3_Seedling_rep1_peak_39
1 415117 418254 3138 37.980 22.255 6.431 20.149 H3K27me3_Seedling_rep1_peak_40
1 418842 419953 1112 22.930 8.911 3.942 7.268 H3K27me3_Seedling_rep1_peak_41
1 420695 422243 1549 16.390 4.961 2.950 3.541 H3K27me3_Seedling_rep1_peak_42
1 460332 460802 471 34.810 15.194 4.797 13.315 H3K27me3_Seedling_rep1_peak_43
1 464019 467888 3870 26.810 10.963 4.074 9.264 H3K27me3_Seedling_rep1_peak_44
1 472752 473105 354 19.590 6.352 3.345 4.829 H3K27me3_Seedling_rep1_peak_45
1 479508 483108 3601 30.320 14.277 4.897 12.434 H3K27me3_Seedling_rep1_peak_46
1 486920 487361 442 27.500 9.051 3.604 7.391 H3K27me3_Seedling_rep1_peak_47
1 489184 490517 1334 23.480 8.060 3.519 6.464 H3K27me3_Seedling_rep1_peak_48
1 498069 498446 378 21.310 6.433 3.095 4.935 H3K27me3_Seedling_rep1_peak_49
1 504886 505315 430 14.460 3.991 2.686 2.629 H3K27me3_Seedling_rep1_peak_50
1 506448 507139 692 21.350 5.183 2.734 3.749 H3K27me3_Seedling_rep1_peak_51
1 535203 535457 255 12.550 2.817 2.268 1.555 H3K27me3_Seedling_rep1_peak_52
1 536608 537164 557 35.060 14.863 4.729 12.963 H3K27me3_Seedling_rep1_peak_53
1 543282 543832 551 14.140 3.200 2.367 1.907 H3K27me3_Seedling_rep1_peak_54
1 545609 551586 5978 21.400 7.727 3.638 6.146 H3K27me3_Seedling_rep1_peak_55
1 557323 558696 1374 26.970 8.845 3.549 7.217 H3K27me3_Seedling_rep1_peak_56
1 582412 583843 1432 21.670 8.013 3.701 6.419 H3K27me3_Seedling_rep1_peak_57
1 605299 606624 1326 22.500 9.560 4.152 7.899 H3K27me3_Seedling_rep1_peak_58
1 610772 612373 1602 30.370 15.970 5.422 14.073 H3K27me3_Seedling_rep1_peak_59
1 618378 620017 1640 40.660 17.490 4.598 15.563 H3K27me3_Seedling_rep1_peak_60
1 620724 621144 421 22.850 6.453 3.021 4.956 H3K27me3_Seedling_rep1_peak_61
1 663140 663936 797 15.070 3.021 2.244 1.746 H3K27me3_Seedling_rep1_peak_62
1 665395 666228 834 18.060 5.755 3.190 4.268 H3K27me3_Seedling_rep1_peak_63
1 677413 678786 1374 18.880 4.906 2.733 3.503 H3K27me3_Seedling_rep1_peak_64
1 698258 698532 275 21.140 8.265 3.814 6.668 H3K27me3_Seedling_rep1_peak_65
1 699089 700577 1489 17.700 6.069 3.284 4.584 H3K27me3_Seedling_rep1_peak_66
1 706291 706433 143 19.270 3.987 2.441 2.626 H3K27me3_Seedling_rep1_peak_67
1 747271 747501 231 16.570 3.288 2.287 1.985 H3K27me3_Seedling_rep1_peak_68
1 748305 748972 668 19.600 5.967 3.136 4.474 H3K27me3_Seedling_rep1_peak_69
1 769892 770347 456 20.030 7.580 3.720 5.998 H3K27me3_Seedling_rep1_peak_70
1 772034 772773 740 14.250 3.863 2.680 2.503 H3K27me3_Seedling_rep1_peak_71
1 790271 790847 577 19.110 2.939 2.055 1.669 H3K27me3_Seedling_rep1_peak_72
1 819880 820458 579 12.230 2.798 2.302 1.529 H3K27me3_Seedling_rep1_peak_73
1 820868 821083 216 13.270 3.498 2.587 2.171 H3K27me3_Seedling_rep1_peak_74
1 843379 843960 582 25.650 5.189 2.492 3.757 H3K27me3_Seedling_rep1_peak_75
1 856103 856345 243 17.130 4.046 2.624 2.678 H3K27me3_Seedling_rep1_peak_76
1 857316 859694 2379 24.410 9.085 3.690 7.467 H3K27me3_Seedling_rep1_peak_77
1 871722 872926 1205 33.460 19.359 6.215 17.338 H3K27me3_Seedling_rep1_peak_78
1 899204 899399 196 13.170 2.833 2.254 1.575 H3K27me3_Seedling_rep1_peak_79
1 922616 929704 7089 31.430 15.393 4.873 13.540 H3K27me3_Seedling_rep1_peak_80
1 952324 956502 4179 47.360 31.352 6.908 29.116 H3K27me3_Seedling_rep1_peak_81
1 957239 957975 737 54.840 21.743 4.782 19.623 H3K27me3_Seedling_rep1_peak_82
1 958651 959643 993 24.090 8.519 3.576 6.928 H3K27me3_Seedling_rep1_peak_83
1 960052 960568 517 16.280 4.758 2.938 3.338 H3K27me3_Seedling_rep1_peak_84
1 967743 970870 3128 37.990 22.453 6.452 20.352 H3K27me3_Seedling_rep1_peak_85
1 971649 972885 1237 13.450 3.683 2.632 2.339 H3K27me3_Seedling_rep1_peak_86
1 974060 974196 137 15.270 3.672 2.533 2.328 H3K27me3_Seedling_rep1_peak_87
1 977364 977711 348 15.790 3.253 2.356 1.950 H3K27me3_Seedling_rep1_peak_88
1 982726 983288 563 17.450 3.919 2.507 2.570 H3K27me3_Seedling_rep1_peak_89
1 985780 987965 2186 34.590 15.505 4.785 13.624 H3K27me3_Seedling_rep1_peak_90
1 988426 990864 2439 23.050 7.019 3.148 5.503 H3K27me3_Seedling_rep1_peak_91
1 1023437 1023606 170 11.470 2.746 2.284 1.482 H3K27me3_Seedling_rep1_peak_92
1 1082091 1084287 2197 22.400 7.322 3.421 5.771 H3K27me3_Seedling_rep1_peak_93
1 1104643 1105838 1196 21.400 6.636 3.243 5.123 H3K27me3_Seedling_rep1_peak_94
1 1120087 1120628 542 23.340 8.210 3.684 6.606 H3K27me3_Seedling_rep1_peak_95
1 1121243 1122454 1212 38.890 20.763 5.952 18.695 H3K27me3_Seedling_rep1_peak_96
1 1123687 1124637 951 13.700 3.740 2.644 2.391 H3K27me3_Seedling_rep1_peak_97
1 1136307 1138331 2025 24.380 10.647 4.243 8.954 H3K27me3_Seedling_rep1_peak_98
1 1159532 1159876 345 13.500 3.681 2.652 2.333 H3K27me3_Seedling_rep1_peak_99
1 1161494 1161933 440 19.790 5.554 2.979 4.092 H3K27me3_Seedling_rep1_peak_100

表 4.2 Call Peak结果。样本_peaks.tsv是一个表格文件,其中包含有关被调用峰的信息。您可以在excel/WPS中打开它并使用函数进行排序/过滤。各列信息为:
1.chr,染色体名称;
2.start,peak起始位置;
3.end,peak的结束位置;
4.length,peak长度;
5.abs_summit,峰顶的位置(absolute peak summit position),narrowPeak存在而broadPeak不存在;
6.pileup,峰顶上的堆积高度(pileup height at peak summit);
7.-log10(pvalue) for the peak(例如 pvalue =1e-10,那么这个值应该是 10);
8.fold_enrichment,该峰的富集倍数,与该位置λ的随机泊松分布相对应,peak文件中signalValue列等于该列;
9.-log10(qvalue) for the peak,peak文件中score列是该列数值x10。







5. 差异Peak分析

在存在多个分组且组内有生物学重复的情况下,可以对组间进行差异Peak(differential peak)分析,以确定哪些Peak在组间存在显著差异,同时获取组内共识峰(consensus peak)。如果没有差异分析则本节内容为空。

5.1 差异Peak分析结果

存在组内生物学重复时,我们使用软件DiffBind(version 3.10)(Stark,R., & Brown,G.,2012)对样本peaks进行分析。结果详见report/result/4.peak。后文中提到的“diff”代表组间差异,“cons”代表组内交集。
DiffPeak:“sampAvssampB_res.csv”为各组样本差异分析结果,sampA代表实验组,sampB代表对照组; DiffPeak_sampAvssampB_up.bed为sampA和sampB比较,结合强度上调的peak; DiffPeak_sampA vs sampB_down.bed为sampA和sampB比较,结合强度下调的peak; sampA_consensus_peaks.bed为A组组内共识峰。



图5.1 比较组PCA图。主成分分析是将原来较多维度的指标 (peak 的分布特征),降维到较低的维度(二维),来研究样品间的主成分关系。二维PCA分析结果中,会展示主成分1(PC1) 和主成分2(PC2)分别作为 X 轴和 Y 轴的散点图,每个点代表 1 个样本。坐标轴上百分比代表主成分的贡献率,贡献率越大,说明该主成分对样本差异的解释能力越强。如果两个样本距离越远,则说明样本 peaks 分布的差异越大。 反之,则说明相应样本peaks整体分布模式越接近。所以,PCA 分析常用于评估样本重复性的好坏。理想情况下,生物学重复的样本应该聚类在一起,而不同组间应该可以区分开。



图5.2 差异Peak火山图。横坐标为log2(Fold Change),纵坐标为-log10(FDR),蓝色为显著性下调的峰,红色为显著性上调的峰,灰色为非显著性差异的峰。







6. 基因组注释

为了进一步探讨peak结合位点特征,理解染色质开放区域对基因调控的机制, 使用R包ChIPseeker(version 1.36)(Wang et al., 2022)对Peak区域进行注释,我们统计Peak在各基因功能元件分布情况,并将各个peak与基因关联。本节结果请详见位于report/result/5.anno文件夹。

6.1 Peak 在基因组分布

图6.1 Peak在基因功能元件上分布饼图。
一般来说,peaks最多的区域是位于转录起始点(TSS)上游1kb的启动子区域“promoter(<=1kb)”,它与基因的表达调控密切相关;“promoter(1~2kb)”代表TSS上游1~2kb的启动子区域,“promoter(2~3kb)”代表TSS上游2~3kb的启动子区域。
5'非翻译区(5' UTR)和外显子区域(Exon)与mRNA的稳定性或基因表达的调控有关。
3'非翻译区(3' UTR)、内含子(Intron)、远端基因间区(Distal Intergenic)以及TSS下游区(Downstream),这些区域的调控活动可能涉及长距离的基因调控或影响基因的后续处理和表达。



图6.2 各样本Peak在基因功能元件上分布比例堆叠条状图,samples代表单个样本,cons代表组内共识峰,diff代表组间差异峰。



图6.3 各样本Peak在TSS(转录起始位点)侧翼分布比例堆叠条状图,samples代表单个样本,cons代表组内共识峰,diff代表组间差异峰,各元件内容含义见图 6.1。



6.2 Peak关联基因注释

各个样本Peak关联基因注释结果表部分内容如下,完整信息请查看report/result/5.anno/{样本名称}_PeakAnno.csv表格。{组名}_PeakAnno.csv代表组内共识峰注释结果,{比较组}_{up/down}_PeakAnno.csv代表组间差异峰注释结果。

显示前100行 (共10248行)
chr start end peaknum annotation geneChr geneStart geneEnd geneLength geneStrand geneId transcriptId distanceToTSS
1 9765 13394 H3K27me3_Seedling_rep1_peak_1 Promoter (<=1kb) 1 11101 11372 272 1 AT1G03987 AT1G03987.1 0
1 14209 14635 H3K27me3_Seedling_rep1_peak_2 Promoter (<=1kb) 1 11649 13714 2066 2 AT1G01030 AT1G01030.2 -496
1 16710 16982 H3K27me3_Seedling_rep1_peak_3 Distal Intergenic 1 11649 13714 2066 2 AT1G01030 AT1G01030.2 -2997
1 17880 19394 H3K27me3_Seedling_rep1_peak_4 Distal Intergenic 1 23121 31227 8107 1 AT1G01040 AT1G01040.1 -3727
1 40755 40987 H3K27me3_Seedling_rep1_peak_5 Promoter (<=1kb) 1 38752 40945 2194 2 AT1G01070 AT1G01070.2 0
1 43408 44564 H3K27me3_Seedling_rep1_peak_6 Promoter (<=1kb) 1 43087 43295 209 2 AT1G04003 AT1G04003.1 -114
1 57871 58647 H3K27me3_Seedling_rep1_peak_7 Promoter (<=1kb) 1 57164 59215 2052 2 AT1G01120 AT1G01120.1 568
1 78809 80660 H3K27me3_Seedling_rep1_peak_8 Promoter (<=1kb) 1 78927 79037 111 2 AT1G01183 at1g01183 0
1 83097 84837 H3K27me3_Seedling_rep1_peak_9 Promoter (<=1kb) 1 82984 84864 1881 2 AT1G01190 AT1G01190.1 27
1 86807 87131 H3K27me3_Seedling_rep1_peak_10 Promoter (1-2kb) 1 86486 88409 1924 2 AT1G01200 AT1G01200.1 1278
1 102644 103765 H3K27me3_Seedling_rep1_peak_11 Promoter (1-2kb) 1 104440 105330 891 2 AT1G01250 AT1G01250.1 1565
1 104635 105336 H3K27me3_Seedling_rep1_peak_12 Promoter (<=1kb) 1 104440 105330 891 2 AT1G01250 AT1G01250.1 0
1 113311 113844 H3K27me3_Seedling_rep1_peak_13 Promoter (<=1kb) 1 114202 116407 2206 1 AT1G01290 AT1G01290.1 -358
1 132411 132660 H3K27me3_Seedling_rep1_peak_14 Promoter (1-2kb) 1 130736 130858 123 1 AT1G01335 AT1G01335.1 1676
1 138443 139505 H3K27me3_Seedling_rep1_peak_15 Promoter (<=1kb) 1 138489 139671 1183 1 AT1G01355 AT1G01355.2 0
1 148422 148971 H3K27me3_Seedling_rep1_peak_16 Promoter (<=1kb) 1 148018 149806 1789 2 AT1G01390 AT1G01390.1 835
1 149404 149683 H3K27me3_Seedling_rep1_peak_17 Promoter (<=1kb) 1 148018 149806 1789 2 AT1G01390 AT1G01390.1 123
1 151175 152039 H3K27me3_Seedling_rep1_peak_18 Promoter (<=1kb) 1 150689 152210 1522 2 AT1G01400 AT1G01400.1 171
1 152863 154179 H3K27me3_Seedling_rep1_peak_19 Promoter (<=1kb) 1 153113 154198 1086 1 AT1G01410 AT1G01410.1 0
1 154943 155963 H3K27me3_Seedling_rep1_peak_20 Promoter (<=1kb) 1 154492 156011 1520 2 AT1G01420 AT1G01420.1 48
1 167050 167878 H3K27me3_Seedling_rep1_peak_21 Promoter (<=1kb) 1 166618 167842 1225 2 AT1G01453 AT1G01453.2 0
1 172405 172777 H3K27me3_Seedling_rep1_peak_22 Promoter (<=1kb) 1 171525 172948 1424 2 AT1G01470 AT1G01470.1 171
1 177319 177841 H3K27me3_Seedling_rep1_peak_23 Promoter (1-2kb) 1 176141 178400 2260 1 AT1G01480 AT1G01480.2 1179
1 209195 210541 H3K27me3_Seedling_rep1_peak_24 Promoter (<=1kb) 1 209208 213080 3873 1 AT1G01580 AT1G01580.1 0
1 211130 212086 H3K27me3_Seedling_rep1_peak_25 Promoter (1-2kb) 1 209208 213080 3873 1 AT1G01580 AT1G01580.1 1923
1 214154 214308 H3K27me3_Seedling_rep1_peak_26 Promoter (<=1kb) 1 214150 217734 3585 1 AT1G01590 AT1G01590.1 5
1 223984 224123 H3K27me3_Seedling_rep1_peak_27 Promoter (<=1kb) 1 221642 224351 2710 2 AT1G01610 AT1G01610.1 228
1 238460 238787 H3K27me3_Seedling_rep1_peak_28 Promoter (<=1kb) 1 232840 237905 5066 2 AT1G01650 AT1G01650.1 -556
1 244768 245995 H3K27me3_Seedling_rep1_peak_29 Promoter (<=1kb) 1 242837 245676 2840 2 AT1G01670 AT1G01670.2 0
1 246765 246954 H3K27me3_Seedling_rep1_peak_30 Promoter (<=1kb) 1 242713 246054 3342 2 AT1G01670 AT1G01670.1 -712
1 247394 247621 H3K27me3_Seedling_rep1_peak_31 Promoter (<=1kb) 1 246262 248440 2179 2 AT1G01680 AT1G01680.1 819
1 257044 257509 H3K27me3_Seedling_rep1_peak_32 Promoter (1-2kb) 1 258717 258963 247 2 AT1G04037 AT1G04037.1 1454
1 258065 258199 H3K27me3_Seedling_rep1_peak_33 Promoter (<=1kb) 1 258717 258963 247 2 AT1G04037 AT1G04037.1 764
1 310483 310917 H3K27me3_Seedling_rep1_peak_34 Promoter (<=1kb) 1 310169 313130 2962 1 AT1G01900 AT1G01900.1 315
1 311505 312565 H3K27me3_Seedling_rep1_peak_35 Promoter (<=1kb) 1 311828 312884 1057 2 AT1G04077 AT1G04077.1 319
1 341230 341877 H3K27me3_Seedling_rep1_peak_36 Promoter (<=1kb) 1 340108 342040 1933 2 AT1G01980 AT1G01980.1 163
1 358125 359557 H3K27me3_Seedling_rep1_peak_37 Promoter (<=1kb) 1 357906 359078 1173 2 AT1G02040 AT1G02040.1 0
1 365184 367640 H3K27me3_Seedling_rep1_peak_38 Promoter (<=1kb) 1 365209 367443 2235 1 AT1G02065 AT1G02065.1 0
1 370184 372367 H3K27me3_Seedling_rep1_peak_39 Promoter (<=1kb) 1 370195 371090 896 2 AT1G02070 AT1G02070.1 0
1 415116 418254 H3K27me3_Seedling_rep1_peak_40 Promoter (<=1kb) 1 415079 418010 2932 1 AT1G02190 AT1G02190.2 38
1 418841 419953 H3K27me3_Seedling_rep1_peak_41 Promoter (<=1kb) 1 418767 422437 3671 1 AT1G02205 AT1G02205.5 75
1 420694 422243 H3K27me3_Seedling_rep1_peak_42 Promoter (1-2kb) 1 418767 422437 3671 1 AT1G02205 AT1G02205.5 1928
1 460331 460802 H3K27me3_Seedling_rep1_peak_43 Promoter (<=1kb) 1 458133 460696 2564 2 AT1G02310 AT1G02310.1 0
1 464018 467888 H3K27me3_Seedling_rep1_peak_44 Promoter (<=1kb) 1 463754 464981 1228 2 AT1G02335 AT1G02335.1 0
1 472751 473105 H3K27me3_Seedling_rep1_peak_45 Promoter (<=1kb) 1 471883 473160 1278 2 AT1G02360 AT1G02360.1 55
1 479507 483108 H3K27me3_Seedling_rep1_peak_46 Promoter (<=1kb) 1 480650 483373 2724 1 AT1G02390 AT1G02390.1 0
1 486919 487361 H3K27me3_Seedling_rep1_peak_47 Promoter (<=1kb) 1 486801 489634 2834 1 AT1G02400 AT1G02400.2 119
1 489183 490517 H3K27me3_Seedling_rep1_peak_48 Promoter (<=1kb) 1 489874 490627 754 2 AT1G02405 AT1G02405.1 110
1 498068 498446 H3K27me3_Seedling_rep1_peak_49 Promoter (<=1kb) 1 497917 498539 623 2 AT1G02450 AT1G02450.1 93
1 504885 505315 H3K27me3_Seedling_rep1_peak_50 Promoter (1-2kb) 1 504463 507268 2806 2 AT1G02460 AT1G02460.1 1953
1 506447 507139 H3K27me3_Seedling_rep1_peak_51 Promoter (<=1kb) 1 504463 507268 2806 2 AT1G02460 AT1G02460.1 129
1 535202 535457 H3K27me3_Seedling_rep1_peak_52 Promoter (<=1kb) 1 534827 535699 873 2 AT1G02540 AT1G02540.1 242
1 536607 537164 H3K27me3_Seedling_rep1_peak_53 Promoter (<=1kb) 1 536483 537211 729 1 AT1G02550 AT1G02550.1 125
1 543281 543832 H3K27me3_Seedling_rep1_peak_54 Promoter (<=1kb) 1 543011 544057 1047 1 AT1G02575 AT1G02575.1 271
1 545608 551586 H3K27me3_Seedling_rep1_peak_55 Promoter (<=1kb) 1 551289 551529 241 2 AT1G04127 AT1G04127.1 0
1 557322 558696 H3K27me3_Seedling_rep1_peak_56 Promoter (<=1kb) 1 558071 558492 422 2 AT1G04133 AT1G04133.1 0
1 582411 583843 H3K27me3_Seedling_rep1_peak_57 Promoter (<=1kb) 1 583471 583860 390 2 AT1G04143 AT1G04143.1 17
1 605298 606624 H3K27me3_Seedling_rep1_peak_58 Promoter (<=1kb) 1 604873 606708 1836 2 AT1G02770 AT1G02770.1 84
1 610771 612373 H3K27me3_Seedling_rep1_peak_59 Promoter (<=1kb) 1 610363 612324 1962 2 AT1G02790 AT1G02790.1 0
1 618377 620017 H3K27me3_Seedling_rep1_peak_60 Promoter (<=1kb) 1 618061 620502 2442 1 AT1G02810 AT1G02810.1 317
1 620723 621144 H3K27me3_Seedling_rep1_peak_61 Promoter (<=1kb) 1 620678 621329 652 1 AT1G02813 AT1G02813.1 46
1 663139 663936 H3K27me3_Seedling_rep1_peak_62 Promoter (<=1kb) 1 663021 664128 1108 2 AT1G02940 AT1G02940.1 192
1 665394 666228 H3K27me3_Seedling_rep1_peak_63 Promoter (<=1kb) 1 665881 666136 256 1 AT1G02952 AT1G02952.2 0
1 677412 678786 H3K27me3_Seedling_rep1_peak_64 Promoter (<=1kb) 1 677869 681378 3510 1 AT1G02980 AT1G02980.2 0
1 698257 698532 H3K27me3_Seedling_rep1_peak_65 Promoter (<=1kb) 1 697919 698591 673 2 AT1G03020 AT1G03020.1 59
1 699088 700577 H3K27me3_Seedling_rep1_peak_66 Promoter (<=1kb) 1 697919 698591 673 2 AT1G03020 AT1G03020.1 -498
1 706290 706433 H3K27me3_Seedling_rep1_peak_67 Promoter (<=1kb) 1 703906 706630 2725 2 AT1G03040 AT1G03040.2 197
1 747270 747501 H3K27me3_Seedling_rep1_peak_68 Promoter (<=1kb) 1 747197 748057 861 1 AT1G03103 AT1G03103.1 74
1 748304 748972 H3K27me3_Seedling_rep1_peak_69 Promoter (<=1kb) 1 748101 749034 934 2 AT1G03106 AT1G03106.1 62
1 769891 770347 H3K27me3_Seedling_rep1_peak_70 Promoter (1-2kb) 1 768376 770684 2309 1 AT1G03170 AT1G03170.1 1516
1 772033 772773 H3K27me3_Seedling_rep1_peak_71 Promoter (<=1kb) 1 773358 775362 2005 1 AT1G03180 AT1G03180.1 -585
1 790270 790847 H3K27me3_Seedling_rep1_peak_72 Promoter (<=1kb) 1 789820 791689 1870 1 AT1G03230 AT1G03230.1 451
1 819879 820458 H3K27me3_Seedling_rep1_peak_73 Promoter (<=1kb) 1 819612 821476 1865 1 AT1G03340 AT1G03340.1 268
1 820867 821083 H3K27me3_Seedling_rep1_peak_74 Promoter (1-2kb) 1 819612 821476 1865 1 AT1G03340 AT1G03340.1 1256
1 843378 843960 H3K27me3_Seedling_rep1_peak_75 Promoter (<=1kb) 1 842712 844237 1526 2 AT1G03400 AT1G03400.2 277
1 856102 856345 H3K27me3_Seedling_rep1_peak_76 Exon (AT1G03445.1/AT1G03445, exon 15 of 22) 1 854410 859701 5292 2 AT1G03445 AT1G03445.1 3356
1 857315 859694 H3K27me3_Seedling_rep1_peak_77 Promoter (<=1kb) 1 854410 859701 5292 2 AT1G03445 AT1G03445.1 7
1 871721 872926 H3K27me3_Seedling_rep1_peak_78 Promoter (<=1kb) 1 871874 872906 1033 1 AT1G03490 AT1G03490.1 0
1 899203 899399 H3K27me3_Seedling_rep1_peak_79 Promoter (<=1kb) 1 898480 899794 1315 1 AT1G03600 AT1G03600.1 724
1 922615 929704 H3K27me3_Seedling_rep1_peak_80 Promoter (<=1kb) 1 923390 924730 1341 1 AT1G03710 AT1G03710.1 0
1 952323 956502 H3K27me3_Seedling_rep1_peak_81 Promoter (<=1kb) 1 954290 956144 1855 1 AT1G03790 AT1G03790.1 0
1 957238 957975 H3K27me3_Seedling_rep1_peak_82 Promoter (<=1kb) 1 957104 958383 1280 2 AT1G03800 AT1G03800.1 408
1 958650 959643 H3K27me3_Seedling_rep1_peak_83 Promoter (<=1kb) 1 958759 959609 851 1 AT1G03810 AT1G03810.1 0
1 960051 960568 H3K27me3_Seedling_rep1_peak_84 Promoter (<=1kb) 1 959790 960916 1127 2 AT1G03820 AT1G03820.1 348
1 967742 970870 H3K27me3_Seedling_rep1_peak_85 Promoter (<=1kb) 1 967470 970334 2865 2 AT1G03840 AT1G03840.1 0
1 971648 972885 H3K27me3_Seedling_rep1_peak_86 Promoter (1-2kb) 1 967470 970334 2865 2 AT1G03840 AT1G03840.1 -1315
1 974059 974196 H3K27me3_Seedling_rep1_peak_87 Distal Intergenic 1 975880 977761 1882 2 AT1G03850 AT1G03850.1 3565
1 977363 977711 H3K27me3_Seedling_rep1_peak_88 Promoter (<=1kb) 1 975880 977761 1882 2 AT1G03850 AT1G03850.1 50
1 982725 983288 H3K27me3_Seedling_rep1_peak_89 Promoter (<=1kb) 1 982381 983544 1164 2 AT1G03870 AT1G03870.1 256
1 985779 987965 H3K27me3_Seedling_rep1_peak_90 Promoter (<=1kb) 1 985751 988146 2396 1 AT1G03880 AT1G03880.1 29
1 988425 990864 H3K27me3_Seedling_rep1_peak_91 Promoter (<=1kb) 1 989198 991020 1823 1 AT1G03890 AT1G03890.1 0
1 1023436 1023606 H3K27me3_Seedling_rep1_peak_92 Promoter (<=1kb) 1 1022251 1023719 1469 2 AT1G03982 AT1G03982.1 113
1 1082090 1084287 H3K27me3_Seedling_rep1_peak_93 Promoter (<=1kb) 1 1081024 1084431 3408 2 AT1G04150 AT1G04150.1 144
1 1104642 1105838 H3K27me3_Seedling_rep1_peak_94 Promoter (<=1kb) 1 1104493 1106241 1749 1 AT1G04180 AT1G04180.1 150
1 1120086 1120628 H3K27me3_Seedling_rep1_peak_95 Promoter (<=1kb) 1 1120225 1122526 2302 1 AT1G04247 AT1G04247.1 0
1 1121242 1122454 H3K27me3_Seedling_rep1_peak_96 Promoter (<=1kb) 1 1119699 1122786 3088 2 AT1G04220 AT1G04220.1 332
1 1123686 1124637 H3K27me3_Seedling_rep1_peak_97 Promoter (<=1kb) 1 1125515 1128788 3274 1 AT1G04230 AT1G04230.1 -878
1 1136306 1138331 H3K27me3_Seedling_rep1_peak_98 Promoter (<=1kb) 1 1136078 1138663 2586 1 AT1G04250 AT1G04250.1 229
1 1159531 1159876 H3K27me3_Seedling_rep1_peak_99 Promoter (<=1kb) 1 1159023 1159093 71 1 AT1G04320 AT1G04320.1 509
1 1161493 1161933 H3K27me3_Seedling_rep1_peak_100 Promoter (<=1kb) 1 1161289 1161982 694 2 AT1G04330 AT1G04330.1 49

上表第一列到第三列为peak在基因组位置;第五列annotation为peak的基因组功能元件身份;第六列到第十列为关联基因的位置信息; 第十一列geneId为基因ID;第十二列transcriptId为转录本ID;第十三列distanceToTSS为peak到TSS距离。







7. 基因富集分析

我们使用clusterprofiler(version 4.14.6)(Wu T. et al., 2021)进行GO和KEGG通路富集分析。富集分析结果表格未使用阈值过滤,您可以在表格中查看所有可能富集的通路。

7.1 GO富集分析

GO (Gene Ontology, http://www.geneontology.org) 是基因本体论联合会建立的将全世界所有与基因有关的研究结果进行分类汇总的综合数据库。该数据库标准化了不同数据库中关于基因和基因产物的生物学术语,适用于各物种,对基因和蛋白功能进行限定和描述。利用GO 数据库,可以对peak峰相关基因进行富集分析,可以找到不同条件下的peak峰相关基因按照其参与的BP(Biological Process, 生物过程)、MF(Molecular Function, 分子功能) 及CC(Cellular Component, 细胞组分) 三个方面进行分类注释。GO 注释有助于理解基因背后所代表的生物学意义。GO功能显著性富集分析给出与基因组背景相比,在相关基因中显著富集的GO功能条目,从而给出与peak峰相关基因与哪些生物学功能显著相关。该分析首先把所有相关向Gene Ontology数据库的各个term映射,计算每个term的基因数目,然后应用超几何检验,找出与整个基因组背景相比,在与peak峰相关基因中显著富集的GO条目。

下面展示peak关联的基因富集GO富集分析部分结果,完整结果请见/result/6.gokegg。GO富集分析完整结果请详见位于report/result/6.gokegg文件夹的*_GO_res.csv表格文件。

显示前100行 (共3086行)
ONTOLOGY ID Description GeneRatio BgRatio RichFactor FoldEnrichment zScore pvalue p.adjust qvalue geneID Count
BP GO:0044550 secondary metabolite biosynthetic process 190/6662 399/25557 0.476 1.827 9.884 0.000 0.000 0.000 AT1G04220/AT1G04330/AT1G07640/AT1G08340/AT1G10460/AT1G12130/AT1G12160/AT1G16400/AT1G16410/AT1G16490/AT1G16500/AT1G18140/AT1G18570/AT1G20490/AT1G21310/AT1G22220/AT1G22480/AT1G23330/AT1G30700/AT1G32100/... 190
BP GO:0045490 pectin catabolic process 65/6662 98/25557 0.663 2.544 9.096 0.000 0.000 0.000 AT1G02810/AT1G05310/AT1G11370/AT1G11580/AT1G11590/AT1G11920/AT1G14420/AT1G23200/AT1G30350/AT1G53830/AT1G67750/AT1G69940/AT2G19150/AT2G21610/AT2G26440/AT2G26450/AT2G41850/AT2G43050/AT2G45220/AT2G47030/... 65
BP GO:0016102 diterpenoid biosynthetic process 50/6662 69/25557 0.725 2.780 8.791 0.000 0.000 0.000 AT1G02400/AT1G15550/AT1G30040/AT1G44090/AT1G47990/AT1G48800/AT1G48820/AT1G50960/AT1G60980/AT1G61120/AT1G63030/AT1G66020/AT1G68320/AT1G70080/AT1G80330/AT1G80340/AT2G23230/AT2G24210/AT2G36690/AT2G46590/... 50
BP GO:0016101 diterpenoid metabolic process 53/6662 76/25557 0.697 2.675 8.685 0.000 0.000 0.000 AT1G02400/AT1G15550/AT1G30040/AT1G36060/AT1G44090/AT1G47990/AT1G48800/AT1G48820/AT1G50960/AT1G60980/AT1G61120/AT1G63030/AT1G66020/AT1G68320/AT1G70080/AT1G80330/AT1G80340/AT2G23230/AT2G24210/AT2G36690/... 53
BP GO:0009698 phenylpropanoid metabolic process 79/6662 146/25557 0.541 2.076 7.740 0.000 0.000 0.000 AT1G04220/AT1G05250/AT1G08340/AT1G16490/AT1G18400/AT1G31720/AT1G32100/AT1G32770/AT1G51680/AT1G61590/AT1G61810/AT1G61820/AT1G63910/AT1G64160/AT1G64670/AT1G65060/AT1G66270/AT1G70500/AT1G71030/AT1G80820/... 79
BP GO:0042545 cell wall modification 91/6662 179/25557 0.508 1.950 7.576 0.000 0.000 0.000 AT1G02810/AT1G05310/AT1G07000/AT1G11370/AT1G11580/AT1G11590/AT1G12560/AT1G18570/AT1G19300/AT1G19900/AT1G23200/AT1G48100/AT1G53830/AT1G61110/AT1G65680/AT1G69940/AT1G70370/AT1G70710/AT2G18800/AT2G19150/... 91
BP GO:0009699 phenylpropanoid biosynthetic process 65/6662 114/25557 0.570 2.187 7.544 0.000 0.000 0.000 AT1G04220/AT1G08340/AT1G16490/AT1G32100/AT1G32770/AT1G61590/AT1G61810/AT1G61820/AT1G63910/AT1G64160/AT1G64670/AT1G70500/AT1G71030/AT1G80820/AT2G05440/AT2G21730/AT2G21890/AT2G28670/AT2G37040/AT2G38080/... 65
BP GO:0000272 polysaccharide catabolic process 74/6662 143/25557 0.517 1.985 7.015 0.000 0.000 0.000 AT1G02810/AT1G05310/AT1G11370/AT1G11580/AT1G11590/AT1G11920/AT1G14420/AT1G23200/AT1G30350/AT1G53830/AT1G67750/AT1G69940/AT2G19150/AT2G21610/AT2G26440/AT2G26450/AT2G41850/AT2G43050/AT2G45220/AT2G47030/... 74
BP GO:0016114 terpenoid biosynthetic process 92/6662 198/25557 0.465 1.782 6.563 0.000 0.000 0.000 AT1G02400/AT1G15550/AT1G30040/AT1G30100/AT1G36060/AT1G44090/AT1G47990/AT1G48800/AT1G48820/AT1G49770/AT1G50960/AT1G60270/AT1G60980/AT1G61120/AT1G61680/AT1G63030/AT1G64670/AT1G65690/AT1G66020/AT1G66960/... 92
BP GO:2000652 regulation of secondary cell wall biogenesis 39/6662 61/25557 0.639 2.453 6.745 0.000 0.000 0.000 AT1G08340/AT1G10460/AT1G12260/AT1G16490/AT1G17950/AT1G20490/AT1G22480/AT1G28470/AT1G32640/AT1G43790/AT1G62981/AT1G62990/AT1G63910/AT1G73410/AT2G18060/AT2G28870/AT2G46760/AT3G08500/AT3G15050/AT3G22415/... 39
BP GO:0045488 pectin metabolic process 84/6662 177/25557 0.475 1.821 6.505 0.000 0.000 0.000 AT1G02810/AT1G05310/AT1G11370/AT1G11580/AT1G11590/AT1G11920/AT1G13250/AT1G14420/AT1G18580/AT1G19300/AT1G23200/AT1G28290/AT1G30350/AT1G44830/AT1G53830/AT1G67750/AT1G69940/AT2G19150/AT2G21610/AT2G23130/... 84
BP GO:0010393 galacturonan metabolic process 84/6662 178/25557 0.472 1.810 6.442 0.000 0.000 0.000 AT1G02810/AT1G05310/AT1G11370/AT1G11580/AT1G11590/AT1G11920/AT1G13250/AT1G14420/AT1G18580/AT1G19300/AT1G23200/AT1G28290/AT1G30350/AT1G44830/AT1G53830/AT1G67750/AT1G69940/AT2G19150/AT2G21610/AT2G23130/... 84
BP GO:0006721 terpenoid metabolic process 120/6662 282/25557 0.426 1.632 6.341 0.000 0.000 0.000 AT1G01390/AT1G02400/AT1G15550/AT1G28660/AT1G30040/AT1G30100/AT1G33100/AT1G36060/AT1G44090/AT1G47990/AT1G48800/AT1G48820/AT1G49770/AT1G50960/AT1G52400/AT1G60270/AT1G60980/AT1G61120/AT1G61680/AT1G63030/... 120
BP GO:0016143 S-glycoside metabolic process 115/6662 268/25557 0.429 1.646 6.314 0.000 0.000 0.000 AT1G01420/AT1G07640/AT1G09176/AT1G12130/AT1G12160/AT1G12630/AT1G16400/AT1G16410/AT1G18140/AT1G18570/AT1G21100/AT1G21120/AT1G22220/AT1G25510/AT1G28610/AT1G32640/AT1G41830/AT1G45201/AT1G47600/AT1G47870/... 115
BP GO:0019760 glucosinolate metabolic process 115/6662 268/25557 0.429 1.646 6.314 0.000 0.000 0.000 AT1G01420/AT1G07640/AT1G09176/AT1G12130/AT1G12160/AT1G12630/AT1G16400/AT1G16410/AT1G18140/AT1G18570/AT1G21100/AT1G21120/AT1G22220/AT1G25510/AT1G28610/AT1G32640/AT1G41830/AT1G45201/AT1G47600/AT1G47870/... 115
BP GO:0016144 S-glycoside biosynthetic process 60/6662 118/25557 0.508 1.951 6.146 0.000 0.000 0.000 AT1G07640/AT1G12130/AT1G12160/AT1G16400/AT1G16410/AT1G18140/AT1G18570/AT1G32640/AT1G41830/AT1G45201/AT1G52000/AT1G52100/AT1G55210/AT1G60270/AT1G62540/AT1G62560/AT1G62570/AT1G64390/AT1G65860/AT1G65880/... 60
BP GO:0019761 glucosinolate biosynthetic process 60/6662 118/25557 0.508 1.951 6.146 0.000 0.000 0.000 AT1G07640/AT1G12130/AT1G12160/AT1G16400/AT1G16410/AT1G18140/AT1G18570/AT1G32640/AT1G41830/AT1G45201/AT1G52000/AT1G52100/AT1G55210/AT1G60270/AT1G62540/AT1G62560/AT1G62570/AT1G64390/AT1G65860/AT1G65880/... 60
BP GO:0042214 terpene metabolic process 33/6662 51/25557 0.647 2.482 6.292 0.000 0.000 0.000 AT1G30100/AT1G48800/AT1G48820/AT1G61680/AT1G66020/AT1G70080/AT1G78390/AT2G23230/AT2G44990/AT3G14440/AT3G14490/AT3G14520/AT3G24220/AT3G25180/AT3G25810/AT3G29110/AT3G29190/AT3G29410/AT3G32030/AT4G13280/... 33
BP GO:0045165 cell fate commitment 55/6662 105/25557 0.524 2.009 6.155 0.000 0.000 0.000 AT1G06225/AT1G08465/AT1G14600/AT1G24260/AT1G25425/AT1G26600/AT1G32240/AT1G47485/AT1G49005/AT1G63245/AT1G68480/AT1G68795/AT1G69180/AT1G69320/AT1G69970/AT1G70895/AT1G73965/AT1G79580/AT1G79840/AT2G02060/... 55
BP GO:0071555 cell wall organization 130/6662 323/25557 0.402 1.544 5.842 0.000 0.000 0.000 AT1G02810/AT1G05230/AT1G05250/AT1G05310/AT1G07000/AT1G09550/AT1G09790/AT1G11370/AT1G11580/AT1G11590/AT1G12560/AT1G14420/AT1G18570/AT1G18580/AT1G19300/AT1G19900/AT1G21310/AT1G23200/AT1G23720/AT1G26240/... 130
BP GO:0006722 triterpenoid metabolic process 18/6662 21/25557 0.857 3.288 6.229 0.000 0.000 0.000 AT1G66960/AT1G78500/AT1G78950/AT1G78955/AT1G78960/AT1G78970/AT3G29255/AT3G45130/AT4G15340/AT4G15370/AT5G36110/AT5G36140/AT5G36150/AT5G42590/AT5G42600/AT5G47990/AT5G48000/AT5G48010 18
BP GO:0010022 meristem determinacy 20/6662 25/25557 0.800 3.069 6.146 0.000 0.000 0.000 AT1G26310/AT1G62360/AT1G65480/AT1G69180/AT2G22540/AT3G28917/AT3G30260/AT3G55510/AT3G57130/AT4G17460/AT4G20370/AT4G24540/AT4G28190/AT5G03790/AT5G03840/AT5G11320/AT5G11530/AT5G14010/AT5G18560/AT5G61850 20
BP GO:0051761 sesquiterpene metabolic process 20/6662 25/25557 0.800 3.069 6.146 0.000 0.000 0.000 AT1G48800/AT1G48820/AT1G66020/AT1G70080/AT2G23230/AT3G14490/AT3G14520/AT3G29110/AT3G29190/AT3G29410/AT3G32030/AT4G13280/AT4G13300/AT4G15870/AT4G20200/AT4G20210/AT5G23960/AT5G44620/AT5G44630/AT5G48110 20
BP GO:1903338 regulation of cell wall organization or biogenesis 41/6662 72/25557 0.569 2.185 5.976 0.000 0.000 0.000 AT1G08340/AT1G10460/AT1G12260/AT1G16490/AT1G17950/AT1G20490/AT1G22480/AT1G28470/AT1G32640/AT1G35780/AT1G43650/AT1G43790/AT1G62981/AT1G62990/AT1G63910/AT1G73410/AT2G18060/AT2G28870/AT2G46760/AT3G08500/... 41
BP GO:1901657 glycosyl compound metabolic process 125/6662 311/25557 0.402 1.542 5.709 0.000 0.000 0.000 AT1G01420/AT1G07640/AT1G09176/AT1G12130/AT1G12160/AT1G12630/AT1G16400/AT1G16410/AT1G18140/AT1G18570/AT1G21100/AT1G21120/AT1G22220/AT1G25510/AT1G28610/AT1G32640/AT1G41830/AT1G45201/AT1G47600/AT1G47870/... 125
BP GO:0048437 floral organ development 118/6662 290/25557 0.407 1.561 5.705 0.000 0.000 0.000 AT1G01030/AT1G02065/AT1G05230/AT1G08320/AT1G11190/AT1G12064/AT1G13400/AT1G13710/AT1G17420/AT1G24260/AT1G31140/AT1G32240/AT1G52150/AT1G52880/AT1G62360/AT1G64405/AT1G65620/AT1G68130/AT1G68480/AT1G69180/... 118
BP GO:0120252 hydrocarbon metabolic process 47/6662 88/25557 0.534 2.049 5.853 0.000 0.000 0.000 AT1G01480/AT1G02205/AT1G12010/AT1G30100/AT1G48800/AT1G48820/AT1G61680/AT1G62380/AT1G66020/AT1G70080/AT1G78390/AT2G22810/AT2G23230/AT2G44990/AT3G04070/AT3G14440/AT3G14490/AT3G14520/AT3G15010/AT3G24220/... 47
BP GO:0045229 external encapsulating structure organization 149/6662 387/25557 0.385 1.477 5.614 0.000 0.000 0.000 AT1G02810/AT1G05230/AT1G05250/AT1G05310/AT1G07000/AT1G09550/AT1G09790/AT1G11370/AT1G11580/AT1G11590/AT1G12560/AT1G14420/AT1G18570/AT1G18580/AT1G19300/AT1G19900/AT1G21310/AT1G23200/AT1G23720/AT1G24140/... 149
BP GO:0042446 hormone biosynthetic process 41/6662 73/25557 0.562 2.155 5.866 0.000 0.000 0.000 AT1G04180/AT1G04610/AT1G21430/AT1G23320/AT1G25330/AT1G25410/AT1G28300/AT1G48910/AT1G62360/AT1G67110/AT1G68130/AT1G68460/AT1G70560/AT2G01940/AT2G22330/AT2G35990/AT2G37210/AT2G45400/AT3G19160/AT3G23630/... 41
BP GO:0009733 response to auxin 168/6662 448/25557 0.375 1.439 5.561 0.000 0.000 0.000 AT1G04250/AT1G13670/AT1G15580/AT1G17345/AT1G18520/AT1G19830/AT1G19840/AT1G19850/AT1G23080/AT1G24590/AT1G25400/AT1G26870/AT1G27740/AT1G28130/AT1G30330/AT1G31040/AT1G31710/AT1G34670/AT1G47485/AT1G47510/... 168
BP GO:1901659 glycosyl compound biosynthetic process 62/6662 129/25557 0.481 1.844 5.705 0.000 0.000 0.000 AT1G07640/AT1G12130/AT1G12160/AT1G16400/AT1G16410/AT1G18140/AT1G18570/AT1G32640/AT1G41830/AT1G45201/AT1G52000/AT1G52100/AT1G55210/AT1G60270/AT1G62540/AT1G62560/AT1G62570/AT1G64390/AT1G65860/AT1G65880/... 62
BP GO:0051762 sesquiterpene biosynthetic process 19/6662 24/25557 0.792 3.037 5.928 0.000 0.000 0.000 AT1G48800/AT1G48820/AT1G66020/AT1G70080/AT2G23230/AT3G14490/AT3G14520/AT3G29110/AT3G29190/AT3G29410/AT3G32030/AT4G13280/AT4G13300/AT4G15870/AT4G20200/AT4G20210/AT5G23960/AT5G44630/AT5G48110 19
BP GO:0048646 anatomical structure formation involved in morphogenesis 121/6662 303/25557 0.399 1.532 5.531 0.000 0.000 0.000 AT1G05370/AT1G11090/AT1G13370/AT1G13710/AT1G21320/AT1G21470/AT1G24260/AT1G26780/AT1G26920/AT1G30210/AT1G49430/AT1G55020/AT1G55580/AT1G58340/AT1G62500/AT1G62940/AT1G65060/AT1G66110/AT1G67260/AT1G67510/... 121
BP GO:0015698 inorganic anion transport 58/6662 120/25557 0.483 1.854 5.569 0.000 0.000 0.000 AT1G08090/AT1G08100/AT1G12110/AT1G14040/AT1G16950/AT1G20823/AT1G22540/AT1G29290/AT1G31670/AT1G32450/AT1G47485/AT1G49800/AT1G51790/AT1G51880/AT1G52190/AT1G60050/AT1G62280/AT1G68740/AT1G69870/AT1G72130/... 58
BP GO:0045168 cell-cell signaling involved in cell fate commitment 28/6662 44/25557 0.636 2.441 5.681 0.000 0.000 0.000 AT1G06225/AT1G25425/AT1G26600/AT1G47485/AT1G49005/AT1G63245/AT1G68795/AT1G69320/AT1G69970/AT1G70895/AT1G73965/AT2G27250/AT2G30420/AT2G31081/AT2G31082/AT2G31083/AT2G31085/AT2G34925/AT3G24225/AT3G24770/... 28
BP GO:0008299 isoprenoid biosynthetic process 98/6662 237/25557 0.414 1.586 5.384 0.000 0.000 0.000 AT1G02400/AT1G15550/AT1G30040/AT1G30100/AT1G36060/AT1G44090/AT1G47990/AT1G48800/AT1G48820/AT1G49770/AT1G50960/AT1G54000/AT1G60270/AT1G60980/AT1G61120/AT1G61680/AT1G63030/AT1G64670/AT1G65690/AT1G66020/... 98
BP GO:0009808 lignin metabolic process 39/6662 71/25557 0.549 2.107 5.547 0.000 0.000 0.000 AT1G05250/AT1G08340/AT1G16490/AT1G31720/AT1G32770/AT1G61590/AT1G61810/AT1G61820/AT1G63910/AT1G80820/AT2G21730/AT2G21890/AT2G28670/AT2G37040/AT2G38080/AT2G41480/AT3G15050/AT3G16920/AT3G50740/AT3G55580/... 39
BP GO:0010089 xylem development 32/6662 54/25557 0.593 2.273 5.562 0.000 0.000 0.000 AT1G12260/AT1G62990/AT1G71930/AT1G79430/AT1G80100/AT2G13820/AT2G18060/AT2G45420/AT3G21710/AT3G24770/AT3G46890/AT3G56270/AT3G61920/AT4G00220/AT4G08150/AT4G08160/AT4G13195/AT4G14940/AT4G21780/AT4G32880/... 32
BP GO:0009685 gibberellin metabolic process 26/6662 40/25557 0.650 2.494 5.613 0.000 0.000 0.000 AT1G02400/AT1G15550/AT1G30040/AT1G36060/AT1G44090/AT1G47990/AT1G50960/AT1G60980/AT1G63030/AT1G68320/AT1G80330/AT1G80340/AT2G36690/AT2G46590/AT3G26790/AT4G02780/AT4G21200/AT4G21690/AT4G25420/AT5G07200/... 26
BP GO:0009834 plant-type secondary cell wall biogenesis 61/6662 131/25557 0.466 1.786 5.358 0.000 0.000 0.000 AT1G08340/AT1G10460/AT1G12260/AT1G16490/AT1G17950/AT1G20490/AT1G22480/AT1G28470/AT1G32640/AT1G32770/AT1G33280/AT1G35780/AT1G43790/AT1G55210/AT1G62981/AT1G62990/AT1G63910/AT1G70500/AT1G73410/AT1G75500/... 61
BP GO:0042445 hormone metabolic process 66/6662 147/25557 0.449 1.722 5.216 0.000 0.000 0.000 AT1G04180/AT1G04610/AT1G17060/AT1G19630/AT1G21430/AT1G23320/AT1G25330/AT1G25410/AT1G28300/AT1G44350/AT1G48410/AT1G48910/AT1G51760/AT1G58460/AT1G60050/AT1G62360/AT1G67110/AT1G68130/AT1G68460/AT1G70560/... 66
BP GO:0006720 isoprenoid metabolic process 128/6662 336/25557 0.381 1.461 5.055 0.000 0.000 0.000 AT1G01390/AT1G02400/AT1G15550/AT1G28660/AT1G30040/AT1G30100/AT1G33100/AT1G36060/AT1G44090/AT1G47990/AT1G48800/AT1G48820/AT1G49770/AT1G50960/AT1G52400/AT1G54000/AT1G60270/AT1G60980/AT1G61120/AT1G61680/... 128
BP GO:1905393 plant organ formation 68/6662 154/25557 0.442 1.694 5.129 0.000 0.000 0.000 AT1G13710/AT1G21470/AT1G24260/AT1G26920/AT1G30210/AT1G49430/AT1G55020/AT1G55580/AT1G58340/AT1G62500/AT1G67260/AT1G68480/AT1G68800/AT1G70510/AT1G70895/AT1G73590/AT1G73870/AT1G77690/AT2G17500/AT2G17950/... 68
BP GO:0009686 gibberellin biosynthetic process 23/6662 35/25557 0.657 2.521 5.346 0.000 0.000 0.000 AT1G02400/AT1G15550/AT1G30040/AT1G44090/AT1G47990/AT1G50960/AT1G60980/AT1G63030/AT1G68320/AT1G80330/AT1G80340/AT2G36690/AT2G46590/AT3G26790/AT4G02780/AT4G21200/AT4G21690/AT4G25420/AT5G07200/AT5G51310/... 23
BP GO:0090698 post-embryonic plant morphogenesis 90/6662 220/25557 0.409 1.569 5.036 0.000 0.000 0.000 AT1G01040/AT1G07090/AT1G13290/AT1G16910/AT1G18580/AT1G23320/AT1G24260/AT1G31140/AT1G49430/AT1G49770/AT1G52880/AT1G55020/AT1G63700/AT1G68130/AT1G68480/AT1G69780/AT1G70510/AT1G70560/AT1G70895/AT1G72490/... 90
BP GO:1902025 nitrate import 24/6662 38/25557 0.632 2.423 5.212 0.000 0.000 0.000 AT1G08090/AT1G08100/AT1G12110/AT1G16950/AT1G22540/AT1G29290/AT1G31670/AT1G32450/AT1G47485/AT1G49800/AT1G52190/AT1G69870/AT1G72130/AT1G72140/AT2G23440/AT2G35612/AT2G38100/AT2G40530/AT3G45060/AT5G49660/... 24
BP GO:0010582 floral meristem determinacy 16/6662 21/25557 0.762 2.923 5.234 0.000 0.000 0.000 AT1G26310/AT1G62360/AT1G69180/AT2G22540/AT3G28917/AT3G30260/AT3G55510/AT3G57130/AT4G17460/AT4G24540/AT4G28190/AT5G03790/AT5G11320/AT5G14010/AT5G18560/AT5G61850 16
BP GO:1900376 regulation of secondary metabolite biosynthetic process 22/6662 34/25557 0.647 2.482 5.135 0.000 0.000 0.000 AT1G07640/AT1G08340/AT1G61590/AT1G63910/AT3G15050/AT3G27785/AT3G55580/AT4G09820/AT4G12350/AT4G22680/AT4G27260/AT4G38620/AT4G39410/AT4G39940/AT5G05340/AT5G07690/AT5G07700/AT5G16600/AT5G23260/AT5G40360/... 22
BP GO:0007267 cell-cell signaling 42/6662 85/25557 0.494 1.896 4.911 0.000 0.000 0.000 AT1G06225/AT1G25425/AT1G26600/AT1G28270/AT1G35467/AT1G47485/AT1G49005/AT1G60815/AT1G61563/AT1G61566/AT1G63245/AT1G68795/AT1G69320/AT1G69970/AT1G70895/AT1G73965/AT2G19020/AT2G19040/AT2G19045/AT2G22055/... 42
BP GO:0043086 negative regulation of catalytic activity 44/6662 91/25557 0.484 1.855 4.851 0.000 0.000 0.000 AT1G02550/AT1G09360/AT1G14890/AT1G23205/AT1G23350/AT1G44900/AT1G55770/AT1G62760/AT1G62770/AT1G64010/AT1G70720/AT1G72290/AT1G73330/AT2G01610/AT2G15345/AT2G31425/AT2G31430/AT2G47340/AT2G47670/AT3G36659/... 44
BP GO:0008300 isoprenoid catabolic process 19/6662 28/25557 0.679 2.603 5.040 0.000 0.000 0.000 AT1G02400/AT1G30040/AT1G30100/AT1G36060/AT1G47990/AT1G78390/AT2G29090/AT2G44990/AT3G14440/AT3G19270/AT3G24220/AT4G18350/AT4G19170/AT4G19230/AT4G32810/AT5G13790/AT5G45340/AT5G58660/AT5G65640 19
BP GO:0010143 cutin biosynthetic process 19/6662 28/25557 0.679 2.603 5.040 0.000 0.000 0.000 AT1G01610/AT1G02390/AT1G06520/AT1G15360/AT1G49430/AT1G64670/AT2G38110/AT2G38530/AT3G05600/AT3G11325/AT3G11430/AT3G48720/AT4G00730/AT4G01950/AT4G28110/AT5G06090/AT5G15310/AT5G23940/AT5G57800 19
BP GO:0160062 cutin-based cuticle development 19/6662 28/25557 0.679 2.603 5.040 0.000 0.000 0.000 AT1G01610/AT1G02390/AT1G06520/AT1G15360/AT1G49430/AT1G64670/AT2G38110/AT2G38530/AT3G05600/AT3G11325/AT3G11430/AT3G48720/AT4G00730/AT4G01950/AT4G28110/AT5G06090/AT5G15310/AT5G23940/AT5G57800 19
BP GO:0001763 morphogenesis of a branching structure 25/6662 42/25557 0.595 2.283 4.943 0.000 0.000 0.000 AT1G30210/AT1G55580/AT1G67260/AT1G68800/AT1G72490/AT1G73870/AT2G17950/AT2G27990/AT2G31070/AT2G34925/AT2G44990/AT2G46640/AT3G21880/AT3G24300/AT3G24770/AT3G30260/AT4G13195/AT4G18390/AT4G21030/AT4G21050/... 25
BP GO:0006026 aminoglycan catabolic process 13/6662 16/25557 0.812 3.117 5.030 0.000 0.000 0.000 AT1G02360/AT1G56680/AT2G43600/AT3G16920/AT3G47540/AT4G19720/AT4G19730/AT4G19740/AT4G19750/AT4G19760/AT4G19800/AT4G19810/AT4G19820 13
BP GO:0006030 chitin metabolic process 13/6662 16/25557 0.812 3.117 5.030 0.000 0.000 0.000 AT1G02360/AT1G56680/AT2G43600/AT3G16920/AT3G47540/AT4G19720/AT4G19730/AT4G19740/AT4G19750/AT4G19760/AT4G19800/AT4G19810/AT4G19820 13
BP GO:0006032 chitin catabolic process 13/6662 16/25557 0.812 3.117 5.030 0.000 0.000 0.000 AT1G02360/AT1G56680/AT2G43600/AT3G16920/AT3G47540/AT4G19720/AT4G19730/AT4G19740/AT4G19750/AT4G19760/AT4G19800/AT4G19810/AT4G19820 13
BP GO:0016104 triterpenoid biosynthetic process 13/6662 16/25557 0.812 3.117 5.030 0.000 0.000 0.000 AT1G66960/AT1G78500/AT1G78950/AT1G78955/AT1G78960/AT1G78970/AT3G29255/AT3G45130/AT4G15340/AT4G15370/AT5G36150/AT5G42600/AT5G48010 13
BP GO:0046348 amino sugar catabolic process 13/6662 16/25557 0.812 3.117 5.030 0.000 0.000 0.000 AT1G02360/AT1G56680/AT2G43600/AT3G16920/AT3G47540/AT4G19720/AT4G19730/AT4G19740/AT4G19750/AT4G19760/AT4G19800/AT4G19810/AT4G19820 13
BP GO:1901072 glucosamine-containing compound catabolic process 13/6662 16/25557 0.812 3.117 5.030 0.000 0.000 0.000 AT1G02360/AT1G56680/AT2G43600/AT3G16920/AT3G47540/AT4G19720/AT4G19730/AT4G19740/AT4G19750/AT4G19760/AT4G19800/AT4G19810/AT4G19820 13
BP GO:0010073 meristem maintenance 105/6662 275/25557 0.382 1.465 4.601 0.000 0.000 0.000 AT1G03170/AT1G03840/AT1G05370/AT1G07090/AT1G10657/AT1G11125/AT1G11915/AT1G13620/AT1G13710/AT1G16530/AT1G23010/AT1G26310/AT1G26600/AT1G32190/AT1G44900/AT1G49005/AT1G51190/AT1G52150/AT1G54200/AT1G54330/... 105
BP GO:0046246 terpene biosynthetic process 23/6662 38/25557 0.605 2.322 4.842 0.000 0.000 0.000 AT1G48800/AT1G48820/AT1G61680/AT1G66020/AT1G70080/AT2G23230/AT3G14490/AT3G14520/AT3G25180/AT3G25810/AT3G29110/AT3G29190/AT3G29410/AT3G32030/AT4G13280/AT4G13300/AT4G15330/AT4G15870/AT4G20200/AT4G20210/... 23
BP GO:0009809 lignin biosynthetic process 31/6662 58/25557 0.534 2.050 4.755 0.000 0.000 0.000 AT1G08340/AT1G16490/AT1G32770/AT1G61590/AT1G61810/AT1G61820/AT1G63910/AT1G80820/AT2G21730/AT2G21890/AT2G28670/AT2G38080/AT3G15050/AT3G16920/AT3G55580/AT4G01070/AT4G12350/AT4G22680/AT4G34230/AT4G36220/... 31
BP GO:0120251 hydrocarbon biosynthetic process 37/6662 74/25557 0.500 1.918 4.696 0.000 0.000 0.000 AT1G01480/AT1G02205/AT1G12010/AT1G48800/AT1G48820/AT1G61680/AT1G62380/AT1G66020/AT1G70080/AT2G22810/AT2G23230/AT3G04070/AT3G14490/AT3G14520/AT3G15010/AT3G25180/AT3G25810/AT3G29110/AT3G29190/AT3G29410/... 37
BP GO:0008202 steroid metabolic process 41/6662 85/25557 0.482 1.850 4.663 0.000 0.000 0.000 AT1G12740/AT1G17060/AT1G19630/AT1G20050/AT1G25330/AT1G65670/AT2G14920/AT2G26710/AT2G28850/AT2G28860/AT2G29090/AT2G34490/AT2G45400/AT3G13730/AT3G19270/AT3G30290/AT3G44326/AT3G44970/AT4G01510/AT4G15300/... 41
BP GO:0048438 floral whorl development 88/6662 224/25557 0.393 1.507 4.526 0.000 0.000 0.000 AT1G01030/AT1G02065/AT1G08320/AT1G13400/AT1G17420/AT1G24260/AT1G31140/AT1G32240/AT1G52150/AT1G52880/AT1G62360/AT1G65620/AT1G68480/AT1G69180/AT1G69690/AT1G70510/AT1G70560/AT1G75520/AT1G75790/AT1G80090/... 88
BP GO:0140115 export across plasma membrane 43/6662 91/25557 0.473 1.813 4.612 0.000 0.000 0.000 AT1G11670/AT1G23080/AT1G23300/AT1G33080/AT1G33090/AT1G33100/AT1G44800/AT1G47530/AT1G58340/AT1G66760/AT1G70940/AT1G73590/AT1G75500/AT1G77110/AT2G01420/AT2G04050/AT2G04070/AT2G04080/AT2G04090/AT2G04100/... 43
BP GO:0009625 response to insect 24/6662 41/25557 0.585 2.246 4.740 0.000 0.000 0.000 AT1G07640/AT1G16410/AT1G18570/AT1G52400/AT1G72290/AT1G73260/AT1G74080/AT2G22330/AT3G14210/AT3G26830/AT3G56400/AT4G13770/AT4G16740/AT4G39950/AT5G07690/AT5G07700/AT5G23010/AT5G24770/AT5G26000/AT5G44420/... 24
BP GO:0016052 carbohydrate catabolic process 90/6662 233/25557 0.386 1.482 4.387 0.000 0.000 0.000 AT1G02810/AT1G05310/AT1G11370/AT1G11580/AT1G11590/AT1G11920/AT1G14420/AT1G22650/AT1G23200/AT1G30350/AT1G47840/AT1G53830/AT1G67750/AT1G69940/AT1G72000/AT2G19150/AT2G21330/AT2G21610/AT2G26440/AT2G26450/... 90
BP GO:0044092 negative regulation of molecular function 45/6662 98/25557 0.459 1.762 4.485 0.000 0.001 0.000 AT1G02550/AT1G09360/AT1G14890/AT1G23205/AT1G23350/AT1G44900/AT1G55770/AT1G62760/AT1G62770/AT1G64010/AT1G70720/AT1G72290/AT1G73330/AT2G01610/AT2G15345/AT2G31425/AT2G31430/AT2G47340/AT2G47670/AT3G36659/... 45
BP GO:0015772 oligosaccharide transport 15/6662 21/25557 0.714 2.740 4.737 0.000 0.001 0.000 AT1G09960/AT1G11560/AT1G22710/AT1G66570/AT1G71880/AT1G71890/AT2G14670/AT3G48740/AT4G25010/AT5G06170/AT5G13170/AT5G23660/AT5G43610/AT5G50790/AT5G50800 15
BP GO:0009956 radial pattern formation 22/6662 37/25557 0.595 2.281 4.630 0.000 0.001 0.000 AT1G07640/AT1G10657/AT1G54330/AT1G64390/AT2G16850/AT2G18490/AT2G32560/AT2G37590/AT2G46640/AT3G26610/AT3G45610/AT4G13600/AT4G37650/AT5G02460/AT5G08391/AT5G10510/AT5G16560/AT5G22930/AT5G57390/AT5G60200/... 22
BP GO:0010223 secondary shoot formation 22/6662 37/25557 0.595 2.281 4.630 0.000 0.001 0.000 AT1G30210/AT1G55580/AT1G67260/AT1G68800/AT1G73870/AT2G17950/AT2G27990/AT2G31070/AT2G34925/AT2G44990/AT3G21880/AT3G24770/AT3G30260/AT4G13195/AT4G18390/AT4G21030/AT4G21050/AT4G29130/AT4G32810/AT5G02030/... 22
BP GO:0010346 shoot axis formation 22/6662 37/25557 0.595 2.281 4.630 0.000 0.001 0.000 AT1G30210/AT1G55580/AT1G67260/AT1G68800/AT1G73870/AT2G17950/AT2G27990/AT2G31070/AT2G34925/AT2G44990/AT3G21880/AT3G24770/AT3G30260/AT4G13195/AT4G18390/AT4G21030/AT4G21050/AT4G29130/AT4G32810/AT5G02030/... 22
BP GO:0009299 mRNA transcription 12/6662 15/25557 0.800 3.069 4.759 0.000 0.001 0.000 AT1G07090/AT1G16910/AT1G19790/AT1G63210/AT1G78815/AT2G31160/AT2G42610/AT3G04510/AT3G23290/AT4G18610/AT5G28490/AT5G58500 12
BP GO:0010431 seed maturation 47/6662 105/25557 0.448 1.717 4.373 0.000 0.001 0.001 AT1G01040/AT1G03880/AT1G03890/AT1G07430/AT1G07985/AT1G22600/AT1G27461/AT1G28300/AT1G30100/AT1G35730/AT1G48130/AT1G48270/AT1G49770/AT1G70910/AT1G78390/AT2G26130/AT2G29090/AT2G36770/AT2G40170/AT2G41280/... 47
BP GO:0015766 disaccharide transport 14/6662 20/25557 0.700 2.685 4.477 0.000 0.001 0.001 AT1G09960/AT1G22710/AT1G66570/AT1G71880/AT1G71890/AT2G14670/AT3G48740/AT4G25010/AT5G06170/AT5G13170/AT5G23660/AT5G43610/AT5G50790/AT5G50800 14
BP GO:0015770 sucrose transport 14/6662 20/25557 0.700 2.685 4.477 0.000 0.001 0.001 AT1G09960/AT1G22710/AT1G66570/AT1G71880/AT1G71890/AT2G14670/AT3G48740/AT4G25010/AT5G06170/AT5G13170/AT5G23660/AT5G43610/AT5G50790/AT5G50800 14
BP GO:0009813 flavonoid biosynthetic process 52/6662 122/25557 0.426 1.635 4.175 0.000 0.001 0.001 AT1G20490/AT1G30530/AT1G32640/AT1G34790/AT1G55210/AT1G56650/AT1G62710/AT1G64390/AT1G66370/AT1G66380/AT1G69690/AT1G69935/AT1G71030/AT1G73325/AT1G75040/AT2G16890/AT2G22510/AT2G22960/AT2G30760/AT2G36790/... 52
BP GO:0042908 xenobiotic transport 29/6662 57/25557 0.509 1.952 4.271 0.000 0.001 0.001 AT1G11670/AT1G23300/AT1G33080/AT1G33090/AT1G33100/AT1G47530/AT1G58340/AT1G66760/AT1G66950/AT2G04050/AT2G04070/AT2G04080/AT2G04090/AT2G04100/AT2G16980/AT2G16990/AT2G34360/AT2G38510/AT3G21690/AT3G23550/... 29
BP GO:1905177 tracheary element differentiation 13/6662 18/25557 0.722 2.771 4.462 0.000 0.001 0.001 AT1G12260/AT1G43790/AT1G71930/AT2G18060/AT2G40470/AT4G08160/AT4G29230/AT4G36160/AT5G06610/AT5G19530/AT5G25830/AT5G62380/AT5G66300 13
BP GO:0009832 plant-type cell wall biogenesis 117/6662 327/25557 0.358 1.373 4.027 0.000 0.002 0.001 AT1G08340/AT1G09790/AT1G09960/AT1G10460/AT1G12260/AT1G13250/AT1G16490/AT1G17950/AT1G20490/AT1G22480/AT1G28290/AT1G28470/AT1G32180/AT1G32640/AT1G32770/AT1G33280/AT1G35780/AT1G41830/AT1G43650/AT1G43790/... 117
BP GO:0046618 xenobiotic export from cell 26/6662 50/25557 0.520 1.995 4.181 0.000 0.002 0.002 AT1G11670/AT1G23300/AT1G33080/AT1G33090/AT1G33100/AT1G47530/AT1G58340/AT1G66760/AT2G04050/AT2G04070/AT2G04080/AT2G04090/AT2G04100/AT2G34360/AT2G38510/AT3G21690/AT3G23550/AT3G26590/AT4G21903/AT4G21910/... 26
BP GO:1990961 xenobiotic detoxification by transmembrane export across the plasma membrane 26/6662 50/25557 0.520 1.995 4.181 0.000 0.002 0.002 AT1G11670/AT1G23300/AT1G33080/AT1G33090/AT1G33100/AT1G47530/AT1G58340/AT1G66760/AT2G04050/AT2G04070/AT2G04080/AT2G04090/AT2G04100/AT2G34360/AT2G38510/AT3G21690/AT3G23550/AT3G26590/AT4G21903/AT4G21910/... 26
BP GO:0010016 shoot system morphogenesis 100/6662 274/25557 0.365 1.400 3.954 0.000 0.002 0.002 AT1G01030/AT1G04150/AT1G11090/AT1G11125/AT1G13290/AT1G13710/AT1G18580/AT1G23320/AT1G30210/AT1G32190/AT1G33170/AT1G47485/AT1G48410/AT1G52150/AT1G54200/AT1G55580/AT1G62500/AT1G63700/AT1G65900/AT1G67260/... 100
BP GO:0010817 regulation of hormone levels 108/6662 300/25557 0.360 1.381 3.942 0.000 0.002 0.002 AT1G04180/AT1G04610/AT1G12110/AT1G17060/AT1G18350/AT1G19630/AT1G21430/AT1G23080/AT1G23320/AT1G25330/AT1G25410/AT1G28230/AT1G28300/AT1G44350/AT1G48410/AT1G48910/AT1G51760/AT1G53700/AT1G54200/AT1G58460/... 108
BP GO:0010087 phloem or xylem histogenesis 67/6662 170/25557 0.394 1.512 3.977 0.000 0.002 0.002 AT1G12260/AT1G13290/AT1G26600/AT1G46480/AT1G49005/AT1G52150/AT1G54200/AT1G54330/AT1G62990/AT1G63245/AT1G65910/AT1G67720/AT1G69320/AT1G69970/AT1G70560/AT1G70895/AT1G71930/AT1G73965/AT1G79430/AT1G80100/... 67
BP GO:0072330 monocarboxylic acid biosynthetic process 108/6662 301/25557 0.359 1.376 3.901 0.000 0.002 0.002 AT1G01120/AT1G04220/AT1G05800/AT1G06080/AT1G06090/AT1G06100/AT1G06120/AT1G06350/AT1G06360/AT1G07720/AT1G11600/AT1G12080/AT1G17420/AT1G19250/AT1G19640/AT1G21970/AT1G25450/AT1G30100/AT1G36060/AT1G49430/... 108
BP GO:0048609 multicellular organismal reproductive process 93/6662 253/25557 0.368 1.410 3.893 0.000 0.002 0.002 AT1G01040/AT1G03880/AT1G03890/AT1G06520/AT1G07430/AT1G07985/AT1G11220/AT1G14630/AT1G17420/AT1G18580/AT1G19890/AT1G19900/AT1G22600/AT1G24260/AT1G24520/AT1G27461/AT1G28300/AT1G28640/AT1G30100/AT1G32770/... 93
BP GO:0009691 cytokinin biosynthetic process 14/6662 21/25557 0.667 2.557 4.240 0.000 0.002 0.002 AT1G25410/AT1G62360/AT1G67110/AT1G68460/AT2G35990/AT2G37210/AT3G19160/AT3G23630/AT3G63110/AT5G03270/AT5G06300/AT5G19040/AT5G26140/AT5G38450 14
BP GO:0016115 terpenoid catabolic process 14/6662 21/25557 0.667 2.557 4.240 0.000 0.002 0.002 AT1G02400/AT1G30040/AT1G36060/AT1G47990/AT2G29090/AT2G44990/AT3G19270/AT4G19170/AT4G19230/AT4G32810/AT5G13790/AT5G45340/AT5G58660/AT5G65640 14
BP GO:0051923 sulfation 12/6662 17/25557 0.706 2.708 4.183 0.000 0.003 0.003 AT1G13420/AT1G13430/AT1G28170/AT2G03750/AT2G14920/AT2G27570/AT3G45070/AT3G45080/AT4G26280/AT5G07000/AT5G07010/AT5G43690 12
BP GO:0048825 cotyledon development 28/6662 57/25557 0.491 1.884 3.969 0.000 0.003 0.003 AT1G12980/AT1G13290/AT1G23320/AT1G24590/AT1G53700/AT1G69780/AT1G70560/AT1G73590/AT2G26700/AT2G32370/AT2G36120/AT2G46310/AT2G46990/AT3G14370/AT3G50870/AT4G04890/AT4G11140/AT4G21030/AT4G21750/AT4G27420/... 28
BP GO:0009739 response to gibberellin 54/6662 133/25557 0.406 1.558 3.828 0.000 0.004 0.003 AT1G03840/AT1G07430/AT1G10480/AT1G15550/AT1G16950/AT1G18080/AT1G48270/AT1G62660/AT1G63090/AT1G67030/AT1G67100/AT1G68360/AT1G75750/AT2G03500/AT2G26130/AT2G26300/AT2G32460/AT2G34340/AT2G36830/AT2G45660/... 54
BP GO:0010158 abaxial cell fate specification 10/6662 13/25557 0.769 2.951 4.178 0.000 0.004 0.004 AT1G08465/AT1G14600/AT1G32240/AT1G68480/AT2G02060/AT2G45190/AT4G00180/AT4G17695/AT5G16560/AT5G42630 10
BP GO:2000762 regulation of phenylpropanoid metabolic process 17/6662 29/25557 0.586 2.249 3.995 0.000 0.004 0.004 AT1G08340/AT1G18400/AT1G61590/AT1G63910/AT3G15050/AT3G55580/AT4G09820/AT4G12350/AT4G17785/AT4G22680/AT4G38620/AT4G39410/AT5G05340/AT5G16600/AT5G23260/AT5G45580/AT5G49660 17
BP GO:0044272 sulfur compound biosynthetic process 106/6662 300/25557 0.353 1.355 3.677 0.000 0.005 0.004 AT1G07640/AT1G12130/AT1G12160/AT1G15640/AT1G16400/AT1G16410/AT1G16500/AT1G18140/AT1G18570/AT1G20490/AT1G22220/AT1G32640/AT1G34430/AT1G41830/AT1G45201/AT1G52000/AT1G52100/AT1G54000/AT1G55210/AT1G56250/... 106
BP GO:0009690 cytokinin metabolic process 21/6662 40/25557 0.525 2.014 3.811 0.000 0.007 0.006 AT1G25410/AT1G58460/AT1G62360/AT1G67110/AT1G68460/AT2G19500/AT2G35990/AT2G37210/AT3G19160/AT3G23630/AT3G30580/AT3G63110/AT4G29740/AT5G03270/AT5G06300/AT5G08150/AT5G19040/AT5G21482/AT5G26140/AT5G38450/... 21
BP GO:0098754 detoxification 62/6662 161/25557 0.385 1.477 3.607 0.000 0.007 0.006 AT1G02940/AT1G05250/AT1G10360/AT1G11670/AT1G17190/AT1G19550/AT1G22440/AT1G22990/AT1G23300/AT1G33080/AT1G33090/AT1G33100/AT1G47530/AT1G48130/AT1G49860/AT1G56160/AT1G56650/AT1G58340/AT1G59700/AT1G66760/... 62
BP GO:0010262 somatic embryogenesis 15/6662 25/25557 0.600 2.302 3.867 0.000 0.007 0.006 AT1G04180/AT1G21970/AT1G28300/AT1G68240/AT1G73590/AT3G26790/AT3G27785/AT3G62100/AT4G28720/AT5G01240/AT5G07260/AT5G13790/AT5G19890/AT5G25620/AT5G40360 15

表7.1 GO富集分析部分结果:
ONTOLOGY:GO方面,细胞成分,生物过程或分子功能之一;
ID:GO标识符,GO ID;
Description:GO术语的文字描述;
GeneRatio:该条目基因比例,分子是富集到这个GO条目上的基因的数目,分母是所有peak关联基因的数目;
BgRatio:背景比例,分母是物种全部有GO注释的基因的数目,分子是这些基因中注释到这个GO条目上面的基因的数目;
RichFactor​​:富集因子(Enrichment Factor)= GeneRatio / BgRatio;
​​FoldEnrichment​:富集倍数(Fold Enrichment)= (富集通路基因数 / 输入基因数) / (背景通路基因数 / 背景总基因数);
​​zScore​:标准化富集得分(基于超几何分布的 Z 值);
pvalue:富集的p值;
p.adjust:使用BH校正之后的p值;
qvalue:q值,使用FDR校正之后的p值,q-value相比于p-value更加严格,表示p-value产生假阳性的概率;
geneID:富集到这个GO条目上面的具体的基因ID;
Count:富集到这个GO条目上面的基因的数目。




图7.2 Peak关联基因GO气泡图。纵坐标是GO Term 名称,横坐标是对应GO Term 中检出的基因占背景基因的个数,颜色代表显著性,气泡大小代表该条目基因比例。



图7.3 Peak关联基因GO条状图。按照BP、MF、CC三个方面分别展示GO富集结果。纵坐标是GO Term 名称,横坐标值越大显著性越高,如果为0代表qvalue等于1。



7.2 KEGG富集分析

KEGG (Kyoto Encyclopedia of Genes and Genomes, http://www.genome.jp/kegg/) 是日本京都大学构建的基因组信息数据库,它将基因组序列信息与功能信息相结合,提供了一个全面的基因组功能信息资源。在PATHWAY数据库里,包括图解的细胞生化过程如代谢、膜转运、信号传递、细胞周期,还包括同系保守的子通路等信息。KEGG富集分析可以对peak关联基因进行KEGG通路富集分析。

下面展示peak关联的基因富集KEGG富集分析部分结果,完整结果请见/result/6.gokegg。KEGG富集分析完整结果请详见位于report/result/6.gokegg文件夹的*_KEGG_res.csv表格文件。

显示前100行 (共95行)
ID Description GeneRatio BgRatio RichFactor FoldEnrichment zScore pvalue p.adjust qvalue geneID Count
00940 Phenylpropanoid biosynthesis 72/650 109/3449 0.661 3.505 12.805 0.000 0.000 0.000 AT1G05260/AT1G14540/AT1G14550/AT1G24110/AT1G24735/AT1G30870/AT1G34510/AT1G48130/AT1G49570/AT1G51680/AT1G61820/AT1G62940/AT1G65060/AT1G68850/AT1G80820/AT2G18140/AT2G18150/AT2G18980/AT2G21730/AT2G21890/... 72
00360 Phenylalanine metabolism 58/650 92/3449 0.630 3.345 10.986 0.000 0.000 0.000 AT1G05260/AT1G14540/AT1G14550/AT1G24110/AT1G24735/AT1G30870/AT1G34510/AT1G48130/AT1G49570/AT1G51680/AT1G62940/AT1G65060/AT1G68850/AT2G18140/AT2G18150/AT2G18980/AT2G24800/AT2G37040/AT2G38380/AT2G38390/... 58
00945 Stilbenoid, diarylheptanoid and gingerol biosynthesis 35/650 67/3449 0.522 2.772 7.057 0.000 0.000 0.000 AT1G01190/AT1G11600/AT1G11610/AT1G13710/AT1G24735/AT2G30750/AT3G26170/AT3G26190/AT3G26200/AT3G26270/AT3G26290/AT3G26310/AT3G26320/AT3G26830/AT3G52970/AT3G53280/AT3G56630/AT4G13290/AT4G13310/AT4G15360/... 35
00903 Limonene degradation 34/650 70/3449 0.486 2.577 6.424 0.000 0.000 0.000 AT1G01190/AT1G11600/AT1G11610/AT1G13710/AT2G30750/AT3G26170/AT3G26190/AT3G26200/AT3G26270/AT3G26290/AT3G26310/AT3G26320/AT3G26830/AT3G52970/AT3G53280/AT3G56630/AT4G13290/AT4G13310/AT4G15360/AT4G31940/... 34
04075 Plant hormone signal transduction 75/650 232/3449 0.323 1.715 5.436 0.000 0.000 0.000 AT1G03445/AT1G04250/AT1G07430/AT1G08320/AT1G19830/AT1G19840/AT1G19850/AT1G22070/AT1G23160/AT1G28130/AT1G32640/AT1G48500/AT1G48670/AT1G52830/AT1G56150/AT1G59500/AT1G59750/AT1G67710/AT1G73000/AT1G75590/... 75
00040 Pentose and glucuronate interconversions 27/650 59/3449 0.458 2.428 5.332 0.000 0.000 0.000 AT1G02790/AT1G11590/AT1G11920/AT1G14420/AT1G26570/AT1G30350/AT1G53830/AT1G67750/AT2G45220/AT2G47030/AT3G03250/AT3G07850/AT3G24230/AT3G27400/AT3G43270/AT3G57510/AT3G59850/AT4G13210/AT4G22080/AT4G22090/... 27
00908 Zeatin biosynthesis 13/650 22/3449 0.591 3.135 4.842 0.000 0.000 0.000 AT1G22400/AT1G25410/AT1G67110/AT1G68460/AT2G19500/AT3G19160/AT3G23630/AT3G63110/AT4G29740/AT5G19040/AT5G21482/AT5G38450/AT5G56970 13
00500 Starch and sucrose metabolism 37/650 114/3449 0.325 1.722 3.778 0.000 0.003 0.003 AT1G02790/AT1G11590/AT1G16980/AT1G17000/AT1G18580/AT1G26570/AT1G47840/AT1G53830/AT1G61820/AT1G62660/AT1G78580/AT2G36190/AT2G44480/AT2G45220/AT2G47030/AT3G03250/AT3G07850/AT3G13790/AT3G43190/AT3G43270/... 37
00906 Carotenoid biosynthesis 11/650 24/3449 0.458 2.432 3.392 0.002 0.021 0.019 AT1G30100/AT1G78390/AT2G29090/AT3G14440/AT3G19270/AT3G24220/AT4G18350/AT4G19170/AT4G19230/AT5G45340/AT5G52570 11
04710 Circadian rhythm 11/650 24/3449 0.458 2.432 3.392 0.002 0.021 0.019 AT1G20140/AT2G03160/AT2G20160/AT2G25700/AT3G21850/AT3G21860/AT3G42830/AT3G60010/AT3G60020/AT4G34210/AT4G34470 11
00592 alpha-Linolenic acid metabolism 11/650 27/3449 0.407 2.162 2.920 0.007 0.054 0.047 AT1G17420/AT1G19640/AT1G55020/AT1G67560/AT2G44810/AT3G01420/AT3G22400/AT3G25770/AT3G25780/AT3G45140/AT4G15440 11
00380 Tryptophan metabolism 14/650 38/3449 0.368 1.955 2.852 0.007 0.054 0.047 AT1G04610/AT2G22330/AT2G30770/AT2G44460/AT2G44470/AT3G44300/AT3G44310/AT4G13260/AT4G32540/AT4G35090/AT4G39950/AT5G11320/AT5G25620/AT5G43890 14
00966 Glucosinolate biosynthesis 8/650 18/3449 0.444 2.358 2.784 0.011 0.082 0.072 AT1G16400/AT1G16410/AT2G22330/AT3G19710/AT4G13770/AT4G39950/AT5G05260/AT5G23020 8
00904 Diterpenoid biosynthesis 6/650 12/3449 0.500 2.653 2.764 0.014 0.098 0.086 AT1G15550/AT1G80340/AT4G02780/AT4G25420/AT5G07200/AT5G51810 6
00270 Cysteine and methionine metabolism 24/650 84/3449 0.286 1.516 2.307 0.019 0.118 0.103 AT1G01480/AT1G12010/AT1G62380/AT1G64660/AT1G69770/AT1G80740/AT2G22810/AT2G26400/AT3G22740/AT3G49700/AT3G61510/AT4G08040/AT4G08990/AT4G14140/AT4G26200/AT4G35640/AT4G37770/AT5G11520/AT5G15950/AT5G18930/... 24
00960 Tropane, piperidine and pyridine alkaloid biosynthesis 8/650 22/3449 0.364 1.930 2.107 0.041 0.241 0.211 AT2G29150/AT2G29320/AT2G29330/AT2G29340/AT2G29360/AT2G29370/AT2G30670/AT5G11520 8
00941 Flavonoid biosynthesis 7/650 19/3449 0.368 1.955 2.011 0.051 0.284 0.248 AT1G24735/AT3G51240/AT3G55120/AT5G07990/AT5G13930/AT5G42800/AT5G66220 7
00460 Cyanoamino acid metabolism 9/650 28/3449 0.321 1.706 1.806 0.065 0.344 0.301 AT1G61820/AT2G44480/AT3G44300/AT3G44310/AT3G47050/AT4G13890/AT4G39650/AT5G22300/AT5G42260 9
00100 Steroid biosynthesis 9/650 29/3449 0.310 1.647 1.685 0.079 0.397 0.348 AT1G20050/AT2G28850/AT2G28860/AT2G34490/AT3G45130/AT4G37760/AT5G24140/AT5G24150/AT5G24160 9
00910 Nitrogen metabolism 12/650 43/3449 0.279 1.481 1.529 0.095 0.452 0.396 AT1G08080/AT1G23730/AT1G37130/AT1G77760/AT2G28210/AT2G37040/AT3G10340/AT3G44300/AT3G44310/AT4G21000/AT5G04230/AT5G16570 12
00430 Taurine and hypotaurine metabolism 4/650 12/3449 0.333 1.769 1.285 0.175 0.793 0.694 AT2G02000/AT2G02010/AT4G39650/AT5G17330 4
00350 Tyrosine metabolism 7/650 26/3449 0.269 1.429 1.057 0.205 0.884 0.774 AT1G12050/AT1G22440/AT1G77120/AT4G22110/AT4G28680/AT5G11520/AT5G24760 7
01040 Biosynthesis of unsaturated fatty acids 8/650 32/3449 0.250 1.327 0.894 0.244 1.000 0.875 AT1G06080/AT1G43800/AT1G63380/AT2G29980/AT3G02610/AT3G02620/AT3G55360/AT4G30950 8
00130 Ubiquinone and other terpenoid-quinone biosynthesis 6/650 24/3449 0.250 1.327 0.773 0.291 1.000 0.875 AT1G23360/AT1G51680/AT1G62940/AT1G65060/AT3G21230/AT3G21240 6
00052 Galactose metabolism 9/650 39/3449 0.231 1.224 0.679 0.307 1.000 0.875 AT1G47840/AT1G62660/AT2G36190/AT3G03250/AT3G06580/AT3G13790/AT4G01970/AT4G29130/AT4G37840 9
00562 Inositol phosphate metabolism 12/650 54/3449 0.222 1.179 0.639 0.312 1.000 0.875 AT1G21980/AT1G60890/AT2G26420/AT2G26870/AT3G03530/AT3G03540/AT3G55940/AT4G01190/AT4G26260/AT4G38530/AT5G56640/AT5G58670 12
00564 Glycerophospholipid metabolism 12/650 54/3449 0.222 1.179 0.639 0.312 1.000 0.875 AT1G01610/AT1G02390/AT1G06520/AT2G26870/AT2G38110/AT2G44810/AT3G03530/AT3G03540/AT3G11430/AT4G01950/AT5G06090/AT5G25370 12
00410 beta-Alanine metabolism 7/650 30/3449 0.233 1.238 0.631 0.331 1.000 0.875 AT2G02000/AT2G02010/AT2G30650/AT2G30660/AT5G17330/AT5G19530/AT5G65940 7
00561 Glycerolipid metabolism 8/650 35/3449 0.229 1.213 0.610 0.334 1.000 0.875 AT1G01610/AT1G02390/AT1G06520/AT2G11810/AT2G38110/AT3G11430/AT4G01950/AT5G06090 8
00196 Photosynthesis - antenna proteins 5/650 22/3449 0.227 1.206 0.467 0.402 1.000 0.875 AT1G19150/AT2G05070/AT2G34430/AT2G40100/AT3G61470 5
00900 Terpenoid backbone biosynthesis 10/650 48/3449 0.208 1.105 0.354 0.419 1.000 0.875 AT2G18620/AT2G23410/AT2G23800/AT3G14510/AT3G20160/AT3G21500/AT3G29430/AT3G32040/AT5G60500/AT5G60510 10
04070 Phosphatidylinositol signaling system 10/650 48/3449 0.208 1.105 0.354 0.419 1.000 0.875 AT1G21980/AT1G60890/AT1G66410/AT2G26420/AT3G43810/AT3G55940/AT3G56800/AT4G01190/AT4G38530/AT5G58670 10
00010 Glycolysis / Gluconeogenesis 21/650 105/3449 0.200 1.061 0.307 0.419 1.000 0.875 AT1G22440/AT1G34430/AT1G47840/AT1G77120/AT2G21330/AT3G01260/AT3G25960/AT3G49160/AT3G55650/AT4G22110/AT4G26390/AT4G26530/AT4G29130/AT4G37840/AT5G01320/AT5G01330/AT5G04120/AT5G15140/AT5G24760/AT5G36880/... 21
04626 Plant-pathogen interaction 29/650 148/3449 0.196 1.040 0.238 0.440 1.000 0.875 AT1G19780/AT1G21550/AT1G24620/AT1G32640/AT1G48500/AT1G64060/AT1G66400/AT1G66410/AT1G73630/AT2G14610/AT2G36180/AT2G38910/AT2G41410/AT2G41860/AT3G03410/AT3G07490/AT3G10190/AT3G10300/AT3G22930/AT3G43810/... 29
00565 Ether lipid metabolism 4/650 18/3449 0.222 1.179 0.367 0.449 1.000 0.875 AT2G26870/AT3G03530/AT3G03540/AT5G25370 4
00195 Photosynthesis 9/650 46/3449 0.196 1.038 0.126 0.510 1.000 0.875 AT1G03600/AT1G20340/AT1G76100/AT4G02770/AT4G04640/AT4G05180/AT4G21280/AT5G10000/AT5G66570 9
00061 Fatty acid biosynthesis 5/650 27/3449 0.185 0.983 -0.044 0.595 1.000 0.875 AT1G43800/AT1G63380/AT2G30200/AT3G02610/AT3G02620 5
00520 Amino sugar and nucleotide sugar metabolism 18/650 99/3449 0.182 0.965 -0.171 0.610 1.000 0.875 AT1G18580/AT1G26570/AT1G31070/AT1G47840/AT1G63000/AT2G27860/AT3G03250/AT3G06580/AT3G12500/AT3G51160/AT3G59480/AT4G00110/AT4G10260/AT4G19810/AT4G29130/AT4G37840/AT5G24090/AT5G66280 18
00051 Fructose and mannose metabolism 9/650 50/3449 0.180 0.955 -0.154 0.619 1.000 0.875 AT1G47840/AT2G21330/AT3G51160/AT3G59480/AT4G10260/AT4G26530/AT4G29130/AT4G37840/AT5G66280 9
00630 Glyoxylate and dicarboxylate metabolism 6/650 34/3449 0.176 0.936 -0.180 0.640 1.000 0.875 AT3G21720/AT3G58740/AT3G60100/AT5G14780/AT5G38420/AT5G43330 6
00330 Arginine and proline metabolism 12/650 68/3449 0.176 0.936 -0.255 0.650 1.000 0.875 AT1G44180/AT2G39800/AT4G08870/AT4G35820/AT5G11520/AT5G14800/AT5G15950/AT5G16570/AT5G18900/AT5G18930/AT5G19530/AT5G38710 12
00620 Pyruvate metabolism 13/650 75/3449 0.173 0.920 -0.339 0.679 1.000 0.875 AT1G34430/AT1G67280/AT1G68750/AT3G25960/AT3G49160/AT3G55650/AT4G26390/AT5G23010/AT5G23020/AT5G25880/AT5G36880/AT5G43330/AT5G65690 13
00590 Arachidonic acid metabolism 2/650 12/3449 0.167 0.884 -0.193 0.691 1.000 0.875 AT1G63460/AT4G39650 2
04140 Autophagy - animal 2/650 12/3449 0.167 0.884 -0.193 0.691 1.000 0.875 AT1G54210/AT3G13970 2
00640 Propanoate metabolism 5/650 31/3449 0.161 0.856 -0.389 0.722 1.000 0.875 AT2G30650/AT2G30660/AT4G13660/AT5G36880/AT5G65940 5
04120 Ubiquitin mediated proteolysis 17/650 105/3449 0.162 0.859 -0.707 0.796 1.000 0.875 AT1G20140/AT2G03160/AT2G16740/AT2G18600/AT2G20160/AT2G25700/AT2G32790/AT3G17205/AT3G21850/AT3G21860/AT3G42830/AT3G60010/AT3G60020/AT4G34210/AT4G34470/AT5G26900/AT5G50870 17
00053 Ascorbate and aldarate metabolism 5/650 34/3449 0.147 0.780 -0.620 0.796 1.000 0.875 AT1G26570/AT1G55570/AT3G27820/AT4G26260/AT5G56640 5
00790 Folate biosynthesis 2/650 15/3449 0.133 0.707 -0.547 0.805 1.000 0.875 AT1G69190/AT3G11750 2
00250 Alanine, aspartate and glutamate metabolism 7/650 47/3449 0.149 0.790 -0.698 0.810 1.000 0.875 AT2G02000/AT2G02010/AT2G16570/AT3G08860/AT5G11520/AT5G16570/AT5G17330 7
00710 Carbon fixation by Calvin cycle 12/650 77/3449 0.156 0.827 -0.740 0.811 1.000 0.875 AT1G68750/AT2G21330/AT3G25960/AT3G49160/AT3G55650/AT4G26390/AT4G26530/AT5G11520/AT5G25880/AT5G38420/AT5G43330/AT5G65690 12
00650 Butanoate metabolism 3/650 22/3449 0.136 0.724 -0.627 0.813 1.000 0.875 AT2G02000/AT2G02010/AT5G17330 3
00290 Valine, leucine and isoleucine biosynthesis 5/650 36/3449 0.139 0.737 -0.764 0.836 1.000 0.875 AT1G50090/AT2G43100/AT3G19710/AT5G23010/AT5G23020 5
00480 Glutathione metabolism 9/650 62/3449 0.145 0.770 -0.880 0.853 1.000 0.875 AT1G02940/AT1G63460/AT1G65930/AT2G02930/AT2G29440/AT4G39650/AT5G19530/AT5G35790/AT5G37830 9
00450 Selenocompound metabolism 2/650 18/3449 0.111 0.590 -0.841 0.880 1.000 0.875 AT1G64660/AT5G49810 2
00280 Valine, leucine and isoleucine degradation 6/650 46/3449 0.130 0.692 -1.013 0.890 1.000 0.875 AT1G50090/AT2G30650/AT2G30660/AT3G19710/AT5G34780/AT5G65940 6
00030 Pentose phosphate pathway 7/650 53/3449 0.132 0.701 -1.058 0.896 1.000 0.875 AT2G21330/AT3G49360/AT4G26530/AT5G24400/AT5G24410/AT5G24420/AT5G35790 7
00760 Nicotinate and nicotinamide metabolism 1/650 11/3449 0.091 0.482 -0.829 0.900 1.000 0.875 AT2G23420 1
00260 Glycine, serine and threonine metabolism 6/650 47/3449 0.128 0.677 -1.073 0.902 1.000 0.875 AT1G17745/AT1G72810/AT3G08860/AT4G13890/AT4G33010/AT5G28237 6
00020 Citrate cycle (TCA cycle) 8/650 60/3449 0.133 0.707 -1.101 0.902 1.000 0.875 AT1G34430/AT1G65930/AT3G58740/AT3G60100/AT4G35650/AT5G43330/AT5G50950/AT5G65690 8
00920 Sulfur metabolism 4/650 34/3449 0.118 0.624 -1.061 0.907 1.000 0.875 AT4G35640/AT4G39940/AT5G28030/AT5G56760 4
04144 Endocytosis 9/650 67/3449 0.134 0.713 -1.144 0.908 1.000 0.875 AT1G21980/AT1G56410/AT1G60890/AT2G26420/AT3G45000/AT3G53710/AT4G01190/AT5G02490/AT5G25370 9
00071 Fatty acid degradation 5/650 41/3449 0.122 0.647 -1.095 0.909 1.000 0.875 AT1G22440/AT1G49430/AT1G77120/AT4G22110/AT5G24760 5
04130 SNARE interactions in vesicular transport 6/650 48/3449 0.125 0.663 -1.132 0.912 1.000 0.875 AT1G32270/AT2G36900/AT3G45280/AT3G52400/AT5G39630/AT5G58180 6
03022 Basal transcription factors 4/650 35/3449 0.114 0.606 -1.128 0.919 1.000 0.875 AT1G20000/AT4G17020/AT4G21010/AT4G24440 4
04141 Protein processing in endoplasmic reticulum 19/650 132/3449 0.144 0.764 -1.334 0.930 1.000 0.875 AT1G20140/AT1G53540/AT1G56410/AT1G78720/AT2G03160/AT2G16740/AT2G20160/AT2G25700/AT2G29500/AT2G32790/AT3G21850/AT3G21860/AT3G42830/AT3G60010/AT3G60020/AT4G32640/AT4G34210/AT4G34470/AT5G02490 19
00770 Pantothenate and CoA biosynthesis 2/650 22/3449 0.091 0.482 -1.174 0.939 1.000 0.875 AT1G50090/AT3G19710 2
00600 Sphingolipid metabolism 1/650 14/3449 0.071 0.379 -1.122 0.947 1.000 0.875 AT4G04930 1
04122 Sulfur relay system 1/650 14/3449 0.071 0.379 -1.122 0.947 1.000 0.875 AT1G01290 1
00860 Porphyrin metabolism 4/650 40/3449 0.100 0.531 -1.439 0.959 1.000 0.875 AT1G19670/AT1G44318/AT4G16690/AT4G25080 4
04145 Phagosome 7/650 62/3449 0.113 0.599 -1.535 0.963 1.000 0.875 AT1G19910/AT1G20010/AT1G32270/AT1G78720/AT4G23710/AT4G25950/AT4G32530 7
03430 Mismatch repair 3/650 33/3449 0.091 0.482 -1.440 0.963 1.000 0.875 AT1G03810/AT1G08130/AT4G25540 3
04650 Natural killer cell mediated cytotoxicity 1/650 17/3449 0.059 0.312 -1.370 0.972 1.000 0.875 AT3G59790 1
00400 Phenylalanine, tyrosine and tryptophan biosynthesis 4/650 43/3449 0.093 0.494 -1.610 0.974 1.000 0.875 AT3G55870/AT5G11520/AT5G17990/AT5G28237 4
00670 One carbon pool by folate 1/650 18/3449 0.056 0.295 -1.445 0.977 1.000 0.875 AT4G13890 1
03420 Nucleotide excision repair 6/650 59/3449 0.102 0.540 -1.719 0.978 1.000 0.875 AT1G03810/AT1G08130/AT1G18040/AT2G27120/AT3G42830/AT4G17020 6
03030 DNA replication 4/650 45/3449 0.089 0.472 -1.719 0.980 1.000 0.875 AT1G03810/AT1G08130/AT1G44900/AT2G27120 4
00300 Lysine biosynthesis 1/650 19/3449 0.053 0.279 -1.518 0.981 1.000 0.875 AT2G13810 1
04712 Circadian rhythm - plant 2/650 29/3449 0.069 0.366 -1.652 0.982 1.000 0.875 AT4G16250/AT5G13930 2
03020 RNA polymerase 2/650 30/3449 0.067 0.354 -1.713 0.985 1.000 0.875 AT1G61700/AT5G51940 2
03410 Base excision repair 3/650 39/3449 0.077 0.408 -1.791 0.986 1.000 0.875 AT1G08130/AT2G27120/AT3G12710 3
00510 N-Glycan biosynthesis 3/650 40/3449 0.075 0.398 -1.845 0.988 1.000 0.875 AT2G39630/AT5G07630/AT5G14950 3
04146 Peroxisome 5/650 61/3449 0.082 0.435 -2.146 0.994 1.000 0.875 AT1G49430/AT1G65930/AT3G56350/AT4G34900/AT4G35090 5
00190 Oxidative phosphorylation 13/650 127/3449 0.102 0.543 -2.528 0.998 1.000 0.875 AT1G19910/AT1G32710/AT1G80230/AT2G20800/AT2G46860/AT3G47950/AT4G04640/AT4G23710/AT4G25950/AT4G26210/AT4G32530/AT5G08670/AT5G40810 13
03008 Ribosome biogenesis in eukaryotes 6/650 76/3449 0.079 0.419 -2.468 0.998 1.000 0.875 AT1G10490/AT1G63810/AT2G24990/AT2G39820/AT4G25630/AT5G55080 6
03440 Homologous recombination 1/650 34/3449 0.029 0.156 -2.383 0.999 1.000 0.875 AT1G03810 1
03060 Protein export 2/650 46/3449 0.043 0.231 -2.531 0.999 1.000 0.875 AT1G78720/AT5G66970 2
00970 Aminoacyl-tRNA biosynthesis 2/650 48/3449 0.042 0.221 -2.618 0.999 1.000 0.875 AT1G17960/AT1G29870 2
00230 Purine metabolism 12/650 133/3449 0.090 0.479 -2.954 1.000 1.000 0.875 AT1G14250/AT1G61700/AT2G16570/AT2G27120/AT3G25960/AT3G49160/AT3G55650/AT4G12440/AT4G26390/AT4G34900/AT4G39940/AT5G51940 12
03040 Spliceosome 9/650 115/3449 0.078 0.415 -3.073 1.000 1.000 0.875 AT1G20960/AT1G56410/AT2G41500/AT3G50670/AT3G55200/AT4G03430/AT5G02490/AT5G25230/AT5G64200 9
03015 mRNA surveillance pathway 4/650 82/3449 0.049 0.259 -3.273 1.000 1.000 0.875 AT1G17760/AT1G54450/AT3G58390/AT5G28210 4
03010 Ribosome 21/650 220/3449 0.095 0.506 -3.645 1.000 1.000 0.875 AT1G12960/AT1G23410/AT1G35680/AT1G57660/AT1G58380/AT2G16360/AT2G37600/AT2G39590/AT2G47570/AT3G09680/AT3G11250/AT3G22230/AT3G27840/AT3G28500/AT3G48960/AT3G51190/AT5G02870/AT5G27850/AT5G39850/AT5G40040/... 21
00240 Pyrimidine metabolism 5/650 97/3449 0.052 0.274 -3.497 1.000 1.000 0.875 AT1G14250/AT1G61700/AT2G27120/AT4G20320/AT5G51940 5
03018 RNA degradation 1/650 56/3449 0.018 0.095 -3.291 1.000 1.000 0.875 AT5G18230 1
03050 Proteasome 1/650 58/3449 0.017 0.091 -3.362 1.000 1.000 0.875 AT5G35590 1
03013 Nucleocytoplasmic transport 5/650 121/3449 0.041 0.219 -4.213 1.000 1.000 0.875 AT1G07920/AT2G18720/AT2G44070/AT5G55080/AT5G55856 5

表7.2 KEGG富集分析部分结果:
ID:KEGG通路标识符,前面省略"map",比如“04120”代表“map04120”;
Description:KEGG通路的文字描述;
GeneRatio:该条目基因比例,分子是富集到这个KEGG通路上的基因的数目,分母是所有peak关联基因的数目;
BgRatio:背景比例,分母是物种全部有KEGG注释的基因的数目,分子是这些基因中注释到这个KEGG通路上面的基因的数目;
RichFactor​​:富集因子(Enrichment Factor)= GeneRatio / BgRatio;
​​FoldEnrichment​:富集倍数(Fold Enrichment)= (富集通路基因数 / 输入基因数) / (背景通路基因数 / 背景总基因数);
​​zScore​:标准化富集得分(基于超几何分布的 Z 值);
pvalue:富集的p值;
p.adjust:使用BH校正之后的p值;
qvalue:q值,使用FDR校正之后的p值,q-value相比于p-value更加严格,表示p-value产生假阳性的概率;
geneID:富集到这个KEGG通路上面的具体的基因ID;
Count:富集到这个KEGG通路上面的基因的数目。



图7.4 Peak关联基因KEGG气泡图。纵坐标是KEGG通路名称,横坐标是对应KEGG通路中检出的基因占背景基因的个数,颜色代表显著性,气泡大小代表该通路基因比例。



图7.5 Peak关联基因KEGG条状图。纵坐标是KEGG通路名称,横坐标是出现在该通路的基因数,颜色代表显著性。






8. Motif分析

对于一些基因元件或peak区域,分析这些区域的序列中是否有频繁出现的一些基序(motif),从而可以进一步分析这些基序相关的转录因子或结合蛋白。各种蛋白通过不同的motif识别蛋白-DNA结合位点,因此我们通过Homer(version 4.11.1)(Heinz S et al., 2010)来提取peak所在区间的序列对peak之间共有的motif进行扫描,查找其共有的motif区域,基于富集分析预测可能与peaks结合的蛋白。对于有组内生物学重复的样本,我们取其交集({组名}_consensus)进行motif分析。各样本分析结果位于report/result/7.motif文件夹中:
homerMotifs.motifs8/10/12:这些是de novo(从头预测)查找motif的输出文件,由motif长度分隔。
homerMotifs.all.motifs:由所有homerMotifs.motifs组成的连接文件。
motifFindingParameters.txt:用于执行findMotifsGenome.pl的命令,包含使用的参数
knownResults.txt:基于已知motifs富集的统计信息的文本文件(在EXCEL/WPS中打开)。
seq.autonorm.tsv:用于lower-order oligo标准化的autonormalization统计。
knownResults.html:基于已知motifs富集的格式化输出。
homerResults.html:de novo预测motif的格式化输出。



8.1 已知Motif分析

基于已知motifs富集的分析结果,请打开下方链接查看,其文件对应在各个文件夹下的“knownResults.html”

结果说明:
Rank(序号):根据显著性q-value排序;
Motif:展示motif的序列特征的logo图,可直观了解motif中各碱基的分布和保守性;
Name(Motif名称):HOMER数据库中motif的名称;
P-value(P值):未校正的显著性(基于超几何分布或泊松分布);
Log P-value(对数P值):P值的对数值,绝对值越大表示显著性越高;
q-value (Benjamini)(q值,Benjamini校正值):通过Benjamini-Hochberg方法进行的多重假设检验校正后的P值;
# Target Sequences with Motif(含有该motif的目标序列数量):包含该motif的基因组序列数量;
% of Targets Sequences with Motif(目标序列中含有该motif的比例):包含该motif的基因组序列占输入序列的百分比;
of Background Sequences with Motif (背景序列中含有该motif序列数量):背景序列(通常是全基因组序列)中包含该motif的序列数量;
% of Background Sequences with Motif(背景序列中含有该motif的比例):背景序列中含有该motif的序列所占的百分比。
Motif File:motif碱基分步矩阵结果;
SVG:motif的svg可视化文件;



8.2 从头预测Motif分析

基于de novo 从头预测的motifs富集的分析结果,请打开下方链接查看,其文件对应在各个文件夹下的“homerResults.html”。

结果说明:
Rank(序号):根据显著性q-value排序;
Motif:展示motif的序列特征的logo图,可直观了解motif中各碱基的分布和保守性;
P-value(P值):未校正的显著性(基于超几何分布或泊松分布);
Log P-value(对数P值):P值的对数值,绝对值越大表示显著性越高;
% of Targets(目标序列中含有该motif的比例):靶标序列占总序列百分比;
% of Background(背景序列中含有该motif的比例):背景序列占总序列百分比;
STD(Bg STD):靶标和背景的序列集出现偏离序列中心200bp的标准偏差;
Best Match/Details:最佳匹配的结果,点击 More information 后会出现更多信息——该motif的一些基本信息,如链接到motfi文件的超链接,下方match查看denovo motif和已知的motif的相似性比对结果打分, score越高代表越相似。;
Motif File:motif碱基分步矩阵结果。





参考文献


Andrews S. FastQC: a quality control tool for high throughput sequence data.https://www.bioinformatics.babraham.ac.uk/projects/fastqc/, 2010.
Chen S, Zhou Y, Chen Y, Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018 Sep 1;34(17):i884-i890.
Heinz S, Benner C, Spann N, Bertolino E, Lin YC, Laslo P, Cheng JX, Murre C, Singh H, Glass CK. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol Cell. 2010 May 28;38(4):576-89.
Langmead, B., Salzberg, S. Fast gapped-read alignment with Bowtie 2. Nat Methods 9, 357–359 (2012).
Landt SG, Marinov GK, Kundaje A, Kheradpour P, Pauli F, Batzoglou S, Bernstein BE, Bickel P, Brown JB, Cayting P, Chen Y, DeSalvo G, Epstein C, Fisher-Aylor KI, Euskirchen G, Gerstein M, Gertz J, Hartemink AJ, Hoffman MM, Iyer VR, Jung YL, Karmakar S, Kellis M, Kharchenko PV, Li Q, Liu T, Liu XS, Ma L, Milosavljevic A, Myers RM, Park PJ, Pazin MJ, Perry MD, Raha D, Reddy TE, Rozowsky J, Shoresh N, Sidow A, Slattery M, Stamatoyannopoulos JA, Tolstorukov MY, White KP, Xi S, Farnham PJ, Lieb JD, Wold BJ, Snyder M. ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia. Genome Res. 2012 Sep;22(9):1813-31.
Ramírez F, Ryan DP, Grüning B, Bhardwaj V, Kilpert F, Richter AS, Heyne S, Dündar F, Manke T. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Res. 2016 Jul 8;44(W1):W160-5.
Stark, Rory and Gord Brown. “DiffBind: Differential binding analysis of CUT&TAG peak data.” (2012).
Tarasov A, Vilella AJ, Cuppen E, Nijman IJ, Prins P. Sambamba: fast processing of NGS alignment formats. Bioinformatics. 2015 Jun 15;31(12):2032-4.
Wang, Q., Li, M., Wu, T., Zhan, L., Li, L., Chen, M., Xie, W., Xie, Z., Hu, E., Xu, S., & Yu, G. (2022). Exploring epigenomic datasets by ChIPseeker. Current Protocols, 2, e585.
Wu T, Hu E, Xu S, Chen M, Guo P, Dai Z, Feng T, Zhou L, Tang W, Zhan L, Fu X, Liu S, Bo X, Yu G. clusterProfiler 4.0: A universal enrichment tool for interpreting omics data. Innovation (Camb). 2021 Jul 1;2(3):100141.
Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, Nusbaum C, Myers RM, Brown M, Li W, Liu XS. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 2008;9(9):R137.

联系我们

官网:武汉鸿源韬生物科技有限公司 咨询热线:18086690478

邮箱:sales@hyt-bio.com 地址:湖北省武汉市江夏区郑店街光谷南(郑店)大健康产业园东湖高新国际健康城B地块1号楼7层02号房

报告文件结构

报告文件结构