鸿源韬生物

CUT&Tag生信分析报告

1.项目简介

1.1 样本信息

合同编号 RS202410002
实验技术 Cut&Tag
物种名称 拟南芥
拉丁名 Arabidopsis thaliana
参考基因组 tair10
报告生成日期 2025年07月29日


1.2 实验原理及流程

CUT&Tag (Cleavage Under Targets and Tagmentation) 是一种用于研究蛋白质-DNA相互作用的高通量测序技术。它利用抗体特异性识别目标蛋白,并通过Tn5转座酶在目标蛋白结合的DNA附近进行切割和标签化,从而实现目标蛋白结合位点的高效富集和测序(Kaya-Okur et al., 2019)。 CUT&Tag技术的核心原理:
抗体介导的靶向富集: 特异性一抗与目标表位结合,二抗(抗IgG)携带Protein A/G标签,招募后续的Protein A-Tn5融合蛋白。
Tn5转座酶的靶向切割和标签化: Protein A-Tn5融合蛋白携带测序接头(adapters),在抗体引导下定位到目标染色质区域。激活Tn5酶活性后仅在目标位点附近切割DNA并插入接头。
PCR扩增和测序: 对标签化的DNA片段进行PCR扩增,构建测序文库,并进行高通量测序。
CUT&Tag 技术相较于传统的CUT&Tag技术具有以下优势:
所需细胞量少: CUT&Tag 仅需少量细胞即可获得高质量的测序数据,适用于稀有细胞类型或临床样本的研究;
信噪比高: CUT&Tag 的背景噪音低,能够更准确地识别目标蛋白的结合位点;
实验周期短: CUT&Tag 的实验流程简化,实验周期较CUT&Tag更短。

CUT&Tag实验原理



1.3 实验流程

CUT&Tag 的实验流程主要包括以下步骤:
1.细胞收集和固定: 收集目标细胞并进行交联固定,以保持蛋白-DNA相互作用的稳定性。
2.细胞透化: 使用透化剂处理细胞,使抗体能够进入细胞核与目标蛋白结合。
3.抗体孵育: 加入特异性抗体,与目标蛋白结合。
4.Tn5转座酶孵育: 加入携带测序接头的Tn5转座酶,在目标蛋白结合的DNA附近进行切割和标签化。
5.DNA纯化: 纯化标签化的DNA片段。
6.PCR扩增: 对标签化的DNA片段进行PCR扩增,构建测序文库。
7.高通量测序: 对测序文库进行高通量测序,获得目标蛋白结合位点的信息。

1.4 分析流程

获得测序原始数据(raw data)后,首先对原始数据进行过滤,获得高质量的测序数据(clean data),将测序数据(clean data)比对到项目物种的参考基因组上,对比对结果进行鉴定峰位点(peak calling),对peak关联基因进行注释以及富集分析, 有生物学重复时进行差异Peak、共识Peak分析以及motif分析。


CUT&Tag生物信息学分析流程



2. 数据质控

我们交付的原始数据为fastq(简称fq)格式文件的压缩包,文件名后缀通常为 “.fq.gz”。交付数据前我们会计算每个压缩文件的md5值。在您拿到数据之后,请您先校>验每个压缩文件的md5值,Linux下可以在数据目录使用“md5sum -c <*md5.txt>”命令进行校验,Windows下可使用hashmyfiles等校验工具,如发现压缩文件md5值与附在数据文件目录下的md5文档中的不一致则说明文件可能在传输的过程中被损坏。数据文件大小为文件占用磁盘空间的大小,文件的大小通常与磁盘格式、压缩比例等因素有关,与测序数据量(碱基数)的多少无对应关系,因此对应PE测序的 read1和read2两个文件大小也可能不相同。

将高通量测序得到的原始图像数据经过Base Calling 转化为序列数据,即FASTQ格式,得到最原始的测序数据文件。FASTQ 格式文件可记录所测读段(read)的碱基及其质量分数。FASTQ 格式以测序读段为单位进行存储,每条读段占 4 行,第一行是序列标识(read ID)以及相关的描述信息,以“@” 开头;第二行即为碱基序列,长度由测序策略决定;第三行以“+”开头,后面是序列标示符、描述信息,或者什么也不加; 第四行是测序质量值(phred),与第二行一一对应,phred值以ASCII码标记,对应的 ASCII 值减去33,即为第二行对应碱基的测序质量值,示例如下:

@HWI-ST1276:71:C1162ACXX:1:1101:1208:2458 1:N:0:CGATGT
NAAGAACACGTTCGGTCACCTCAGCACACTTGTGAATGTCATGGGATCCAT
+
#55???BBBBB?BA@DEEFFCFFHHFFCFFHHHHHHHFAE0ECFFD/AEHH

测序错误率用e表示, 平台测得数据的碱基质量值用Qphred表示,则有:Qphred=-10log10(e)。软件中碱基识别正确率与Phred分值之间的简明对应关系见下表:

Phred分值不正确的碱基识别碱基正确识别率Q-score
101/1090%Q10
201/10099%Q20
301/100099.9%Q30

测序Reads的错误率往往会随着测序接近尾声而升高,这是由测序过程中化学试剂的消耗造成共有的特征。



2.1 原始数据质控

CUT&Tag实验基于第二代测序(NGS)平台完成,采用双端测序文库构建策略(插入片段~300 bp)。我们需要对原始测序数据进行质量评估与过滤,以确保后续分析的可靠性。首先,我们使用FastQC(version 0.12.1)(Andrews, 2010)对原始测序数据(raw data)进行全局质量分析,包括碱基质量分布(Phred score)、碱基组成平衡性(base content uniformity)、重复序列比例(duplication level)及GC含量偏差等指标,以全面评估测序质量。
我们使用Fastp(version 0.24.0)(Chen et al., 2018)对原始测序数据进行以下过滤操作。
接头序列去除:识别并切除双端reads中的接头序列;
低复杂度序列过滤:剔除含模糊碱基(N碱基占比≥10%)的reads;
动态质量修剪:通过滑动窗口法(5 bp窗口步长)评估局部序列质量,当窗口平均Phred score小于20时,执行3'端截断;
长度筛选:保留长度≥25 bp的paired-end reads,长度不足的reads及其匹配reads(R1/R2)均被排除。
原始和过滤后质控结果请详见result/1.qc文件夹,raw为原始数据质控结果,clean为过滤后质控结果。



图2.1 各个样本平均测序碱基质量分数,横坐标代表150 bp长度序列中各个位置,纵坐标为该位置平均的碱基质量值Q;盒形图中间的红线表示中位数(median value);黄色部分代表四分位距(25-75%);上下分割线代表 90%和 10%的上下临界值;蓝色的线代表碱基质量的平均值。


图2.2 各个样本碱基平衡性,图中四条线代表A T C G在每个位置平均含量。理论上,A和T应该相等,G和C应该相等,且4种碱基平行且接近分布。正常情况下四种碱基的出现频率应该是接近的,而且没有位置差异。因此好的样本中四条线应该平行且接近。当部分位置碱基的比例出现 bias 时,即四条线波动较大时可能存在测序数据或者文库污染。如果所有位置的碱基比例一致的表现出bias 时,即四条线平行但分开,往往代表文库有 bias (建库过程或本身特点),或者是测序中的系统误差。测序刚开始由于测序仪状态不稳定,在15bp之前很可能出现波动。


图2.3 各个样本每条序列GC含量,各个样本每条序列GC含量,横轴为序列GC含量百分比; 纵轴是每条序列GC含量对应的数量,蓝色的线是程序根据经验分布给出的理论值,红色是真实值,理想情况下两者接近,当红色的线出现双峰时,可能存在其他物种DNA污染。


图2.4 各个样本重复序列水平,测序深度越高,越容易产生一定程度的重复(duplication),这属于正常的现象。但如果duplication 的程度很高,就提示我们可能有 bias 的存在(如建库过程中由于 PCR 扩增引起的duplication)。横坐标为 reads 重复的次数,纵坐标为重复次数对应的 reads 占 unique reads 的比例,以unique reads 的总数作为 100%。这里,我们仅对文件前 2000000 个reads 进行统计:对长度小于75bp 的reads 将其截短为 50bp,用于统计重复。


2.2 过滤后数据质控

这里展示Fastp过滤后的数据质控结果,图片内容与上面raw data类似。

图2.5 各个样本平均测序碱基质量分数,横坐标代表150 bp长度序列中各个位置,纵坐标为该位置平均的碱基质量值Q;盒形图中间的红线表示中位数(median value);黄色部分代表四分位距(25-75%);上下分割线代表 90%和 10%的上下临界值;蓝色的线代表碱基质量的平均值。

图2.6 各个样本碱基平衡性,图中四条线代表A T C G在每个位置平均含量。理论上,A和T应该相等,G和C应该相等,且4种碱基平行且接近分布。正常情况下四种碱基的出现频率应该是接近的,而且没有位置差异。因此好的样本中四条线应该平行且接近。当部分位置碱基的比例出现 bias 时,即四条线在某些位置波动较大时,可能测序数据或者文库存在污染。当所有位置的碱基比例一致的表现出bias 时,即四条线平行但分开,往往代表文库有 bias (建库过程或本身特点),或者是测序中的系统误差。一般测序的时候,刚开始测序仪状态不稳定,在15bp之前很可能出现波动。

图2.7 各个样本每条序列GC含量,横轴为序列GC含量百分比; 纵轴是每条序列GC含量对应的数量,蓝色的线是程序根据经验分布给出的理论值,红色是真实值,理想情况下两者接近,当红色的线出现双峰时,可能存在其他物种DNA污染。

图2.8 各个样本重复序列水平,测序深度越高,越容易产生一定程度的重复(duplication),这属于正常的现象。但如果duplication 的程度很高,就提示我们可能有 bias 的存在(如建库过程中由于 PCR 扩增引起的duplication)。横坐标为 reads 重复的次数,纵坐标为重复次数对应的 reads 占 unique reads 的比例,以unique reads 的总数作为 100%。这里,我们仅对文件前 2000000 个reads 进行统计:对长度小于75bp 的reads 将其截短为 50bp,用于统计重复。



2.3 数据过滤结果统计

我们对数据过滤结果进行统计,如下表所示:

Sample Raw_Total_Reads Raw_Total_Bases Raw_Q20_Rate Raw_Q30_Rate Raw_GC_Content Clean_Total_Reads Clean_Total_Bases Clean_Q20_Rate Clean_Q30_Rate Clean_GC_Content
H3K4me3_case_rep1_IP 14.61M 2191.11M 0.946 0.863 0.408 14.45M 2096.09M 0.950 0.868 0.403
H3K4me3_case_rep2_IP 15.25M 2286.87M 0.952 0.878 0.408 15.01M 2176.97M 0.957 0.885 0.403
H3K4me3_case_rep3_IP 13.23M 1983.86M 0.949 0.868 0.409 13.08M 1888.64M 0.953 0.874 0.403
H3K4me3_control_rep1_IP 16.32M 2448.36M 0.948 0.872 0.412 16.03M 2301.62M 0.955 0.881 0.405
H3K4me3_control_rep2_IP 14.56M 2183.40M 0.951 0.870 0.411 14.46M 2092.24M 0.954 0.875 0.406
H3K4me3_control_rep3_IP 18.82M 2822.82M 0.947 0.867 0.411 18.69M 2632.14M 0.954 0.875 0.401

表 2.1数据过滤结果统计:
Sample:样品名称;
Raw_Total_Reads/Clean_Total_Reads:过滤前后样本总reads数量,单位为百万;
Raw_Total_Bases/Clean_Total_Bases:过滤前后样本总碱基数量,单位为百万;
Raw_Q20_Rate/Clean_Q20_Rate:过滤前后样本Q20碱基比例;
Raw_Q30_Rate/Clean_Q30_Rate:过滤前后样本Q30碱基比例;
Raw_GC_Content/Clean_GC_Content:过滤前后样本GC含量。



2.4 插入片段长度分布

CUT&Tag 插入片段长度分布与抗体类型有关,对于组蛋白修饰类(如 H3K27me3)宽峰信号,片段分布广(200–1000bp),但大部分集中在约300–600bp;对于组蛋白活性标记(如 H3K4me3)更靠近 TSS,片段较短(~100–400bp);对于TF类多数片段小于120bp,通常为窄峰。

图2.7 各样本插入片段长度分布,插入片段即文库构建时DNA序列。






3. 比对参考基因组

我们将各样品过滤后的clean data的reads与参考基因组进行比对,获取Reads在参考基因组上的定位信息,这里使用的软件是Bowtie2(version 2.4.5)(Langmead B. et al., 2018)。来自一个DNA片段的多个拷贝,可能会锚定在多个read上,经过测序得到的这些reads就是PCR重复。PCR本身就是为了产生重复序列的。理论上来讲,不同的序列在进行PCR扩增时,扩增的倍数应该是相同的。但是由于聚合酶的偏好性,PCR扩增次数过多的情况下,会导致一些序列持续扩增,而另一些序列扩增到一定程度后便不再进行,也就是我们常说的PCR偏好性。因此,比对完成后我们使用软件Sambamba(version 1.0.1)(Tarasov A. et al., 2015)去除PCR重复,获取unique reads。

3.1 比对参考基因组情况

sample clean_reads PCR_dup PCR_dup % prop_map_reads prop_map % MAPQ30
H3K4me3_control_rep1_IP 16,028,898 5,720,621 35.690 11,211,468 69.950 15,014,730
H3K4me3_case_rep2_IP 15,014,300 6,004,460 39.990 10,143,498 67.560 14,076,103
H3K4me3_control_rep2_IP 14,464,472 5,123,116 35.420 10,539,264 72.860 13,607,479
H3K4me3_case_rep1_IP 14,454,782 5,605,582 38.780 10,005,760 69.220 13,520,774
H3K4me3_control_rep3_IP 18,687,456 6,078,369 32.530 13,939,596 74.590 17,503,610
H3K4me3_case_rep3_IP 13,079,880 4,382,059 33.500 8,982,592 68.670 12,292,485

表 3.1比对结果统计:
Sample:样品名称;
clean_reads:clean后reads总数;
PCR_dup:鉴定为PCR重复的reads数;
PCR_dup %:PCR重复reads百分比;
prop_map_reads:完美比对的reads总数,PE两端reads比对到同一条序列,且根据比对结果推断的插入片段大小符合设置的阈值;
prop_map %:完美比对reads百分比;
MAPQ30:MAPQ值大于30的reads数。




3.2 Reads富集情况

我们使用Deeptools(version 3.5.4)(Ramírez F. et al., 2016)软件对reads富集情况进行可视化,绘制信号在基因不同区域(Transcription Start Site,转录起始位点,TSS; Transcription End Site,转录终止位点,TES)的分布。对于可能富集在基因区域或者启动子区域的靶蛋白,其IP信号会富集在基因的TSS上游到TES区域,且显著高于INPUT信号。


图3.1 各样本reads富集情况。横坐标为相对基因位置,纵坐标为按照基因组大小RPGC标准化后reads富集分数。



图3.2 各样本reads富集热图。下方热图代表基因上下游Reads富集情况,每一行代表一个基因上下游区域reads富集程度。




3.3 比对可视化

软件比对所得结果为bam格式文件(位于report/result/2.map文件夹中),bam文件是压缩的⼆进制⽂件,无法直接作为文本打开查看。由于bam文件数据较大,我们将其转为较小的bw格式文件。客户可以结合物种参考基因组和注释文件使用IGV (Integrative Genomics Viewer) 浏览器对bam、bw、bed等文件进行可视化浏览。IGV浏览器使用方法可参考我们提供的使用说明文档IGV快速上手







4. 峰鉴定

我们使用MACS3(version 3.0.0)(Zhang Y. et al. 2008)进行peak鉴定,即找到靶蛋白富集的基因组区域。MACS(Model-based Analysis of ChIP-Seq)是一种基于统计学模型的算法,专用于从免疫沉淀测序数据中精准识别转录因子结合位点或组蛋白修饰富集区域(Peaks)。其核心原理是通过构建动态背景噪声模型以区分特异性结合与非随机分布的背景事件。算法首先通过双滑动窗口扫描基因组,利用泊松分布或负二项分布评估局部富集信号的显著性(P-value),并结合片段长度推断(Fragment Size)优化覆盖深度分析;随后通过两步法(粗筛峰与精细调整)定位峰边界,并确定信号峰值中心(Summit)(如果是broad模式则无summit)。使用Phantompeakqualtools(version 1.2.2)(Landt SG. et al. 2012)进行基于phantom peak的数据质量评估,计算NSC和RSC。本节结果请详见位于report/result/3.peak文件夹中

4.1 Peak信息统计

sample Peak num FRIP Peak reads Total reads PBC1 PBC2 NRF
H3K4me3_case_rep1 12,684 0.781 6,752,173 8,649,374 0.775 5.847 0.680
H3K4me3_case_rep2 12,681 0.806 7,090,064 8,796,848 0.773 5.803 0.677
H3K4me3_case_rep3 12,277 0.717 6,111,759 8,522,990 0.773 5.817 0.677
H3K4me3_control_rep1 13,228 0.824 8,310,433 10,080,674 0.754 5.506 0.639
H3K4me3_control_rep2 13,157 0.841 7,706,789 9,159,918 0.757 5.542 0.647
H3K4me3_control_rep3 13,523 0.746 9,270,375 12,419,371 0.752 5.426 0.637

表 4.1 Peak信息统计:
sample:样品名称;
Peak num:Peak数量;
FRiP(Fraction of Reads in Peaks)值表示映射到峰区的 reads 占总 reads 的比例,反映了ATAC-Seq实验的Tn5酶富集效果。较高的 FRiP 值表明实验成功地富集了目标区域的 DNA 片段,而较低的 FRiP 值可能表明富集效果差或背景噪声较高。
Peak reads:映射到峰区域的reads数;Total reads:样本总reads数。
NSC(Normalized strand cross-correlation coefficent)衡量富集区域的标准化交叉相关性得分。NSC值越大表明富集效果越好,NSC值低于1.1表明较弱的富集,小于1表示无富集。NSC值稍微低于1.05,有较低的信噪比或很少的峰,这可能是生物学真实现象,比如有的因子在特定组织类型中只有很少的结合位点;也可能确实是数据质量差。
RSC(Relative strand cross-correlation coefficient)评估ChIP-Seq数据中的伪峰情况,RSC是片段长度相关值减去背景相关值除以phantom-peak相关值减去背景相关值。RSC的最小值可能是0,表示无信号;富集好的实验RSC值大于1;低于1表示质量低。
Qv:MACS3 callpeak使用的qvalue阈值。




4.2 Call Peak结果

各个样本peak信息结果表部分内容如下,完整信息请查看report/result/3.peak/{样本名称}_peaks.tsv表格。“.broadPeak”或“.narrowPeak”文件为MACS3的输出文件,用于描述峰区域信息,可在IGV浏览器中打开。

显示前100行 (共12707行)
chr start end length abs_summit pileup -log10(pvalue) fold_enrichment -log10(qvalue) name
1 4259 4684 426 4443 28 5.386 2.432 4.154 H3K4me3_case_rep1_peak_1
1 8413 8847 435 8607 54 20.645 4.613 18.876 H3K4me3_case_rep1_peak_2
1 22878 24084 1207 23848 68 31.258 5.787 29.221 H3K4me3_case_rep1_peak_3
1 32783 33747 965 33363 101 41.787 5.383 39.519 H3K4me3_case_rep1_peak_4
1 37550 38470 921 37738 71 23.059 4.094 21.226 H3K4me3_case_rep1_peak_5
1 46122 46742 621 46582 93 27.169 3.828 25.231 H3K4me3_case_rep1_peak_6
1 48407 49257 851 48810 144 59.773 5.556 57.144 H3K4me3_case_rep1_peak_7
1 50559 51138 580 50917 77 18.057 3.170 16.363 H3K4me3_case_rep1_peak_8
1 57290 58047 758 57500 41 12.155 3.523 10.646 H3K4me3_case_rep1_peak_9
1 58170 58742 573 58248 35 8.784 3.019 7.400 H3K4me3_case_rep1_peak_10
1 63429 63928 500 63784 34 8.261 2.936 6.898 H3K4me3_case_rep1_peak_11
1 66913 67502 590 67114 35 7.834 2.786 6.490 H3K4me3_case_rep1_peak_12
1 72188 72884 697 72556 85 25.871 3.935 23.964 H3K4me3_case_rep1_peak_13
1 75501 76558 1058 75741 104 41.479 5.190 39.217 H3K4me3_case_rep1_peak_14
1 86815 87141 327 86909 94 24.727 3.516 22.851 H3K4me3_case_rep1_peak_15
1 88963 89224 262 89095 78 12.534 2.485 11.015 H3K4me3_case_rep1_peak_16
1 91341 92154 814 91754 170 68.446 5.429 65.663 H3K4me3_case_rep1_peak_17
1 96808 97083 276 96844 56 6.268 2.005 4.999 H3K4me3_case_rep1_peak_18
1 97224 98114 891 97689 116 43.072 4.864 40.777 H3K4me3_case_rep1_peak_19
1 99910 100196 287 100096 50 7.484 2.290 6.157 H3K4me3_case_rep1_peak_20
1 109120 110104 985 109565 88 39.816 5.865 37.590 H3K4me3_case_rep1_peak_21
1 114788 115291 504 115041 57 9.561 2.476 8.149 H3K4me3_case_rep1_peak_22
1 117274 118505 1232 117682 104 34.009 4.267 31.911 H3K4me3_case_rep1_peak_23
1 120499 120760 262 120692 50 8.690 2.492 7.314 H3K4me3_case_rep1_peak_24
1 127873 128450 578 128193 118 75.043 9.440 72.151 H3K4me3_case_rep1_peak_25
1 135857 136978 1122 136301 70 22.308 4.020 20.494 H3K4me3_case_rep1_peak_26
1 141796 142898 1103 142319 126 55.269 5.853 52.725 H3K4me3_case_rep1_peak_27
1 143255 143699 445 143445 48 7.373 2.311 6.050 H3K4me3_case_rep1_peak_28
1 148946 150321 1376 149543 73 30.184 5.200 28.173 H3K4me3_case_rep1_peak_29
1 154886 156218 1333 155335 120 42.349 4.636 40.069 H3K4me3_case_rep1_peak_30
1 157097 157454 358 157236 50 4.741 1.843 3.548 H3K4me3_case_rep1_peak_31
1 157816 158190 375 157937 80 16.048 2.854 14.415 H3K4me3_case_rep1_peak_32
1 161732 161993 262 161849 53 9.090 2.489 7.697 H3K4me3_case_rep1_peak_33
1 163238 163989 752 163595 67 21.637 4.048 19.842 H3K4me3_case_rep1_peak_34
1 172259 173596 1338 173268 151 100.005 10.302 96.740 H3K4me3_case_rep1_peak_35
1 185190 185959 770 185487 212 74.546 4.771 71.663 H3K4me3_case_rep1_peak_36
1 187064 188013 950 187408 331 177.030 7.919 172.965 H3K4me3_case_rep1_peak_37
1 195559 196662 1104 196176 153 75.821 6.873 72.916 H3K4me3_case_rep1_peak_38
1 220345 220872 528 220600 55 21.357 4.697 19.568 H3K4me3_case_rep1_peak_39
1 226018 227210 1193 226619 75 28.713 4.817 26.737 H3K4me3_case_rep1_peak_40
1 228647 229716 1070 228832 51 12.527 3.140 11.009 H3K4me3_case_rep1_peak_41
1 236724 238151 1428 237141 107 50.416 6.273 47.968 H3K4me3_case_rep1_peak_42
1 248727 250097 1371 249836 44 13.315 3.622 11.771 H3K4me3_case_rep1_peak_43
1 258841 259215 375 258980 38 9.464 3.043 8.055 H3K4me3_case_rep1_peak_44
1 262569 263494 926 263238 76 25.567 4.258 23.670 H3K4me3_case_rep1_peak_45
1 268269 270332 2064 268614 178 38.191 3.135 36.001 H3K4me3_case_rep1_peak_46
1 270487 271703 1217 270706 239 69.349 4.012 66.550 H3K4me3_case_rep1_peak_47
1 278007 279669 1663 278440 208 79.501 5.189 76.539 H3K4me3_case_rep1_peak_48
1 280099 280360 262 280190 81 8.518 2.013 7.147 H3K4me3_case_rep1_peak_49
1 284524 285245 722 284987 58 14.171 3.175 12.598 H3K4me3_case_rep1_peak_50
1 293126 294259 1134 293466 128 47.853 4.933 45.457 H3K4me3_case_rep1_peak_51
1 297686 297950 265 297871 79 6.956 1.864 5.651 H3K4me3_case_rep1_peak_52
1 298107 299608 1502 298662 142 34.531 3.398 32.420 H3K4me3_case_rep1_peak_53
1 303457 304001 545 303681 146 41.258 3.837 39.002 H3K4me3_case_rep1_peak_54
1 305767 306401 635 306156 142 48.311 4.525 45.906 H3K4me3_case_rep1_peak_55
1 308965 309570 606 309310 84 24.057 3.733 22.199 H3K4me3_case_rep1_peak_56
1 315388 316459 1072 316204 143 59.336 5.551 56.715 H3K4me3_case_rep1_peak_57
1 322087 324123 2037 323395 132 32.805 3.441 30.734 H3K4me3_case_rep1_peak_58
1 325135 325762 628 325291 93 15.063 2.530 13.463 H3K4me3_case_rep1_peak_59
1 330891 331162 272 331058 34 7.765 2.811 6.423 H3K4me3_case_rep1_peak_60
1 336576 338611 2036 337632 253 120.347 6.722 116.825 H3K4me3_case_rep1_peak_61
1 343579 344413 835 343947 289 105.124 4.995 101.790 H3K4me3_case_rep1_peak_62
1 345883 347261 1379 346450 265 90.840 4.693 87.707 H3K4me3_case_rep1_peak_63
1 353731 355081 1351 354459 524 297.435 8.738 293.011 H3K4me3_case_rep1_peak_64
1 362179 363551 1373 362916 254 142.999 8.404 139.238 H3K4me3_case_rep1_peak_65
1 373323 374035 713 373537 108 60.515 7.818 57.872 H3K4me3_case_rep1_peak_66
1 388835 390229 1395 389420 191 73.585 5.212 70.716 H3K4me3_case_rep1_peak_67
1 390583 390849 267 390693 78 7.645 1.946 6.312 H3K4me3_case_rep1_peak_68
1 395562 396547 986 395899 198 89.651 6.256 86.536 H3K4me3_case_rep1_peak_69
1 401555 401880 326 401660 112 20.684 2.772 18.914 H3K4me3_case_rep1_peak_70
1 403741 405146 1406 404702 226 71.394 4.319 68.562 H3K4me3_case_rep1_peak_71
1 408740 409513 774 408940 195 35.549 2.824 33.417 H3K4me3_case_rep1_peak_72
1 410660 411215 556 410900 195 49.143 3.545 46.721 H3K4me3_case_rep1_peak_73
1 411632 412528 897 412004 215 61.066 3.924 58.412 H3K4me3_case_rep1_peak_74
1 428568 430761 2194 429011 64 24.533 4.733 22.660 H3K4me3_case_rep1_peak_75
1 442104 442636 533 442336 55 17.861 3.972 16.176 H3K4me3_case_rep1_peak_76
1 445337 446167 831 445893 61 12.867 2.878 11.335 H3K4me3_case_rep1_peak_77
1 449214 450766 1553 450256 107 37.446 4.559 35.272 H3K4me3_case_rep1_peak_78
1 460286 460732 447 460524 68 20.451 3.810 18.689 H3K4me3_case_rep1_peak_79
1 462938 463900 963 463641 95 32.677 4.440 30.609 H3K4me3_case_rep1_peak_80
1 472142 472581 440 472332 90 14.631 2.529 13.042 H3K4me3_case_rep1_peak_81
1 474375 474976 602 474594 129 33.258 3.536 31.176 H3K4me3_case_rep1_peak_82
1 477807 478068 262 477922 48 4.806 1.877 3.610 H3K4me3_case_rep1_peak_83
1 486918 487377 460 487121 70 19.201 3.542 17.476 H3K4me3_case_rep1_peak_84
1 487822 488301 480 488167 45 6.820 2.277 5.520 H3K4me3_case_rep1_peak_85
1 490262 490864 603 490421 71 11.270 2.449 9.794 H3K4me3_case_rep1_peak_86
1 491610 491871 262 491650 53 5.480 1.924 4.245 H3K4me3_case_rep1_peak_87
1 493459 494525 1067 493702 161 80.082 6.931 77.110 H3K4me3_case_rep1_peak_88
1 518062 519670 1609 518534 81 29.454 4.615 27.461 H3K4me3_case_rep1_peak_89
1 537613 538815 1203 538034 75 36.830 6.331 34.669 H3K4me3_case_rep1_peak_90
1 541545 541806 262 541763 37 8.941 2.965 7.552 H3K4me3_case_rep1_peak_91
1 569079 570193 1115 569637 232 119.642 7.422 116.130 H3K4me3_case_rep1_peak_92
1 574194 575577 1384 574451 109 29.011 3.588 27.027 H3K4me3_case_rep1_peak_93
1 580288 580995 708 580739 99 42.276 5.562 39.997 H3K4me3_case_rep1_peak_94
1 583960 584632 673 584366 74 28.614 4.859 26.641 H3K4me3_case_rep1_peak_95
1 591803 593262 1460 592594 88 39.457 5.805 37.240 H3K4me3_case_rep1_peak_96
1 601685 603028 1344 602046 56 18.579 4.058 16.869 H3K4me3_case_rep1_peak_97
1 608500 609459 960 609199 51 18.557 4.361 16.848 H3K4me3_case_rep1_peak_98
1 618177 619509 1333 619030 97 26.669 3.652 24.743 H3K4me3_case_rep1_peak_99
1 619760 620031 272 619933 67 11.479 2.547 9.997 H3K4me3_case_rep1_peak_100

表 4.2 Call Peak结果。样本_peaks.tsv是一个表格文件,其中包含有关被调用峰的信息。您可以在excel/WPS中打开它并使用函数进行排序/过滤。各列信息为:
1.chr,染色体名称;
2.start,peak起始位置;
3.end,peak的结束位置;
4.length,peak长度;
5.abs_summit,峰顶的位置(absolute peak summit position),narrowPeak存在而broadPeak不存在;
6.pileup,峰顶上的堆积高度(pileup height at peak summit);
7.-log10(pvalue) for the peak(例如 pvalue =1e-10,那么这个值应该是 10);
8.fold_enrichment,该峰的富集倍数,与该位置λ的随机泊松分布相对应,peak文件中signalValue列等于该列;
9.-log10(qvalue) for the peak,peak文件中score列是该列数值x10。







5. 差异Peak分析

在存在多个分组且组内有生物学重复的情况下,可以对组间进行差异Peak(differential peak)分析,以确定哪些Peak在组间存在显著差异,同时获取组内共识峰(consensus peak)。如果没有差异分析则本节内容为空。

5.1 差异Peak分析结果

存在组内生物学重复时,我们使用软件DiffBind(version 3.10)(Stark,R., & Brown,G.,2012)对样本peaks进行分析。结果详见report/result/4.peak。后文中提到的“diff”代表组间差异,“cons”代表组内交集。
DiffPeak:“sampAvssampB_res.csv”为各组样本差异分析结果,sampA代表实验组,sampB代表对照组; DiffPeak_sampAvssampB_up.bed为sampA和sampB比较,结合强度上调的peak; DiffPeak_sampA vs sampB_down.bed为sampA和sampB比较,结合强度下调的peak; sampA_consensus_peaks.bed为A组组内共识峰。



图5.1 比较组PCA图。主成分分析是将原来较多维度的指标 (peak 的分布特征),降维到较低的维度(二维),来研究样品间的主成分关系。二维PCA分析结果中,会展示主成分1(PC1) 和主成分2(PC2)分别作为 X 轴和 Y 轴的散点图,每个点代表 1 个样本。坐标轴上百分比代表主成分的贡献率,贡献率越大,说明该主成分对样本差异的解释能力越强。如果两个样本距离越远,则说明样本 peaks 分布的差异越大。 反之,则说明相应样本peaks整体分布模式越接近。所以,PCA 分析常用于评估样本重复性的好坏。理想情况下,生物学重复的样本应该聚类在一起,而不同组间应该可以区分开。



图5.2 差异Peak火山图。横坐标为log2(Fold Change),纵坐标为-log10(FDR),蓝色为显著性下调的峰,红色为显著性上调的峰,灰色为非显著性差异的峰。







6. 基因组注释

为了进一步探讨peak结合位点特征,理解染色质开放区域对基因调控的机制, 使用R包ChIPseeker(version 1.36)(Wang et al., 2022)对Peak区域进行注释,我们统计Peak在各基因功能元件分布情况,并将各个peak与基因关联。本节结果请详见位于report/result/5.anno文件夹。

6.1 Peak 在基因组分布

图6.1 Peak在基因功能元件上分布饼图。
一般来说,peaks最多的区域是位于转录起始点(TSS)上游1kb的启动子区域“promoter(<=1kb)”,它与基因的表达调控密切相关;“promoter(1~2kb)”代表TSS上游1~2kb的启动子区域,“promoter(2~3kb)”代表TSS上游2~3kb的启动子区域。
5'非翻译区(5' UTR)和外显子区域(Exon)与mRNA的稳定性或基因表达的调控有关。
3'非翻译区(3' UTR)、内含子(Intron)、远端基因间区(Distal Intergenic)以及TSS下游区(Downstream),这些区域的调控活动可能涉及长距离的基因调控或影响基因的后续处理和表达。



图6.2 各样本Peak在基因功能元件上分布比例堆叠条状图,samples代表单个样本,cons代表组内共识峰,diff代表组间差异峰。



图6.3 各样本Peak在TSS(转录起始位点)侧翼分布比例堆叠条状图,samples代表单个样本,cons代表组内共识峰,diff代表组间差异峰,各元件内容含义见图 6.1。



6.2 Peak关联基因注释

各个样本Peak关联基因注释结果表部分内容如下,完整信息请查看report/result/5.anno/{样本名称}_PeakAnno.csv表格。{组名}_PeakAnno.csv代表组内共识峰注释结果,{比较组}_{up/down}_PeakAnno.csv代表组间差异峰注释结果。

显示前100行 (共12684行)
chr start end peaknum annotation geneChr geneStart geneEnd geneLength geneStrand geneId transcriptId distanceToTSS
1 4258 4684 H3K4me3_case_rep1_peak_1 Promoter (<=1kb) 1 3631 5899 2269 1 AT1G01010 AT1G01010.1 628
1 8412 8847 H3K4me3_case_rep1_peak_2 Promoter (<=1kb) 1 6788 8737 1950 2 AT1G01020 AT1G01020.2 0
1 22877 24084 H3K4me3_case_rep1_peak_3 Promoter (<=1kb) 1 23121 31227 8107 1 AT1G01040 AT1G01040.1 0
1 32782 33747 H3K4me3_case_rep1_peak_4 Promoter (<=1kb) 1 31170 33171 2002 2 AT1G01050 AT1G01050.1 0
1 37549 38470 H3K4me3_case_rep1_peak_5 Promoter (<=1kb) 1 33365 37871 4507 2 AT1G01060 AT1G01060.6 0
1 46121 46742 H3K4me3_case_rep1_peak_6 Promoter (<=1kb) 1 45296 47019 1724 2 AT1G01080 AT1G01080.2 277
1 48406 49257 H3K4me3_case_rep1_peak_7 Promoter (<=1kb) 1 47234 49304 2071 2 AT1G01090 AT1G01090.1 47
1 50558 51138 H3K4me3_case_rep1_peak_8 Promoter (<=1kb) 1 50090 51108 1019 2 AT1G01100 AT1G01100.1 0
1 57289 58047 H3K4me3_case_rep1_peak_9 Promoter (1-2kb) 1 57164 59215 2052 2 AT1G01120 AT1G01120.1 1168
1 58169 58742 H3K4me3_case_rep1_peak_10 Promoter (<=1kb) 1 57164 59215 2052 2 AT1G01120 AT1G01120.1 473
1 63428 63928 H3K4me3_case_rep1_peak_11 Promoter (<=1kb) 1 61905 63811 1907 2 AT1G01130 AT1G01130.1 0
1 66912 67502 H3K4me3_case_rep1_peak_12 Promoter (<=1kb) 1 64167 67625 3459 2 AT1G01140 AT1G01140.2 123
1 72187 72884 H3K4me3_case_rep1_peak_13 Promoter (<=1kb) 1 72339 74096 1758 1 AT1G01160 AT1G01160.1 0
1 75500 76558 H3K4me3_case_rep1_peak_14 Promoter (<=1kb) 1 75390 76845 1456 1 AT1G01180 AT1G01180.1 111
1 86814 87141 H3K4me3_case_rep1_peak_15 Promoter (1-2kb) 1 86486 88409 1924 2 AT1G01200 AT1G01200.1 1268
1 88962 89224 H3K4me3_case_rep1_peak_16 Promoter (<=1kb) 1 88890 89745 856 1 AT1G01210 AT1G01210.1 73
1 91340 92154 H3K4me3_case_rep1_peak_17 Promoter (<=1kb) 1 91342 95681 4340 1 AT1G01220 AT1G01220.1 0
1 96807 97083 H3K4me3_case_rep1_peak_18 Promoter (<=1kb) 1 97412 99240 1829 1 AT1G01230 AT1G01230.1 -329
1 97223 98114 H3K4me3_case_rep1_peak_19 Promoter (<=1kb) 1 97412 99240 1829 1 AT1G01230 AT1G01230.1 0
1 99909 100196 H3K4me3_case_rep1_peak_20 Promoter (<=1kb) 1 99872 101882 2011 1 AT1G01240 AT1G01240.5 38
1 109119 110104 H3K4me3_case_rep1_peak_21 Promoter (<=1kb) 1 109076 111535 2460 1 AT1G01260 AT1G01260.2 44
1 114787 115291 H3K4me3_case_rep1_peak_22 Promoter (<=1kb) 1 114202 116407 2206 1 AT1G01290 AT1G01290.2 586
1 117273 118505 H3K4me3_case_rep1_peak_23 Promoter (<=1kb) 1 116784 118845 2062 1 AT1G01300 AT1G01300.1 490
1 120498 120760 H3K4me3_case_rep1_peak_24 Promoter (<=1kb) 1 120154 121130 977 1 AT1G01310 AT1G01310.1 345
1 127872 128450 H3K4me3_case_rep1_peak_25 Exon (AT1G01320.3/AT1G01320, exon 6 of 25) 1 121124 130570 9447 2 AT1G01320 AT1G01320.1 2120
1 135856 136978 H3K4me3_case_rep1_peak_26 Promoter (<=1kb) 1 132270 135924 3655 2 AT1G01340 AT1G01340.2 0
1 141795 142898 H3K4me3_case_rep1_peak_27 Promoter (<=1kb) 1 141870 143261 1392 1 AT1G01360 AT1G01360.1 0
1 143254 143699 H3K4me3_case_rep1_peak_28 Promoter (<=1kb) 1 143489 146042 2554 1 AT1G01370 AT1G01370.1 0
1 148945 150321 H3K4me3_case_rep1_peak_29 Promoter (<=1kb) 1 148013 149848 1836 2 AT1G01390 AT1G01390.2 0
1 154885 156218 H3K4me3_case_rep1_peak_30 Promoter (<=1kb) 1 154367 156178 1812 2 AT1G01420 AT1G01420.2 0
1 157096 157454 H3K4me3_case_rep1_peak_31 Promoter (<=1kb) 1 154367 156178 1812 2 AT1G01420 AT1G01420.2 -919
1 157815 158190 H3K4me3_case_rep1_peak_32 Promoter (<=1kb) 1 156477 158823 2347 2 AT1G01430 AT1G01430.1 633
1 161731 161993 H3K4me3_case_rep1_peak_33 Promoter (<=1kb) 1 159628 162953 3326 2 AT1G01440 AT1G01440.1 960
1 163237 163989 H3K4me3_case_rep1_peak_34 Promoter (<=1kb) 1 163278 166353 3076 1 AT1G01448 AT1G01448.1 0
1 172258 173596 H3K4me3_case_rep1_peak_35 Promoter (<=1kb) 1 171525 172948 1424 2 AT1G01470 AT1G01470.1 0
1 185189 185959 H3K4me3_case_rep1_peak_36 Promoter (<=1kb) 1 185033 187135 2103 1 AT1G01500 AT1G01500.1 157
1 187063 188013 H3K4me3_case_rep1_peak_37 Promoter (<=1kb) 1 187145 190472 3328 1 AT1G01510 AT1G01510.1 0
1 195558 196662 H3K4me3_case_rep1_peak_38 Promoter (<=1kb) 1 195645 198787 3143 1 AT1G01540 AT1G01540.2 0
1 220344 220872 H3K4me3_case_rep1_peak_39 Promoter (1-2kb) 1 218834 221286 2453 1 AT1G01600 AT1G01600.1 1511
1 226017 227210 H3K4me3_case_rep1_peak_40 Promoter (<=1kb) 1 225665 227302 1638 2 AT1G01620 AT1G01620.2 92
1 228646 229716 H3K4me3_case_rep1_peak_41 Promoter (<=1kb) 1 228799 230979 2181 1 AT1G01630 AT1G01630.1 0
1 236723 238151 H3K4me3_case_rep1_peak_42 Promoter (<=1kb) 1 232840 237905 5066 2 AT1G01650 AT1G01650.1 0
1 248726 250097 H3K4me3_case_rep1_peak_43 Promoter (<=1kb) 1 249041 252522 3482 1 AT1G01690 AT1G01690.2 0
1 258840 259215 H3K4me3_case_rep1_peak_44 Promoter (<=1kb) 1 258717 258963 247 2 AT1G04037 AT1G04037.1 0
1 262568 263494 H3K4me3_case_rep1_peak_45 Promoter (<=1kb) 1 262828 266324 3497 1 AT1G01710 AT1G01710.1 0
1 268268 270332 H3K4me3_case_rep1_peak_46 Promoter (<=1kb) 1 267993 268336 344 2 AT1G04043 AT1G04043.1 0
1 270486 271703 H3K4me3_case_rep1_peak_47 Promoter (<=1kb) 1 269792 270859 1068 2 AT1G01725 AT1G01725.1 0
1 278006 279669 H3K4me3_case_rep1_peak_48 Promoter (<=1kb) 1 276266 278448 2183 2 AT1G01760 AT1G01760.1 0
1 280098 280360 H3K4me3_case_rep1_peak_49 Promoter (1-2kb) 1 278600 282891 4292 1 AT1G01770 AT1G01770.1 1499
1 284523 285245 H3K4me3_case_rep1_peak_50 Promoter (<=1kb) 1 284460 291203 6744 1 AT1G01790 AT1G01790.2 64
1 293125 294259 H3K4me3_case_rep1_peak_51 Promoter (<=1kb) 1 293246 295086 1841 1 AT1G01800 AT1G01800.1 0
1 297685 297950 H3K4me3_case_rep1_peak_52 Promoter (<=1kb) 1 296001 298334 2334 2 AT1G01820 AT1G01820.1 384
1 298106 299608 H3K4me3_case_rep1_peak_53 Promoter (<=1kb) 1 296001 298334 2334 2 AT1G01820 AT1G01820.1 0
1 303456 304001 H3K4me3_case_rep1_peak_54 Promoter (<=1kb) 1 303537 304358 822 1 AT1G01840 AT1G01840.1 0
1 305766 306401 H3K4me3_case_rep1_peak_55 Promoter (<=1kb) 1 304133 306310 2178 2 AT1G01860 AT1G01860.1 0
1 308964 309570 H3K4me3_case_rep1_peak_56 Promoter (<=1kb) 1 306467 309541 3075 2 AT1G01880 AT1G01880.3 0
1 315387 316459 H3K4me3_case_rep1_peak_57 Promoter (<=1kb) 1 313101 315980 2880 2 AT1G01910 AT1G01910.4 0
1 322086 324123 H3K4me3_case_rep1_peak_58 Promoter (<=1kb) 1 319769 322895 3127 2 AT1G01930 AT1G01930.1 0
1 325134 325762 H3K4me3_case_rep1_peak_59 Promoter (<=1kb) 1 325316 330619 5304 1 AT1G01950 AT1G01950.1 0
1 330890 331162 H3K4me3_case_rep1_peak_60 Exon (AT1G01960.1/AT1G01960, exon 11 of 11) 1 325379 330619 5241 1 AT1G01950 AT1G01950.3 5512
1 336575 338611 H3K4me3_case_rep1_peak_61 Promoter (<=1kb) 1 330588 337912 7325 2 AT1G01960 AT1G01960.1 0
1 343578 344413 H3K4me3_case_rep1_peak_62 Promoter (<=1kb) 1 342918 344400 1483 2 AT1G01990 AT1G01990.1 0
1 345882 347261 H3K4me3_case_rep1_peak_63 Promoter (<=1kb) 1 347613 352545 4933 1 AT1G02010 AT1G02010.1 -352
1 353730 355081 H3K4me3_case_rep1_peak_64 Promoter (<=1kb) 1 352612 355021 2410 2 AT1G02020 AT1G02020.3 0
1 362178 363551 H3K4me3_case_rep1_peak_65 Promoter (<=1kb) 1 360918 363142 2225 2 AT1G02060 AT1G02060.1 0
1 373322 374035 H3K4me3_case_rep1_peak_66 Promoter (<=1kb) 1 373335 386847 13513 1 AT1G02080 AT1G02080.1 0
1 388834 390229 H3K4me3_case_rep1_peak_67 Promoter (<=1kb) 1 387277 389808 2532 2 AT1G02090 AT1G02090.1 0
1 390582 390849 H3K4me3_case_rep1_peak_68 Promoter (<=1kb) 1 389823 392811 2989 1 AT1G02100 AT1G02100.3 760
1 395561 396547 H3K4me3_case_rep1_peak_69 Promoter (<=1kb) 1 395689 400001 4313 1 AT1G02120 AT1G02120.1 0
1 401554 401880 H3K4me3_case_rep1_peak_70 Promoter (<=1kb) 1 399983 401919 1937 2 AT1G02130 AT1G02130.1 39
1 403740 405146 H3K4me3_case_rep1_peak_71 Promoter (<=1kb) 1 403100 404456 1357 2 AT1G02140 AT1G02140.1 0
1 408739 409513 H3K4me3_case_rep1_peak_72 Promoter (<=1kb) 1 408622 410680 2059 1 AT1G02150 AT1G02150.1 118
1 410659 411215 H3K4me3_case_rep1_peak_73 Promoter (<=1kb) 1 410744 411632 889 1 AT1G02160 AT1G02160.1 0
1 411631 412528 H3K4me3_case_rep1_peak_74 Promoter (<=1kb) 1 411664 413554 1891 1 AT1G02170 AT1G02170.1 0
1 428567 430761 H3K4me3_case_rep1_peak_75 Promoter (<=1kb) 1 428650 430720 2071 2 AT1G02220 AT1G02220.1 0
1 442103 442636 H3K4me3_case_rep1_peak_76 Promoter (<=1kb) 1 440200 442972 2773 2 AT1G02260 AT1G02260.1 336
1 445336 446167 H3K4me3_case_rep1_peak_77 Promoter (<=1kb) 1 443149 446276 3128 2 AT1G02270 AT1G02270.2 109
1 449213 450766 H3K4me3_case_rep1_peak_78 Promoter (<=1kb) 1 448423 450426 2004 2 AT1G02280 AT1G02280.1 0
1 460285 460732 H3K4me3_case_rep1_peak_79 Promoter (<=1kb) 1 458133 460696 2564 2 AT1G02310 AT1G02310.1 0
1 462937 463900 H3K4me3_case_rep1_peak_80 Promoter (<=1kb) 1 461946 463618 1673 2 AT1G02330 AT1G02330.1 0
1 472141 472581 H3K4me3_case_rep1_peak_81 Promoter (<=1kb) 1 471883 473160 1278 2 AT1G02360 AT1G02360.1 579
1 474374 474976 H3K4me3_case_rep1_peak_82 Promoter (<=1kb) 1 474373 476612 2240 1 AT1G02370 AT1G02370.1 2
1 477806 478068 H3K4me3_case_rep1_peak_83 Promoter (<=1kb) 1 476945 479112 2168 1 AT1G02380 AT1G02380.1 862
1 486917 487377 H3K4me3_case_rep1_peak_84 Promoter (<=1kb) 1 486801 489634 2834 1 AT1G02400 AT1G02400.2 117
1 487821 488301 H3K4me3_case_rep1_peak_85 Promoter (1-2kb) 1 486801 489634 2834 1 AT1G02400 AT1G02400.2 1021
1 490261 490864 H3K4me3_case_rep1_peak_86 Promoter (<=1kb) 1 489874 490627 754 2 AT1G02405 AT1G02405.1 0
1 491609 491871 H3K4me3_case_rep1_peak_87 Promoter (<=1kb) 1 490925 492979 2055 1 AT1G02410 AT1G02410.2 685
1 493458 494525 H3K4me3_case_rep1_peak_88 Promoter (<=1kb) 1 493639 495158 1520 1 AT1G02420 AT1G02420.1 0
1 518061 519670 H3K4me3_case_rep1_peak_89 Promoter (<=1kb) 1 518091 520495 2405 1 AT1G02500 AT1G02500.1 0
1 537612 538815 H3K4me3_case_rep1_peak_90 Promoter (<=1kb) 1 537740 540127 2388 1 AT1G02560 AT1G02560.1 0
1 541544 541806 H3K4me3_case_rep1_peak_91 Promoter (<=1kb) 1 541236 542245 1010 1 AT1G02570 AT1G02570.1 309
1 569078 570193 H3K4me3_case_rep1_peak_92 Promoter (<=1kb) 1 568558 570554 1997 1 AT1G02650 AT1G02650.2 521
1 574193 575577 H3K4me3_case_rep1_peak_93 Promoter (<=1kb) 1 571826 575085 3260 2 AT1G02660 AT1G02660.1 0
1 580287 580995 H3K4me3_case_rep1_peak_94 Promoter (<=1kb) 1 580625 582310 1686 1 AT1G02680 AT1G02680.1 0
1 583959 584632 H3K4me3_case_rep1_peak_95 Promoter (<=1kb) 1 584134 587327 3194 1 AT1G02690 AT1G02690.1 0
1 591802 593262 H3K4me3_case_rep1_peak_96 Promoter (<=1kb) 1 591826 593887 2062 1 AT1G02720 AT1G02720.2 0
1 601684 603028 H3K4me3_case_rep1_peak_97 Promoter (<=1kb) 1 599330 602199 2870 2 AT1G02740 AT1G02740.1 0
1 608499 609459 H3K4me3_case_rep1_peak_98 Promoter (<=1kb) 1 607799 609534 1736 2 AT1G02780 AT1G02780.1 75
1 618176 619509 H3K4me3_case_rep1_peak_99 Promoter (<=1kb) 1 618061 620502 2442 1 AT1G02810 AT1G02810.1 116
1 619759 620031 H3K4me3_case_rep1_peak_100 Promoter (<=1kb) 1 620678 621329 652 1 AT1G02813 AT1G02813.1 -647

上表第一列到第三列为peak在基因组位置;第五列annotation为peak的基因组功能元件身份;第六列到第十列为关联基因的位置信息; 第十一列geneId为基因ID;第十二列transcriptId为转录本ID;第十三列distanceToTSS为peak到TSS距离。







7. 基因富集分析

我们使用clusterprofiler(version 4.14.6)(Wu T. et al., 2021)进行GO和KEGG通路富集分析。富集分析结果表格未使用阈值过滤,您可以在表格中查看所有可能富集的通路。

7.1 GO富集分析

GO (Gene Ontology, http://www.geneontology.org) 是基因本体论联合会建立的将全世界所有与基因有关的研究结果进行分类汇总的综合数据库。该数据库标准化了不同数据库中关于基因和基因产物的生物学术语,适用于各物种,对基因和蛋白功能进行限定和描述。利用GO 数据库,可以对peak峰相关基因进行富集分析,可以找到不同条件下的peak峰相关基因按照其参与的BP(Biological Process, 生物过程)、MF(Molecular Function, 分子功能) 及CC(Cellular Component, 细胞组分) 三个方面进行分类注释。GO 注释有助于理解基因背后所代表的生物学意义。GO功能显著性富集分析给出与基因组背景相比,在相关基因中显著富集的GO功能条目,从而给出与peak峰相关基因与哪些生物学功能显著相关。该分析首先把所有相关向Gene Ontology数据库的各个term映射,计算每个term的基因数目,然后应用超几何检验,找出与整个基因组背景相比,在与peak峰相关基因中显著富集的GO条目。

下面展示peak关联的基因富集GO富集分析部分结果,完整结果请见/result/6.gokegg/GOALLterm_peakanno_*.csv。GO富集分析完整结果请详见位于report/result/6.gokegg文件夹的*_GO_res.csv表格文件。

显示前100行 (共3439行)
ONTOLOGY ID Description GeneRatio BgRatio pvalue p.adjust qvalue geneID Count
BP GO:0006886 intracellular protein transport 320/10443 491/25557 0.000 0.000 0.000 AT1G01910/AT1G02010/AT1G02280/AT1G02690/AT1G03000/AT1G04070/AT1G04830/AT1G05520/AT1G06950/AT1G08190/AT1G09070/AT1G09180/AT1G09270/AT1G09580/AT1G10730/AT1G11250/AT1G12360/AT1G12470/AT1G12930/AT1G13900/... 320
BP GO:0010228 vegetative to reproductive phase transition of meristem 304/10443 479/25557 0.000 0.000 0.000 AT1G01040/AT1G01060/AT1G02060/AT1G02740/AT1G03160/AT1G03365/AT1G03750/AT1G04210/AT1G04870/AT1G05150/AT1G05380/AT1G05830/AT1G06040/AT1G06070/AT1G08680/AT1G09520/AT1G09730/AT1G10570/AT1G12110/AT1G12910/... 304
BP GO:0033365 protein localization to organelle 201/10443 293/25557 0.000 0.000 0.000 AT1G01910/AT1G02280/AT1G02690/AT1G03000/AT1G04070/AT1G04960/AT1G06950/AT1G08190/AT1G09070/AT1G09270/AT1G11060/AT1G12930/AT1G13900/AT1G14850/AT1G15130/AT1G15310/AT1G16540/AT1G18320/AT1G19970/AT1G23465/... 201
BP GO:0006605 protein targeting 167/10443 236/25557 0.000 0.000 0.000 AT1G01910/AT1G02280/AT1G03000/AT1G04070/AT1G06950/AT1G08190/AT1G09070/AT1G13900/AT1G15310/AT1G16540/AT1G18320/AT1G23465/AT1G24460/AT1G26670/AT1G27390/AT1G29960/AT1G48090/AT1G48160/AT1G48760/AT1G48900/... 167
BP GO:0072594 establishment of protein localization to organelle 184/10443 271/25557 0.000 0.000 0.000 AT1G01910/AT1G02280/AT1G02690/AT1G03000/AT1G04070/AT1G04960/AT1G06950/AT1G08190/AT1G09070/AT1G09270/AT1G12930/AT1G13900/AT1G14850/AT1G15130/AT1G15310/AT1G16540/AT1G18320/AT1G23465/AT1G24310/AT1G24460/... 184
BP GO:0022613 ribonucleoprotein complex biogenesis 262/10443 427/25557 0.000 0.000 0.000 AT1G01040/AT1G01080/AT1G01860/AT1G02870/AT1G03140/AT1G03530/AT1G04170/AT1G04230/AT1G04270/AT1G06190/AT1G06720/AT1G07070/AT1G07840/AT1G09340/AT1G09700/AT1G10490/AT1G12244/AT1G12800/AT1G13030/AT1G13160/... 262
BP GO:0006397 mRNA processing 236/10443 378/25557 0.000 0.000 0.000 AT1G02330/AT1G02840/AT1G03140/AT1G03330/AT1G03910/AT1G04510/AT1G06190/AT1G06220/AT1G07170/AT1G07350/AT1G07740/AT1G08780/AT1G09230/AT1G09660/AT1G09760/AT1G10320/AT1G10580/AT1G10910/AT1G11710/AT1G11900/... 236
BP GO:0034470 ncRNA processing 260/10443 429/25557 0.000 0.000 0.000 AT1G01040/AT1G01080/AT1G01210/AT1G01760/AT1G01860/AT1G03110/AT1G04230/AT1G06190/AT1G06720/AT1G07840/AT1G07910/AT1G09060/AT1G09290/AT1G09340/AT1G09700/AT1G10490/AT1G12244/AT1G12800/AT1G13870/AT1G14790/... 260
BP GO:0048193 Golgi vesicle transport 132/10443 187/25557 0.000 0.000 0.000 AT1G02130/AT1G05520/AT1G08400/AT1G09180/AT1G09580/AT1G10180/AT1G10290/AT1G13980/AT1G14010/AT1G14020/AT1G15370/AT1G15880/AT1G18190/AT1G21900/AT1G26670/AT1G26690/AT1G28490/AT1G29330/AT1G30630/AT1G31780/... 132
BP GO:0009648 photoperiodism 186/10443 290/25557 0.000 0.000 0.000 AT1G01060/AT1G02740/AT1G03365/AT1G03750/AT1G04210/AT1G04990/AT1G05150/AT1G06040/AT1G09730/AT1G11480/AT1G12110/AT1G12910/AT1G14650/AT1G15800/AT1G16210/AT1G17210/AT1G17450/AT1G18450/AT1G18560/AT1G18610/... 186
BP GO:0042254 ribosome biogenesis 216/10443 351/25557 0.000 0.000 0.000 AT1G01040/AT1G01080/AT1G01860/AT1G02870/AT1G03530/AT1G04230/AT1G04270/AT1G06190/AT1G06720/AT1G07070/AT1G07840/AT1G09340/AT1G09700/AT1G10490/AT1G12244/AT1G12800/AT1G13160/AT1G14320/AT1G15420/AT1G15440/... 216
BP GO:0009657 plastid organization 194/10443 309/25557 0.000 0.000 0.000 AT1G01790/AT1G03160/AT1G04770/AT1G06460/AT1G06820/AT1G06870/AT1G06950/AT1G08540/AT1G09340/AT1G10430/AT1G10910/AT1G11750/AT1G11870/AT1G12410/AT1G15510/AT1G20830/AT1G24490/AT1G28530/AT1G29120/AT1G30475/... 194
BP GO:0090407 organophosphate biosynthetic process 207/10443 338/25557 0.000 0.000 0.000 AT1G01090/AT1G01220/AT1G01290/AT1G02880/AT1G04280/AT1G08750/AT1G09830/AT1G10700/AT1G11880/AT1G12640/AT1G12730/AT1G13560/AT1G15110/AT1G15700/AT1G16560/AT1G17410/AT1G20575/AT1G21640/AT1G21980/AT1G22620/... 207
BP GO:1901361 organic cyclic compound catabolic process 208/10443 342/25557 0.000 0.000 0.000 AT1G01040/AT1G02080/AT1G04010/AT1G04710/AT1G05620/AT1G06450/AT1G06570/AT1G06710/AT1G07040/AT1G07705/AT1G08200/AT1G08370/AT1G09700/AT1G09810/AT1G11190/AT1G12050/AT1G13940/AT1G14210/AT1G14230/AT1G14710/... 208
BP GO:0010150 leaf senescence 198/10443 323/25557 0.000 0.000 0.000 AT1G01230/AT1G02220/AT1G04010/AT1G05100/AT1G05620/AT1G07040/AT1G11190/AT1G11700/AT1G11760/AT1G13990/AT1G14330/AT1G16130/AT1G17020/AT1G18210/AT1G19270/AT1G20900/AT1G23040/AT1G23550/AT1G26420/AT1G26930/... 198
BP GO:0009966 regulation of signal transduction 262/10443 451/25557 0.000 0.000 0.000 AT1G01960/AT1G02100/AT1G03730/AT1G04310/AT1G05100/AT1G06840/AT1G07430/AT1G07910/AT1G08680/AT1G08720/AT1G09950/AT1G10430/AT1G10560/AT1G11050/AT1G12990/AT1G13300/AT1G13980/AT1G13990/AT1G14330/AT1G14780/... 262
BP GO:0071407 cellular response to organic cyclic compound 204/10443 335/25557 0.000 0.000 0.000 AT1G01440/AT1G02100/AT1G03400/AT1G03730/AT1G06390/AT1G08420/AT1G08720/AT1G11050/AT1G13750/AT1G14000/AT1G14330/AT1G14780/AT1G14920/AT1G16110/AT1G16260/AT1G17147/AT1G17330/AT1G17430/AT1G18670/AT1G19570/... 204
BP GO:0010646 regulation of cell communication 271/10443 470/25557 0.000 0.000 0.000 AT1G01960/AT1G02100/AT1G03730/AT1G04310/AT1G05100/AT1G06840/AT1G07430/AT1G07910/AT1G08680/AT1G08720/AT1G09950/AT1G10430/AT1G10560/AT1G11050/AT1G12990/AT1G13300/AT1G13980/AT1G13990/AT1G14330/AT1G14780/... 271
BP GO:0023051 regulation of signaling 267/10443 463/25557 0.000 0.000 0.000 AT1G01960/AT1G02100/AT1G03730/AT1G04310/AT1G05100/AT1G06840/AT1G07430/AT1G07910/AT1G08680/AT1G08720/AT1G09950/AT1G10430/AT1G10560/AT1G11050/AT1G12990/AT1G13300/AT1G13980/AT1G13990/AT1G14330/AT1G14780/... 267
BP GO:0090693 plant organ senescence 222/10443 373/25557 0.000 0.000 0.000 AT1G01230/AT1G02220/AT1G02850/AT1G03220/AT1G04010/AT1G05100/AT1G05620/AT1G07040/AT1G11190/AT1G11700/AT1G11760/AT1G13520/AT1G13990/AT1G14330/AT1G16130/AT1G17020/AT1G18210/AT1G19270/AT1G20900/AT1G21000/... 222
BP GO:0009658 chloroplast organization 146/10443 227/25557 0.000 0.000 0.000 AT1G01790/AT1G04770/AT1G06460/AT1G06950/AT1G08540/AT1G09340/AT1G10430/AT1G10910/AT1G11750/AT1G11870/AT1G12410/AT1G15510/AT1G20830/AT1G24490/AT1G28530/AT1G29120/AT1G30475/AT1G30610/AT1G35680/AT1G45230/... 146
BP GO:0016072 rRNA metabolic process 166/10443 266/25557 0.000 0.000 0.000 AT1G01040/AT1G01080/AT1G01790/AT1G01860/AT1G04230/AT1G06190/AT1G06720/AT1G07840/AT1G09340/AT1G09700/AT1G10490/AT1G12244/AT1G12800/AT1G15420/AT1G15440/AT1G16280/AT1G17450/AT1G17690/AT1G22270/AT1G23280/... 166
BP GO:0061024 membrane organization 168/10443 270/25557 0.000 0.000 0.000 AT1G01020/AT1G03160/AT1G05020/AT1G05520/AT1G06870/AT1G08190/AT1G08350/AT1G08820/AT1G09180/AT1G11250/AT1G13210/AT1G13900/AT1G14910/AT1G15880/AT1G16240/AT1G17500/AT1G18320/AT1G20110/AT1G21360/AT1G21790/... 168
BP GO:0006753 nucleoside phosphate metabolic process 187/10443 310/25557 0.000 0.000 0.000 AT1G01090/AT1G01220/AT1G01710/AT1G04280/AT1G04410/AT1G09420/AT1G09780/AT1G09830/AT1G10700/AT1G13440/AT1G13700/AT1G14230/AT1G15700/AT1G17410/AT1G20260/AT1G21640/AT1G24180/AT1G24280/AT1G28960/AT1G29880/... 187
BP GO:0018193 peptidyl-amino acid modification 172/10443 281/25557 0.000 0.000 0.000 AT1G02740/AT1G03930/AT1G04210/AT1G04440/AT1G04870/AT1G05830/AT1G09060/AT1G09320/AT1G09730/AT1G10570/AT1G12580/AT1G12680/AT1G14030/AT1G16710/AT1G18335/AT1G18450/AT1G24610/AT1G26470/AT1G27120/AT1G45160/... 172
BP GO:0006364 rRNA processing 157/10443 252/25557 0.000 0.000 0.000 AT1G01040/AT1G01080/AT1G01860/AT1G04230/AT1G06190/AT1G06720/AT1G07840/AT1G09340/AT1G09700/AT1G10490/AT1G12244/AT1G12800/AT1G15420/AT1G15440/AT1G16280/AT1G17690/AT1G22270/AT1G23280/AT1G25260/AT1G29250/... 157
BP GO:0008380 RNA splicing 198/10443 333/25557 0.000 0.000 0.000 AT1G02140/AT1G02330/AT1G02840/AT1G03140/AT1G03330/AT1G03910/AT1G04510/AT1G06220/AT1G07170/AT1G07350/AT1G07910/AT1G09230/AT1G09660/AT1G09760/AT1G10320/AT1G10580/AT1G13030/AT1G14640/AT1G14650/AT1G15470/... 198
BP GO:0006888 endoplasmic reticulum to Golgi vesicle-mediated transport 73/10443 97/25557 0.000 0.000 0.000 AT1G02130/AT1G05520/AT1G08400/AT1G09180/AT1G09580/AT1G14010/AT1G15880/AT1G21900/AT1G26690/AT1G29330/AT1G30630/AT1G51160/AT1G62020/AT1G69460/AT1G79990/AT1G80500/AT2G01470/AT2G17980/AT2G18840/AT2G20930/... 73
BP GO:0007005 mitochondrion organization 122/10443 186/25557 0.000 0.000 0.000 AT1G02410/AT1G03860/AT1G04070/AT1G05270/AT1G06530/AT1G08220/AT1G09020/AT1G10865/AT1G11870/AT1G13900/AT1G14450/AT1G14830/AT1G16700/AT1G17350/AT1G18320/AT1G18680/AT1G22800/AT1G23465/AT1G27390/AT1G29960/... 122
BP GO:0044270 cellular nitrogen compound catabolic process 173/10443 285/25557 0.000 0.000 0.000 AT1G01040/AT1G02080/AT1G05620/AT1G06450/AT1G06710/AT1G07040/AT1G07705/AT1G08370/AT1G09700/AT1G09810/AT1G11190/AT1G13940/AT1G14210/AT1G14230/AT1G14710/AT1G15920/AT1G18680/AT1G23020/AT1G23040/AT1G25260/... 173
BP GO:0055086 nucleobase-containing small molecule metabolic process 231/10443 402/25557 0.000 0.000 0.000 AT1G01090/AT1G01220/AT1G01710/AT1G03110/AT1G04280/AT1G04410/AT1G05620/AT1G08200/AT1G09420/AT1G09780/AT1G09830/AT1G10700/AT1G13440/AT1G13700/AT1G14230/AT1G15700/AT1G17160/AT1G17410/AT1G17890/AT1G20260/... 231
BP GO:0010629 negative regulation of gene expression 195/10443 330/25557 0.000 0.000 0.000 AT1G01040/AT1G01260/AT1G02080/AT1G05460/AT1G06230/AT1G06450/AT1G06710/AT1G07705/AT1G08060/AT1G08370/AT1G08830/AT1G09060/AT1G09320/AT1G09570/AT1G09700/AT1G09810/AT1G14710/AT1G14790/AT1G15920/AT1G17760/... 195
BP GO:0019439 aromatic compound catabolic process 187/10443 314/25557 0.000 0.000 0.000 AT1G01040/AT1G02080/AT1G04710/AT1G05620/AT1G06450/AT1G06570/AT1G06710/AT1G07040/AT1G07705/AT1G08370/AT1G09700/AT1G09810/AT1G11190/AT1G12050/AT1G13940/AT1G14210/AT1G14230/AT1G14710/AT1G15920/AT1G18680/... 187
BP GO:0090351 seedling development 139/10443 221/25557 0.000 0.000 0.000 AT1G01790/AT1G03060/AT1G03790/AT1G06040/AT1G07240/AT1G07430/AT1G09970/AT1G10560/AT1G14920/AT1G16060/AT1G17730/AT1G18080/AT1G18100/AT1G18580/AT1G20450/AT1G22190/AT1G27320/AT1G28240/AT1G30010/AT1G32070/... 139
BP GO:0046700 heterocycle catabolic process 171/10443 284/25557 0.000 0.000 0.000 AT1G01040/AT1G02080/AT1G05620/AT1G06450/AT1G06710/AT1G07040/AT1G07705/AT1G08370/AT1G09700/AT1G09810/AT1G11190/AT1G13940/AT1G14210/AT1G14230/AT1G14710/AT1G15920/AT1G18680/AT1G23020/AT1G23040/AT1G25260/... 171
BP GO:0009117 nucleotide metabolic process 178/10443 298/25557 0.000 0.000 0.000 AT1G01090/AT1G01220/AT1G01710/AT1G04280/AT1G04410/AT1G09420/AT1G09780/AT1G09830/AT1G10700/AT1G13440/AT1G13700/AT1G15700/AT1G17410/AT1G20260/AT1G21640/AT1G24180/AT1G24280/AT1G28960/AT1G29880/AT1G29900/... 178
BP GO:0048580 regulation of post-embryonic development 265/10443 479/25557 0.000 0.000 0.000 AT1G01040/AT1G01390/AT1G01790/AT1G02060/AT1G02330/AT1G03790/AT1G04130/AT1G05100/AT1G05870/AT1G06180/AT1G06475/AT1G07240/AT1G07430/AT1G09730/AT1G10155/AT1G10560/AT1G13260/AT1G13620/AT1G14400/AT1G14920/... 265
BP GO:0051668 localization within membrane 89/10443 134/25557 0.000 0.000 0.000 AT1G01910/AT1G10180/AT1G13900/AT1G15130/AT1G15310/AT1G18320/AT1G24490/AT1G27390/AT1G47550/AT1G47560/AT1G48160/AT1G48900/AT1G59820/AT1G61570/AT1G67650/AT1G67680/AT1G79940/AT2G20890/AT2G20990/AT2G27900/... 89
BP GO:0000375 RNA splicing, via transesterification reactions 160/10443 273/25557 0.000 0.000 0.000 AT1G02330/AT1G02840/AT1G03140/AT1G03330/AT1G03910/AT1G04510/AT1G06220/AT1G07170/AT1G07350/AT1G09230/AT1G09660/AT1G09760/AT1G10320/AT1G10580/AT1G13030/AT1G14640/AT1G14650/AT1G15470/AT1G16610/AT1G20920/... 160
BP GO:0000377 RNA splicing, via transesterification reactions with bulged adenosine as nucleophile 160/10443 273/25557 0.000 0.000 0.000 AT1G02330/AT1G02840/AT1G03140/AT1G03330/AT1G03910/AT1G04510/AT1G06220/AT1G07170/AT1G07350/AT1G09230/AT1G09660/AT1G09760/AT1G10320/AT1G10580/AT1G13030/AT1G14640/AT1G14650/AT1G15470/AT1G16610/AT1G20920/... 160
BP GO:0006163 purine nucleotide metabolic process 148/10443 249/25557 0.000 0.000 0.000 AT1G01090/AT1G01710/AT1G04280/AT1G04410/AT1G09420/AT1G09780/AT1G09830/AT1G10700/AT1G13440/AT1G13700/AT1G15700/AT1G17410/AT1G20260/AT1G21640/AT1G24180/AT1G24280/AT1G28960/AT1G30120/AT1G31160/AT1G31220/... 148
BP GO:0072521 purine-containing compound metabolic process 158/10443 270/25557 0.000 0.000 0.000 AT1G01090/AT1G01710/AT1G03110/AT1G04280/AT1G04410/AT1G05620/AT1G09420/AT1G09780/AT1G09830/AT1G10700/AT1G13440/AT1G13700/AT1G15700/AT1G17410/AT1G20260/AT1G21640/AT1G24180/AT1G24280/AT1G28960/AT1G30120/... 158
BP GO:0002376 immune system process 179/10443 313/25557 0.000 0.000 0.000 AT1G01040/AT1G01440/AT1G03160/AT1G04300/AT1G05460/AT1G06135/AT1G08030/AT1G08450/AT1G09970/AT1G11280/AT1G11330/AT1G11960/AT1G12220/AT1G12530/AT1G14780/AT1G16260/AT1G17230/AT1G17250/AT1G17980/AT1G18570/... 179
BP GO:0009451 RNA modification 211/10443 380/25557 0.000 0.000 0.000 AT1G01760/AT1G01860/AT1G03110/AT1G03510/AT1G03530/AT1G05750/AT1G06140/AT1G06150/AT1G06510/AT1G06710/AT1G08070/AT1G09290/AT1G09800/AT1G10490/AT1G10910/AT1G11290/AT1G11430/AT1G11780/AT1G13870/AT1G14470/... 211
BP GO:0072657 protein localization to membrane 74/10443 108/25557 0.000 0.000 0.000 AT1G01910/AT1G13900/AT1G15130/AT1G15310/AT1G18320/AT1G24490/AT1G27390/AT1G48160/AT1G48900/AT1G61570/AT1G67650/AT1G67680/AT1G79940/AT2G20890/AT2G20990/AT2G28800/AT2G31530/AT2G33640/AT2G34250/AT2G36680/... 74
BP GO:0090150 establishment of protein localization to membrane 74/10443 108/25557 0.000 0.000 0.000 AT1G01910/AT1G13900/AT1G15130/AT1G15310/AT1G18320/AT1G24490/AT1G27390/AT1G48160/AT1G48900/AT1G61570/AT1G67650/AT1G67680/AT1G79940/AT2G20890/AT2G20990/AT2G28800/AT2G31530/AT2G33640/AT2G34250/AT2G36680/... 74
BP GO:0098771 inorganic ion homeostasis 167/10443 290/25557 0.000 0.000 0.000 AT1G01140/AT1G04120/AT1G05200/AT1G05700/AT1G07030/AT1G07670/AT1G08030/AT1G08450/AT1G10130/AT1G11320/AT1G12520/AT1G13550/AT1G14040/AT1G14185/AT1G14860/AT1G18910/AT1G20110/AT1G20340/AT1G25520/AT1G26160/... 167
BP GO:0000398 mRNA splicing, via spliceosome 145/10443 246/25557 0.000 0.000 0.000 AT1G02330/AT1G02840/AT1G03140/AT1G03330/AT1G03910/AT1G04510/AT1G06220/AT1G07170/AT1G07350/AT1G09230/AT1G09660/AT1G09760/AT1G10320/AT1G10580/AT1G13030/AT1G14640/AT1G14650/AT1G15470/AT1G16610/AT1G20920/... 145
BP GO:0046777 protein autophosphorylation 108/10443 173/25557 0.000 0.000 0.000 AT1G01540/AT1G05100/AT1G06390/AT1G08680/AT1G08720/AT1G09020/AT1G09970/AT1G10470/AT1G11440/AT1G12580/AT1G12680/AT1G12970/AT1G16130/AT1G16670/AT1G20930/AT1G30570/AT1G49180/AT1G49580/AT1G50700/AT1G51170/... 108
BP GO:2000112 regulation of cellular macromolecule biosynthetic process 100/10443 158/25557 0.000 0.000 0.000 AT1G01040/AT1G02080/AT1G06450/AT1G07705/AT1G08370/AT1G09340/AT1G09570/AT1G09700/AT1G09810/AT1G10840/AT1G13950/AT1G15920/AT1G16860/AT1G18080/AT1G20340/AT1G26110/AT1G27070/AT1G27960/AT1G30010/AT1G30230/... 100
BP GO:0006955 immune response 175/10443 308/25557 0.000 0.000 0.000 AT1G01040/AT1G01440/AT1G03160/AT1G04300/AT1G05460/AT1G06135/AT1G08030/AT1G08450/AT1G09970/AT1G11280/AT1G11330/AT1G11960/AT1G12220/AT1G12530/AT1G14780/AT1G16260/AT1G17230/AT1G17250/AT1G17980/AT1G18570/... 175
BP GO:0048573 photoperiodism, flowering 131/10443 219/25557 0.000 0.000 0.000 AT1G01060/AT1G02740/AT1G03365/AT1G04210/AT1G06040/AT1G09730/AT1G12110/AT1G12910/AT1G14650/AT1G18450/AT1G18560/AT1G20670/AT1G21920/AT1G22610/AT1G22770/AT1G32070/AT1G35460/AT1G48770/AT1G54170/AT1G54830/... 131
BP GO:0033013 tetrapyrrole metabolic process 183/10443 325/25557 0.000 0.000 0.000 AT1G01180/AT1G01500/AT1G02560/AT1G03475/AT1G04620/AT1G06690/AT1G07040/AT1G08520/AT1G09130/AT1G09940/AT1G13990/AT1G14345/AT1G15730/AT1G18360/AT1G18460/AT1G20650/AT1G22850/AT1G23040/AT1G24881/AT1G26930/... 183
BP GO:0010243 response to organonitrogen compound 116/10443 191/25557 0.000 0.000 0.000 AT1G01440/AT1G01930/AT1G05200/AT1G06800/AT1G08450/AT1G09210/AT1G11020/AT1G16900/AT1G17280/AT1G18260/AT1G25570/AT1G27100/AT1G27730/AT1G27752/AT1G30000/AT1G32640/AT1G34420/AT1G50030/AT1G51800/AT1G59870/... 116
BP GO:1901698 response to nitrogen compound 156/10443 272/25557 0.000 0.000 0.000 AT1G01440/AT1G01930/AT1G05200/AT1G05850/AT1G06620/AT1G06800/AT1G08450/AT1G09210/AT1G11020/AT1G12110/AT1G12820/AT1G13300/AT1G16900/AT1G17280/AT1G17340/AT1G18260/AT1G20575/AT1G20640/AT1G25570/AT1G27100/... 156
BP GO:0070482 response to oxygen levels 196/10443 355/25557 0.000 0.000 0.000 AT1G02360/AT1G03220/AT1G03610/AT1G04960/AT1G07150/AT1G07400/AT1G07870/AT1G09070/AT1G10140/AT1G13260/AT1G13300/AT1G14200/AT1G17147/AT1G17290/AT1G18300/AT1G18490/AT1G19020/AT1G20970/AT1G21550/AT1G22220/... 196
BP GO:0016050 vesicle organization 80/10443 122/25557 0.000 0.000 0.000 AT1G05020/AT1G05520/AT1G08190/AT1G09180/AT1G11250/AT1G12470/AT1G14910/AT1G15130/AT1G15880/AT1G16240/AT1G20110/AT1G24460/AT1G26670/AT1G28490/AT1G48760/AT1G59820/AT1G65260/AT1G77140/AT1G77890/AT1G79590/... 80
BP GO:0006091 generation of precursor metabolites and energy 221/10443 408/25557 0.000 0.000 0.000 AT1G01990/AT1G02910/AT1G03310/AT1G04410/AT1G06680/AT1G07030/AT1G08480/AT1G09420/AT1G09530/AT1G09780/AT1G13440/AT1G13700/AT1G14345/AT1G14450/AT1G16700/AT1G17350/AT1G24280/AT1G32070/AT1G32350/AT1G32440/... 221
BP GO:0009845 seed germination 115/10443 190/25557 0.000 0.000 0.000 AT1G03060/AT1G03790/AT1G07240/AT1G07430/AT1G09970/AT1G10560/AT1G14920/AT1G16060/AT1G18080/AT1G18100/AT1G18580/AT1G20450/AT1G22190/AT1G27320/AT1G28240/AT1G30010/AT1G32070/AT1G34120/AT1G48630/AT1G49590/... 115
BP GO:0031324 negative regulation of cellular metabolic process 252/10443 474/25557 0.000 0.000 0.000 AT1G01040/AT1G02080/AT1G03760/AT1G03850/AT1G04550/AT1G05380/AT1G06450/AT1G06590/AT1G07705/AT1G08030/AT1G08370/AT1G08390/AT1G08460/AT1G09060/AT1G09570/AT1G09700/AT1G09810/AT1G10450/AT1G13260/AT1G13880/... 252
BP GO:0006644 phospholipid metabolic process 131/10443 222/25557 0.000 0.000 0.000 AT1G05630/AT1G07230/AT1G08750/AT1G11880/AT1G12640/AT1G12730/AT1G13560/AT1G15110/AT1G16560/AT1G17340/AT1G20575/AT1G21980/AT1G22620/AT1G31910/AT1G32200/AT1G34120/AT1G48140/AT1G49340/AT1G53710/AT1G62430/... 131
BP GO:0036293 response to decreased oxygen levels 195/10443 354/25557 0.000 0.000 0.000 AT1G02360/AT1G03220/AT1G03610/AT1G04960/AT1G07150/AT1G07400/AT1G07870/AT1G09070/AT1G10140/AT1G13260/AT1G13300/AT1G14200/AT1G17147/AT1G17290/AT1G18300/AT1G18490/AT1G19020/AT1G20970/AT1G21550/AT1G22220/... 195
BP GO:0006913 nucleocytoplasmic transport 77/10443 117/25557 0.000 0.000 0.000 AT1G02690/AT1G05410/AT1G07140/AT1G09270/AT1G12930/AT1G13120/AT1G13160/AT1G14850/AT1G24310/AT1G24706/AT1G26170/AT1G27310/AT1G27970/AT1G33410/AT1G43700/AT1G45233/AT1G55540/AT1G63810/AT1G79280/AT1G80670/... 77
BP GO:0051169 nuclear transport 77/10443 117/25557 0.000 0.000 0.000 AT1G02690/AT1G05410/AT1G07140/AT1G09270/AT1G12930/AT1G13120/AT1G13160/AT1G14850/AT1G24310/AT1G24706/AT1G26170/AT1G27310/AT1G27970/AT1G33410/AT1G43700/AT1G45233/AT1G55540/AT1G63810/AT1G79280/AT1G80670/... 77
BP GO:0070085 glycosylation 103/10443 167/25557 0.000 0.000 0.000 AT1G05170/AT1G08660/AT1G12990/AT1G14080/AT1G14100/AT1G16570/AT1G16900/AT1G17270/AT1G20575/AT1G21480/AT1G27120/AT1G27440/AT1G30000/AT1G32210/AT1G34130/AT1G34270/AT1G48140/AT1G49710/AT1G51590/AT1G51630/... 103
BP GO:0009100 glycoprotein metabolic process 106/10443 173/25557 0.000 0.000 0.000 AT1G05170/AT1G08030/AT1G08660/AT1G12990/AT1G14080/AT1G14100/AT1G16570/AT1G16900/AT1G17270/AT1G20575/AT1G21480/AT1G27120/AT1G27440/AT1G30000/AT1G32210/AT1G34130/AT1G34270/AT1G48140/AT1G49710/AT1G51590/... 106
BP GO:0006399 tRNA metabolic process 109/10443 179/25557 0.000 0.000 0.000 AT1G01210/AT1G01760/AT1G03110/AT1G07910/AT1G08540/AT1G09290/AT1G09620/AT1G11870/AT1G13870/AT1G14610/AT1G17960/AT1G20410/AT1G22270/AT1G22660/AT1G25350/AT1G28350/AT1G29880/AT1G31600/AT1G36310/AT1G48520/... 109
BP GO:0019693 ribose phosphate metabolic process 130/10443 221/25557 0.000 0.000 0.000 AT1G01090/AT1G01710/AT1G09780/AT1G09830/AT1G10700/AT1G13440/AT1G15700/AT1G17410/AT1G20260/AT1G24180/AT1G28960/AT1G29900/AT1G30120/AT1G30680/AT1G30820/AT1G31160/AT1G31220/AT1G31910/AT1G32380/AT1G32440/... 130
BP GO:0009743 response to carbohydrate 118/10443 197/25557 0.000 0.000 0.000 AT1G01140/AT1G01790/AT1G05630/AT1G08830/AT1G10840/AT1G12000/AT1G14000/AT1G15440/AT1G16540/AT1G16610/AT1G18080/AT1G18570/AT1G19200/AT1G20950/AT1G21400/AT1G22160/AT1G22500/AT1G24100/AT1G24460/AT1G27130/... 118
BP GO:0009408 response to heat 218/10443 404/25557 0.000 0.000 0.000 AT1G03190/AT1G04000/AT1G04130/AT1G04570/AT1G05850/AT1G07400/AT1G07890/AT1G07980/AT1G10960/AT1G11270/AT1G11660/AT1G12060/AT1G13080/AT1G13440/AT1G13930/AT1G14590/AT1G14980/AT1G16030/AT1G16540/AT1G17100/... 218
BP GO:0006457 protein folding 111/10443 184/25557 0.000 0.000 0.000 AT1G04130/AT1G07400/AT1G08450/AT1G08780/AT1G09210/AT1G10350/AT1G11660/AT1G12060/AT1G14980/AT1G15020/AT1G16030/AT1G24510/AT1G28210/AT1G35620/AT1G52260/AT1G56260/AT1G68730/AT1G71440/AT1G72280/AT1G77510/... 111
BP GO:0006623 protein targeting to vacuole 33/10443 40/25557 0.000 0.000 0.000 AT1G08190/AT1G09070/AT1G24460/AT1G26670/AT1G48090/AT1G48760/AT1G56590/AT2G14720/AT2G14740/AT2G26890/AT2G28390/AT2G32760/AT2G34940/AT2G36680/AT2G37680/AT3G47700/AT3G50380/AT3G52850/AT3G53120/AT3G54300/... 33
BP GO:0009306 protein secretion 21/10443 22/25557 0.000 0.000 0.000 AT1G09330/AT1G51740/AT2G01470/AT2G25660/AT3G05710/AT3G09900/AT3G46060/AT3G52190/AT3G53610/AT4G02195/AT4G14870/AT4G24190/AT4G34450/AT5G03520/AT5G05760/AT5G07350/AT5G20165/AT5G50440/AT5G50550/AT5G59840/... 21
BP GO:0035592 establishment of protein localization to extracellular region 21/10443 22/25557 0.000 0.000 0.000 AT1G09330/AT1G51740/AT2G01470/AT2G25660/AT3G05710/AT3G09900/AT3G46060/AT3G52190/AT3G53610/AT4G02195/AT4G14870/AT4G24190/AT4G34450/AT5G03520/AT5G05760/AT5G07350/AT5G20165/AT5G50440/AT5G50550/AT5G59840/... 21
BP GO:0071692 protein localization to extracellular region 21/10443 22/25557 0.000 0.000 0.000 AT1G09330/AT1G51740/AT2G01470/AT2G25660/AT3G05710/AT3G09900/AT3G46060/AT3G52190/AT3G53610/AT4G02195/AT4G14870/AT4G24190/AT4G34450/AT5G03520/AT5G05760/AT5G07350/AT5G20165/AT5G50440/AT5G50550/AT5G59840/... 21
BP GO:0045087 innate immune response 159/10443 282/25557 0.000 0.000 0.000 AT1G01040/AT1G01440/AT1G03160/AT1G04300/AT1G05460/AT1G06135/AT1G08030/AT1G08450/AT1G09970/AT1G11280/AT1G11960/AT1G12220/AT1G14780/AT1G16260/AT1G17250/AT1G17980/AT1G18570/AT1G20900/AT1G27130/AT1G28380/... 159
BP GO:0006497 protein lipidation 74/10443 113/25557 0.000 0.000 0.000 AT1G07110/AT1G08750/AT1G11880/AT1G12210/AT1G12220/AT1G12290/AT1G12730/AT1G16560/AT1G20575/AT1G23490/AT1G48140/AT1G52320/AT1G53710/AT1G63110/AT1G63350/AT1G67800/AT1G70490/AT1G74340/AT2G14255/AT2G20140/... 74
BP GO:0042158 lipoprotein biosynthetic process 74/10443 113/25557 0.000 0.000 0.000 AT1G07110/AT1G08750/AT1G11880/AT1G12210/AT1G12220/AT1G12290/AT1G12730/AT1G16560/AT1G20575/AT1G23490/AT1G48140/AT1G52320/AT1G53710/AT1G63110/AT1G63350/AT1G67800/AT1G70490/AT1G74340/AT2G14255/AT2G20140/... 74
BP GO:0051172 negative regulation of nitrogen compound metabolic process 223/10443 417/25557 0.000 0.000 0.000 AT1G01040/AT1G02080/AT1G03760/AT1G03850/AT1G04550/AT1G05380/AT1G06450/AT1G06590/AT1G07705/AT1G08030/AT1G08370/AT1G08390/AT1G08460/AT1G09060/AT1G09570/AT1G09700/AT1G09810/AT1G10450/AT1G13260/AT1G14410/... 223
BP GO:0042157 lipoprotein metabolic process 75/10443 115/25557 0.000 0.000 0.000 AT1G07110/AT1G08750/AT1G11880/AT1G12210/AT1G12220/AT1G12290/AT1G12730/AT1G16560/AT1G20575/AT1G23490/AT1G48140/AT1G52320/AT1G53710/AT1G63110/AT1G63350/AT1G67800/AT1G70490/AT1G74340/AT2G14255/AT2G20140/... 75
BP GO:0006778 porphyrin-containing compound metabolic process 164/10443 293/25557 0.000 0.000 0.000 AT1G01500/AT1G02560/AT1G03475/AT1G04620/AT1G06690/AT1G07040/AT1G08520/AT1G09130/AT1G09940/AT1G13990/AT1G14345/AT1G18360/AT1G18460/AT1G20650/AT1G22850/AT1G23040/AT1G24881/AT1G26930/AT1G27210/AT1G27320/... 164
BP GO:0009101 glycoprotein biosynthetic process 99/10443 162/25557 0.000 0.000 0.000 AT1G05170/AT1G08030/AT1G08660/AT1G12990/AT1G14080/AT1G14100/AT1G16570/AT1G16900/AT1G17270/AT1G20575/AT1G21480/AT1G27120/AT1G27440/AT1G30000/AT1G32210/AT1G34130/AT1G34270/AT1G48140/AT1G49710/AT1G51590/... 99
BP GO:0006417 regulation of translation 81/10443 127/25557 0.000 0.000 0.000 AT1G01040/AT1G02080/AT1G06450/AT1G07705/AT1G08370/AT1G09340/AT1G09570/AT1G09700/AT1G09810/AT1G10840/AT1G13950/AT1G15920/AT1G18080/AT1G20340/AT1G26110/AT1G27960/AT1G48630/AT1G55500/AT1G64790/AT1G67620/... 81
BP GO:0043543 protein acylation 82/10443 129/25557 0.000 0.000 0.000 AT1G02740/AT1G07110/AT1G12210/AT1G12220/AT1G12290/AT1G16710/AT1G18335/AT1G18450/AT1G23490/AT1G24040/AT1G26470/AT1G32070/AT1G52320/AT1G54140/AT1G55970/AT1G63350/AT1G67800/AT1G70490/AT1G71730/AT1G79000/... 82
BP GO:0009640 photomorphogenesis 74/10443 114/25557 0.000 0.000 0.000 AT1G02090/AT1G02330/AT1G06040/AT1G09530/AT1G09570/AT1G14290/AT1G20900/AT1G22920/AT1G54830/AT1G61620/AT1G69640/AT1G69935/AT1G71230/AT1G75180/AT1G75540/AT1G76500/AT1G78600/AT1G79810/AT2G04030/AT2G18790/... 74
BP GO:0051246 regulation of protein metabolic process 219/10443 411/25557 0.000 0.000 0.000 AT1G01040/AT1G01360/AT1G02080/AT1G04810/AT1G05840/AT1G05890/AT1G06060/AT1G06450/AT1G07705/AT1G08370/AT1G09060/AT1G09340/AT1G09570/AT1G09700/AT1G09810/AT1G10840/AT1G13950/AT1G14750/AT1G15800/AT1G15860/... 219
BP GO:0008654 phospholipid biosynthetic process 92/10443 149/25557 0.000 0.000 0.000 AT1G08750/AT1G11880/AT1G12640/AT1G12730/AT1G13560/AT1G15110/AT1G16560/AT1G20575/AT1G21980/AT1G22620/AT1G31910/AT1G32200/AT1G48140/AT1G49340/AT1G53710/AT1G62430/AT1G63050/AT1G63110/AT1G63970/AT1G68000/... 92
BP GO:0006486 protein glycosylation 98/10443 161/25557 0.000 0.000 0.000 AT1G05170/AT1G08660/AT1G12990/AT1G14080/AT1G14100/AT1G16570/AT1G16900/AT1G17270/AT1G20575/AT1G21480/AT1G27120/AT1G27440/AT1G30000/AT1G32210/AT1G34130/AT1G34270/AT1G48140/AT1G49710/AT1G51590/AT1G53290/... 98
BP GO:0043413 macromolecule glycosylation 98/10443 161/25557 0.000 0.000 0.000 AT1G05170/AT1G08660/AT1G12990/AT1G14080/AT1G14100/AT1G16570/AT1G16900/AT1G17270/AT1G20575/AT1G21480/AT1G27120/AT1G27440/AT1G30000/AT1G32210/AT1G34130/AT1G34270/AT1G48140/AT1G49710/AT1G51590/AT1G53290/... 98
BP GO:0001666 response to hypoxia 187/10443 344/25557 0.000 0.000 0.000 AT1G02360/AT1G03220/AT1G03610/AT1G04960/AT1G07150/AT1G07400/AT1G07870/AT1G09070/AT1G10140/AT1G13260/AT1G13300/AT1G14200/AT1G17147/AT1G17290/AT1G18300/AT1G18490/AT1G19020/AT1G20970/AT1G21550/AT1G22220/... 187
BP GO:0032940 secretion by cell 57/10443 83/25557 0.000 0.000 0.000 AT1G02010/AT1G07000/AT1G09330/AT1G11250/AT1G12360/AT1G12470/AT1G28490/AT1G47550/AT1G47560/AT1G51740/AT1G54090/AT1G71820/AT1G72470/AT1G79070/AT2G01470/AT2G05170/AT2G25660/AT2G28650/AT2G44610/AT3G05710/... 57
BP GO:0016311 dephosphorylation 132/10443 230/25557 0.000 0.000 0.000 AT1G01360/AT1G03590/AT1G03760/AT1G03960/AT1G05000/AT1G05630/AT1G07160/AT1G07430/AT1G07630/AT1G09160/AT1G10430/AT1G13320/AT1G17340/AT1G17550/AT1G17710/AT1G17720/AT1G18030/AT1G18640/AT1G22620/AT1G30470/... 132
BP GO:0009259 ribonucleotide metabolic process 123/10443 212/25557 0.000 0.000 0.000 AT1G01090/AT1G01710/AT1G09780/AT1G09830/AT1G13440/AT1G15700/AT1G17410/AT1G20260/AT1G24180/AT1G28960/AT1G29900/AT1G30120/AT1G30680/AT1G30820/AT1G31160/AT1G31220/AT1G31910/AT1G32440/AT1G34430/AT1G36180/... 123
BP GO:0007623 circadian rhythm 101/10443 168/25557 0.000 0.000 0.000 AT1G01060/AT1G09340/AT1G10470/AT1G12910/AT1G15950/AT1G18330/AT1G19860/AT1G22770/AT1G32070/AT1G35460/AT1G59940/AT1G68050/AT1G68830/AT1G72390/AT1G73480/AT1G77180/AT1G80820/AT2G17840/AT2G18790/AT2G18915/... 101
BP GO:0048511 rhythmic process 101/10443 168/25557 0.000 0.000 0.000 AT1G01060/AT1G09340/AT1G10470/AT1G12910/AT1G15950/AT1G18330/AT1G19860/AT1G22770/AT1G32070/AT1G35460/AT1G59940/AT1G68050/AT1G68830/AT1G72390/AT1G73480/AT1G77180/AT1G80820/AT2G17840/AT2G18790/AT2G18915/... 101
BP GO:1901605 alpha-amino acid metabolic process 224/10443 424/25557 0.000 0.000 0.000 AT1G03090/AT1G06550/AT1G06570/AT1G06620/AT1G07750/AT1G08200/AT1G08250/AT1G08490/AT1G11790/AT1G12050/AT1G14810/AT1G15710/AT1G17290/AT1G17330/AT1G17745/AT1G18280/AT1G18500/AT1G18640/AT1G20490/AT1G22020/... 224
BP GO:0006974 cellular response to DNA damage stimulus 208/10443 390/25557 0.000 0.000 0.000 AT1G03190/AT1G05055/AT1G05120/AT1G05180/AT1G05900/AT1G07130/AT1G07500/AT1G07660/AT1G07745/AT1G08130/AT1G08390/AT1G08840/AT1G09815/AT1G10930/AT1G11060/AT1G11100/AT1G11800/AT1G12370/AT1G12400/AT1G13220/... 208
BP GO:0071215 cellular response to abscisic acid stimulus 183/10443 337/25557 0.000 0.000 0.000 AT1G01360/AT1G02310/AT1G04120/AT1G05100/AT1G07430/AT1G08720/AT1G09870/AT1G09950/AT1G10430/AT1G10560/AT1G10930/AT1G11760/AT1G13300/AT1G15100/AT1G18080/AT1G18460/AT1G18720/AT1G19380/AT1G20110/AT1G23140/... 183
BP GO:0097306 cellular response to alcohol 183/10443 337/25557 0.000 0.000 0.000 AT1G01360/AT1G02310/AT1G04120/AT1G05100/AT1G07430/AT1G08720/AT1G09870/AT1G09950/AT1G10430/AT1G10560/AT1G10930/AT1G11760/AT1G13300/AT1G15100/AT1G18080/AT1G18460/AT1G18720/AT1G19380/AT1G20110/AT1G23140/... 183
BP GO:0009141 nucleoside triphosphate metabolic process 69/10443 106/25557 0.000 0.000 0.000 AT1G09780/AT1G13440/AT1G15700/AT1G17410/AT1G20260/AT1G30820/AT1G32440/AT1G50460/AT1G51650/AT1G55810/AT1G74030/AT1G78050/AT1G79470/AT1G79550/AT2G01140/AT2G19680/AT2G21170/AT2G21410/AT2G21790/AT2G22480/... 69

表7.1 GO富集分析部分结果:
ONTOLOGY:GO方面,细胞成分,生物过程或分子功能之一;
ID:GO标识符,GO ID;
Description:GO术语的文字描述;
GeneRatio:该条目基因比例,分子是富集到这个GO条目上的基因的数目,分母是所有peak关联基因的数目;
BgRatio:背景比例,分母是物种全部有GO注释的基因的数目,分子是这些基因中注释到这个GO条目上面的基因的数目;
RichFactor​​:富集因子(Enrichment Factor)= GeneRatio / BgRatio;
​​FoldEnrichment​:富集倍数(Fold Enrichment)= (富集通路基因数 / 输入基因数) / (背景通路基因数 / 背景总基因数);
​​zScore​:标准化富集得分(基于超几何分布的 Z 值);
pvalue:富集的p值;
p.adjust:使用BH校正之后的p值;
qvalue:q值,使用FDR校正之后的p值,q-value相比于p-value更加严格,表示p-value产生假阳性的概率;
geneID:富集到这个GO条目上面的具体的基因ID;
Count:富集到这个GO条目上面的基因的数目。




图7.2 Peak关联基因GO气泡图。纵坐标是GO Term 名称,横坐标是对应GO Term 中检出的基因占背景基因的个数,颜色代表显著性,气泡大小代表该条目基因比例。



图7.3 Peak关联基因GO条状图。按照BP、MF、CC三个方面分别展示GO富集结果。纵坐标是GO Term 名称,横坐标值越大显著性越高,如果为0代表qvalue等于1。



7.2 KEGG富集分析

KEGG (Kyoto Encyclopedia of Genes and Genomes, http://www.genome.jp/kegg/) 是日本京都大学构建的基因组信息数据库,它将基因组序列信息与功能信息相结合,提供了一个全面的基因组功能信息资源。在PATHWAY数据库里,包括图解的细胞生化过程如代谢、膜转运、信号传递、细胞周期,还包括同系保守的子通路等信息。KEGG富集分析可以对peak关联基因进行KEGG通路富集分析。

下面展示peak关联的基因富集KEGG富集分析部分结果,完整结果请见/result/6.gokegg/GOALLterm_peakanno_*.csv。GO富集分析完整结果请详见位于report/result/6.gokegg文件夹的*_KEGG_res.csv表格文件。

显示前100行 (共99行)
ID Description GeneRatio BgRatio pvalue p.adjust qvalue geneID Count
00510 N-Glycan biosynthesis 34/1836 40/3449 0.000 0.002 0.002 AT1G12990/AT1G16570/AT1G16900/AT1G20575/AT1G30000/AT1G32210/AT1G34130/AT1G48140/AT1G51590/AT1G67490/AT1G67880/AT1G74340/AT1G76400/AT1G78800/AT2G05320/AT2G39630/AT2G40190/AT2G41490/AT2G44660/AT2G47760/... 34
03040 Spliceosome 80/1836 115/3449 0.000 0.010 0.010 AT1G02140/AT1G02840/AT1G03140/AT1G03330/AT1G04510/AT1G06220/AT1G07170/AT1G07360/AT1G09760/AT1G10580/AT1G14650/AT1G16030/AT1G20920/AT1G20960/AT1G21190/AT1G24706/AT1G28060/AT1G32490/AT1G44910/AT1G51510/... 80
00970 Aminoacyl-tRNA biosynthesis 35/1836 48/3449 0.004 0.118 0.109 AT1G09620/AT1G11870/AT1G14610/AT1G17960/AT1G25350/AT1G29880/AT1G48520/AT1G66530/AT1G70980/AT1G72550/AT2G31170/AT3G02660/AT3G02760/AT3G04600/AT3G11710/AT3G13490/AT3G46100/AT3G48110/AT3G55400/AT3G58140/... 35
03050 Proteasome 41/1836 58/3449 0.005 0.118 0.109 AT1G04810/AT1G13060/AT1G20200/AT1G21720/AT1G29150/AT1G47250/AT1G53750/AT1G53780/AT1G67250/AT1G75990/AT1G79210/AT2G05840/AT2G20140/AT2G20580/AT2G32730/AT3G05530/AT3G11270/AT3G13330/AT3G22110/AT3G26340/... 41
03013 Nucleocytoplasmic transport 78/1836 121/3449 0.007 0.132 0.122 AT1G02140/AT1G04170/AT1G07920/AT1G10840/AT1G14850/AT1G15200/AT1G15470/AT1G16610/AT1G21160/AT1G24706/AT1G28090/AT1G29590/AT1G33410/AT1G45231/AT1G45233/AT1G49760/AT1G51510/AT1G52160/AT1G53880/AT1G54270/... 78
00450 Selenocompound metabolism 15/1836 18/3449 0.008 0.132 0.122 AT1G08490/AT1G19920/AT2G17420/AT2G41680/AT3G01120/AT3G03780/AT3G22890/AT3G55400/AT3G57050/AT4G13780/AT4G14680/AT4G35460/AT5G17920/AT5G43780/AT5G49810 15
00190 Oxidative phosphorylation 81/1836 127/3449 0.009 0.132 0.122 AT1G01050/AT1G02410/AT1G04630/AT1G15690/AT1G15700/AT1G16700/AT1G16780/AT1G19910/AT1G20260/AT1G22450/AT1G49140/AT1G51650/AT1G53030/AT1G64200/AT1G65290/AT1G75630/AT1G79010/AT1G80230/AT2G02050/AT2G16510/... 81
03420 Nucleotide excision repair 40/1836 59/3449 0.016 0.169 0.156 AT1G03190/AT1G05055/AT1G08130/AT1G09815/AT1G10590/AT1G12400/AT1G16190/AT1G18340/AT1G21690/AT1G27840/AT1G50840/AT1G55750/AT1G73690/AT1G77470/AT1G78650/AT1G79650/AT2G29570/AT2G42120/AT3G02540/AT3G02920/... 40
00010 Glycolysis / Gluconeogenesis 67/1836 105/3449 0.017 0.169 0.156 AT1G01090/AT1G09780/AT1G13440/AT1G16300/AT1G22430/AT1G22440/AT1G23190/AT1G24180/AT1G30120/AT1G32440/AT1G34430/AT1G48030/AT1G50460/AT1G54100/AT1G59900/AT1G70730/AT1G74030/AT1G79530/AT1G79550/AT2G01140/... 67
03008 Ribosome biogenesis in eukaryotes 50/1836 76/3449 0.017 0.169 0.156 AT1G06720/AT1G10490/AT1G15440/AT1G27470/AT1G50920/AT1G56110/AT1G63780/AT1G63810/AT1G72710/AT2G03820/AT2G19470/AT2G23070/AT2G24990/AT2G27200/AT2G46230/AT2G47300/AT2G47990/AT3G01610/AT3G03110/AT3G03920/... 50
00563 Glycosylphosphatidylinositol (GPI)-anchor biosynthesis 11/1836 13/3449 0.020 0.179 0.165 AT1G11880/AT1G63110/AT1G74340/AT2G22530/AT2G34980/AT3G07140/AT3G45100/AT5G14850/AT5G19130/AT5G22130/AT5G46850 11
03060 Protein export 31/1836 46/3449 0.036 0.294 0.272 AT1G06870/AT1G15310/AT1G23465/AT1G24490/AT1G29960/AT1G48160/AT1G48900/AT1G53530/AT1G67650/AT1G67680/AT1G79940/AT2G01110/AT2G28800/AT2G30440/AT2G34250/AT2G39960/AT2G43640/AT2G45770/AT2G46470/AT3G15710/... 31
04146 Peroxisome 39/1836 61/3449 0.058 0.416 0.385 AT1G03000/AT1G04710/AT1G06290/AT1G06310/AT1G08830/AT1G20620/AT1G54340/AT1G77590/AT1G79810/AT2G04350/AT2G14860/AT2G26350/AT2G28190/AT2G33150/AT2G35690/AT2G39970/AT2G45690/AT3G04460/AT3G05970/AT3G12800/... 39
00230 Purine metabolism 80/1836 133/3449 0.061 0.416 0.385 AT1G01210/AT1G09815/AT1G09830/AT1G14230/AT1G17410/AT1G19920/AT1G23190/AT1G29940/AT1G31220/AT1G32380/AT1G32440/AT1G50840/AT1G54250/AT1G67550/AT1G70730/AT1G71750/AT1G72880/AT1G74260/AT1G78650/AT1G79470/... 80
04712 Circadian rhythm - plant 20/1836 29/3449 0.063 0.416 0.385 AT1G01060/AT1G09530/AT1G09570/AT1G22770/AT1G68050/AT2G18790/AT2G18915/AT2G23070/AT2G25930/AT2G46790/AT2G46830/AT3G04910/AT3G60250/AT4G08920/AT4G16250/AT4G17640/AT5G24470/AT5G57360/AT5G61380/AT5G67380 20
00030 Pentose phosphate pathway 34/1836 53/3449 0.070 0.423 0.391 AT1G09420/AT1G13700/AT1G17160/AT1G23190/AT1G24280/AT1G32380/AT1G63290/AT1G64190/AT1G70730/AT1G71100/AT2G01140/AT2G01290/AT2G22480/AT2G35390/AT2G36460/AT2G44530/AT2G45290/AT3G01850/AT3G52930/AT3G54050/... 34
00020 Citrate cycle (TCA cycle) 38/1836 60/3449 0.073 0.423 0.391 AT1G01090/AT1G04410/AT1G24180/AT1G30120/AT1G34430/AT1G48030/AT1G53240/AT1G54340/AT1G59900/AT2G05710/AT2G17130/AT2G20420/AT2G22780/AT2G34590/AT2G42790/AT2G47510/AT3G13930/AT3G15020/AT3G17240/AT3G27380/... 38
03022 Basal transcription factors 23/1836 35/3449 0.093 0.493 0.456 AT1G02680/AT1G04950/AT1G05055/AT1G07470/AT1G18340/AT1G54140/AT1G54360/AT1G55300/AT1G55750/AT1G75510/AT2G41630/AT3G10070/AT3G10330/AT3G13445/AT3G61420/AT4G12610/AT4G17020/AT4G20280/AT4G20330/AT4G24440/... 23
03015 mRNA surveillance pathway 50/1836 82/3449 0.095 0.493 0.456 AT1G02140/AT1G03960/AT1G10430/AT1G12920/AT1G13320/AT1G15200/AT1G16610/AT1G17720/AT1G17760/AT1G17980/AT1G27595/AT1G49760/AT1G51510/AT1G51690/AT1G59830/AT1G61010/AT2G25850/AT2G33410/AT2G39260/AT2G42500/... 50
03018 RNA degradation 35/1836 56/3449 0.102 0.506 0.468 AT1G02080/AT1G03330/AT1G07705/AT1G21190/AT1G49760/AT1G55870/AT1G59760/AT1G74030/AT1G76630/AT1G76860/AT1G79090/AT1G80780/AT2G03870/AT2G17510/AT2G29560/AT2G33210/AT2G35920/AT2G36530/AT3G03710/AT3G13300/... 35
00240 Pyrimidine metabolism 58/1836 97/3449 0.113 0.532 0.492 AT1G01210/AT1G09815/AT1G14230/AT1G17410/AT1G29900/AT1G29940/AT1G30820/AT1G50840/AT1G54250/AT1G55810/AT1G72880/AT1G78650/AT2G02970/AT2G15430/AT2G16370/AT2G17420/AT2G21790/AT2G41680/AT2G42120/AT3G03710/... 58
00920 Sulfur metabolism 22/1836 34/3449 0.120 0.538 0.498 AT1G19920/AT1G55880/AT1G55920/AT2G17640/AT2G43750/AT3G01120/AT3G01910/AT3G13110/AT3G22460/AT3G22890/AT3G57050/AT3G59760/AT3G61440/AT4G14680/AT4G39940/AT5G04590/AT5G09290/AT5G43780/AT5G54390/AT5G56760/... 22
00290 Valine, leucine and isoleucine biosynthesis 23/1836 36/3449 0.131 0.559 0.517 AT1G01090/AT1G09620/AT1G14610/AT1G18500/AT1G24180/AT1G30120/AT1G31180/AT1G59900/AT2G31810/AT2G34590/AT2G43090/AT3G10050/AT3G48560/AT3G49680/AT3G58990/AT4G10320/AT4G13430/AT5G14200/AT5G16715/AT5G49030/... 23
04122 Sulfur relay system 10/1836 14/3449 0.135 0.559 0.517 AT1G01290/AT1G16460/AT1G51310/AT1G76170/AT1G79230/AT2G31955/AT2G44270/AT4G35910/AT5G55130/AT5G65720 10
03430 Mismatch repair 21/1836 33/3449 0.152 0.582 0.538 AT1G08130/AT1G09815/AT1G10590/AT1G21690/AT1G65070/AT1G77470/AT1G78650/AT2G29570/AT2G42120/AT3G02920/AT3G18580/AT4G02070/AT4G02460/AT4G09140/AT4G19130/AT4G25540/AT4G28440/AT5G22010/AT5G27740/AT5G45400/... 21
00260 Glycine, serine and threonine metabolism 29/1836 47/3449 0.153 0.582 0.538 AT1G14810/AT1G17745/AT1G18640/AT1G22020/AT1G31230/AT1G36370/AT1G48030/AT1G54100/AT1G74920/AT2G17265/AT2G17630/AT2G26080/AT2G38400/AT2G42490/AT3G10050/AT3G17240/AT3G19480/AT3G48170/AT3G54640/AT4G13930/... 29
04144 Endocytosis 40/1836 67/3449 0.172 0.610 0.564 AT1G15130/AT1G16030/AT1G17730/AT1G21980/AT1G60860/AT1G60890/AT1G77740/AT2G14120/AT2G26420/AT2G27600/AT2G41210/AT2G42010/AT3G05630/AT3G07960/AT3G08530/AT3G09920/AT3G10640/AT3G11130/AT3G12400/AT3G12580/... 40
03020 RNA polymerase 19/1836 30/3449 0.176 0.610 0.564 AT1G01210/AT1G29940/AT1G54250/AT2G15430/AT3G13940/AT3G16980/AT3G22320/AT3G25940/AT3G49000/AT3G52090/AT3G57660/AT4G01590/AT4G16265/AT4G21710/AT4G25180/AT4G35800/AT5G23710/AT5G51940/AT5G59180 19
00670 One carbon pool by folate 12/1836 18/3449 0.182 0.610 0.564 AT1G22020/AT1G31220/AT1G36370/AT1G76730/AT2G16370/AT2G35040/AT2G44160/AT3G59970/AT4G13930/AT4G32520/AT5G26780/AT5G47435 12
04145 Phagosome 37/1836 62/3449 0.185 0.610 0.564 AT1G09210/AT1G19910/AT1G20260/AT1G51740/AT1G64200/AT1G75630/AT1G75780/AT2G16510/AT2G21410/AT2G25610/AT2G28520/AT2G34250/AT3G01390/AT3G28710/AT3G28715/AT3G42050/AT3G58730/AT4G02620/AT4G14960/AT4G17730/... 37
04130 SNARE interactions in vesicular transport 29/1836 48/3449 0.196 0.612 0.566 AT1G11250/AT1G15880/AT1G16240/AT1G26670/AT1G29060/AT1G51740/AT1G79590/AT2G36900/AT2G45200/AT3G05710/AT3G09740/AT3G11820/AT3G24315/AT3G24350/AT3G58170/AT4G02195/AT4G03330/AT4G14455/AT4G14600/AT4G17730/... 29
00620 Pyruvate metabolism 44/1836 75/3449 0.202 0.612 0.566 AT1G01090/AT1G04410/AT1G08110/AT1G11840/AT1G18500/AT1G24180/AT1G30120/AT1G32440/AT1G34430/AT1G48030/AT1G53240/AT1G53310/AT1G54100/AT1G59900/AT1G67280/AT1G79750/AT2G22780/AT2G34590/AT2G36580/AT2G42600/... 44
03440 Homologous recombination 21/1836 34/3449 0.204 0.612 0.566 AT1G09815/AT1G10590/AT1G10930/AT1G50840/AT1G78650/AT2G01440/AT2G31970/AT2G32000/AT2G42120/AT3G02920/AT3G18580/AT4G19130/AT4G28440/AT4G30870/AT5G20850/AT5G45010/AT5G45400/AT5G54260/AT5G57450/AT5G63960/... 21
00520 Amino sugar and nucleotide sugar metabolism 57/1836 99/3449 0.219 0.612 0.566 AT1G01220/AT1G06780/AT1G08200/AT1G17890/AT1G18580/AT1G23190/AT1G50460/AT1G53500/AT1G63000/AT1G64440/AT1G65590/AT1G67070/AT1G70730/AT1G73250/AT2G20810/AT2G27860/AT2G30575/AT2G31390/AT2G35020/AT2G38650/... 57
00100 Steroid biosynthesis 18/1836 29/3449 0.221 0.612 0.566 AT1G07420/AT1G11680/AT1G20330/AT1G50430/AT1G58440/AT1G76090/AT2G07050/AT2G22830/AT2G29390/AT2G34500/AT3G02580/AT3G19820/AT3G52940/AT4G12110/AT4G22756/AT4G34640/AT4G37760/AT5G24140 18
00650 Butanoate metabolism 14/1836 22/3449 0.223 0.612 0.566 AT1G01090/AT1G24180/AT1G30120/AT1G59900/AT1G65960/AT1G79440/AT2G31810/AT2G34590/AT3G15290/AT3G48560/AT4G16210/AT5G17330/AT5G48230/AT5G50850 14
00860 Porphyrin metabolism 24/1836 40/3449 0.242 0.647 0.598 AT1G03475/AT1G08520/AT1G09940/AT1G58290/AT1G69740/AT1G74470/AT2G40300/AT2G44520/AT3G48730/AT3G51820/AT3G56090/AT4G01690/AT4G18480/AT4G25080/AT4G37000/AT5G01600/AT5G04900/AT5G08280/AT5G13630/AT5G26030/... 24
04140 Autophagy - animal 8/1836 12/3449 0.262 0.668 0.618 AT2G37840/AT2G44140/AT3G13970/AT3G53930/AT4G16520/AT4G29380/AT5G17290/AT5G61500 8
00561 Glycerolipid metabolism 21/1836 35/3449 0.263 0.668 0.618 AT1G32200/AT1G54100/AT1G75020/AT1G80460/AT2G19450/AT3G11670/AT3G48000/AT3G56310/AT3G57650/AT4G00550/AT4G01950/AT4G30580/AT4G31780/AT4G33030/AT5G01220/AT5G06090/AT5G07920/AT5G08380/AT5G20410/AT5G60620/... 21
03010 Ribosome 122/1836 220/3449 0.270 0.669 0.619 AT1G02780/AT1G07070/AT1G09590/AT1G09690/AT1G14320/AT1G18540/AT1G22780/AT1G23290/AT1G23410/AT1G35680/AT1G48350/AT1G52300/AT1G56045/AT1G58380/AT1G58684/AT1G58983/AT1G59359/AT1G70600/AT1G71720/AT1G74050/... 122
03410 Base excision repair 23/1836 39/3449 0.288 0.682 0.631 AT1G05900/AT1G08130/AT1G09815/AT1G15970/AT1G19480/AT1G21710/AT1G50840/AT1G75230/AT1G78650/AT1G80420/AT2G29570/AT2G31450/AT2G42120/AT3G12040/AT3G50880/AT3G51880/AT4G02390/AT4G12740/AT4G36050/AT5G13920/... 23
00600 Sphingolipid metabolism 9/1836 14/3449 0.289 0.682 0.631 AT1G72990/AT3G06060/AT3G48780/AT3G54440/AT3G56310/AT4G04930/AT4G36480/AT5G08380/AT5G23670 9
00051 Fructose and mannose metabolism 29/1836 50/3449 0.296 0.682 0.631 AT1G01220/AT1G07110/AT1G12000/AT1G17890/AT1G20950/AT1G50460/AT1G67070/AT1G73250/AT1G76550/AT2G01140/AT2G21170/AT2G22480/AT2G31390/AT2G36460/AT2G39770/AT2G45790/AT3G02570/AT3G20040/AT3G51160/AT3G52930/... 29
00564 Glycerophospholipid metabolism 31/1836 54/3449 0.316 0.711 0.657 AT1G07230/AT1G13560/AT1G15110/AT1G32200/AT1G62430/AT1G68000/AT1G75020/AT1G80950/AT2G32260/AT2G38670/AT2G42010/AT2G45670/AT3G03520/AT3G03530/AT3G05630/AT3G10370/AT3G15730/AT3G16785/AT3G18000/AT3G55030/... 31
00565 Ether lipid metabolism 11/1836 18/3449 0.334 0.725 0.671 AT1G07230/AT1G13560/AT1G80950/AT2G42010/AT2G45670/AT3G03520/AT3G03530/AT3G05630/AT3G15730/AT3G16785/AT4G11850 11
04141 Protein processing in endoplasmic reticulum 73/1836 132/3449 0.346 0.725 0.671 AT1G07400/AT1G09210/AT1G10230/AT1G16030/AT1G16190/AT1G17280/AT1G20140/AT1G30000/AT1G32210/AT1G34130/AT1G51590/AT1G65040/AT1G67490/AT1G75950/AT1G76400/AT1G77510/AT1G79650/AT1G79940/AT2G01470/AT2G01650/... 73
00511 Other glycan degradation 7/1836 11/3449 0.351 0.725 0.671 AT1G65590/AT1G72990/AT3G11040/AT3G26720/AT3G54440/AT3G55260/AT5G05460 7
00760 Nicotinate and nicotinamide metabolism 7/1836 11/3449 0.351 0.725 0.671 AT1G72880/AT2G01350/AT2G23420/AT3G21070/AT4G14930/AT5G20070/AT5G50210 7
00130 Ubiquinone and other terpenoid-quinone biosynthesis 14/1836 24/3449 0.385 0.762 0.705 AT1G06570/AT1G23360/AT1G51680/AT2G30920/AT3G11945/AT3G11950/AT3G21240/AT3G24200/AT3G63410/AT4G05160/AT4G23660/AT4G32770/AT5G53970/AT5G57300 14
00906 Carotenoid biosynthesis 14/1836 24/3449 0.385 0.762 0.705 AT1G06820/AT1G30100/AT1G52340/AT1G78390/AT2G27150/AT3G14440/AT3G53130/AT4G14210/AT4G18350/AT4G19170/AT4G19230/AT5G17230/AT5G57030/AT5G67030 14
00340 Histidine metabolism 11/1836 19/3449 0.432 0.838 0.775 AT1G09795/AT1G43710/AT1G54100/AT1G58080/AT2G36230/AT3G21300/AT3G27180/AT3G48000/AT4G14910/AT4G26900/AT5G63890 11
00590 Arachidonic acid metabolism 7/1836 12/3449 0.477 0.901 0.833 AT1G63460/AT2G25080/AT3G63080/AT4G11600/AT4G29210/AT4G39640/AT5G13520 7
00592 alpha-Linolenic acid metabolism 15/1836 27/3449 0.482 0.901 0.833 AT1G06290/AT1G06310/AT1G13280/AT1G20510/AT1G55020/AT1G67560/AT1G76680/AT2G06050/AT2G35690/AT3G06860/AT3G25780/AT4G16760/AT4G29010/AT5G42650/AT5G65110 15
04070 Phosphatidylinositol signaling system 26/1836 48/3449 0.507 0.918 0.849 AT1G21980/AT1G49340/AT1G60890/AT1G62430/AT1G68000/AT1G77740/AT2G26420/AT2G41210/AT3G02870/AT3G07960/AT3G08510/AT3G09920/AT3G19420/AT3G50110/AT3G56800/AT4G08170/AT4G33770/AT4G38570/AT5G07920/AT5G09290/... 26
00966 Glucosinolate biosynthesis 10/1836 18/3449 0.517 0.918 0.849 AT1G18590/AT1G24100/AT1G74090/AT1G74100/AT2G20610/AT2G22330/AT3G49680/AT4G31500/AT4G39950/AT5G65780 10
04120 Ubiquitin mediated proteolysis 56/1836 105/3449 0.532 0.918 0.849 AT1G04510/AT1G06590/AT1G10230/AT1G14400/AT1G17280/AT1G20140/AT1G26830/AT1G27840/AT1G63800/AT1G65040/AT1G70320/AT1G75950/AT1G78770/AT2G16920/AT2G18290/AT2G20000/AT2G21470/AT2G30110/AT2G32950/AT2G33340/... 56
00052 Galactose metabolism 21/1836 39/3449 0.535 0.918 0.849 AT1G12240/AT1G23190/AT1G50460/AT1G64440/AT1G70730/AT1G72990/AT2G22480/AT3G03250/AT3G10700/AT3G13790/AT3G20040/AT3G54440/AT3G56310/AT4G23920/AT4G26270/AT4G29220/AT5G08380/AT5G11720/AT5G47810/AT5G56630/... 21
00770 Pantothenate and CoA biosynthesis 12/1836 22/3449 0.538 0.918 0.849 AT2G31810/AT2G46110/AT3G17810/AT3G18030/AT3G48560/AT3G49680/AT3G61530/AT4G32180/AT5G12200/AT5G48840/AT5G57850/AT5G65780 12
00350 Tyrosine metabolism 14/1836 26/3449 0.555 0.927 0.858 AT1G06570/AT1G12050/AT1G22430/AT1G22440/AT1G79440/AT2G30970/AT2G42490/AT3G21300/AT3G27180/AT4G31990/AT5G11520/AT5G24760/AT5G43940/AT5G53970 14
00250 Alanine, aspartate and glutamate metabolism 25/1836 47/3449 0.562 0.927 0.858 AT1G17290/AT1G23310/AT1G29900/AT1G51720/AT1G65960/AT1G79440/AT2G16570/AT2G30970/AT2G38400/AT3G20330/AT3G24090/AT3G57610/AT4G31990/AT4G34740/AT4G39660/AT5G07440/AT5G10240/AT5G11520/AT5G12040/AT5G16570/... 25
01040 Biosynthesis of unsaturated fatty acids 17/1836 32/3449 0.577 0.936 0.866 AT1G01710/AT1G04710/AT1G06290/AT1G06310/AT2G33150/AT2G35690/AT2G43710/AT3G02610/AT3G02630/AT3G03980/AT3G11170/AT3G12120/AT3G15850/AT3G55360/AT4G13180/AT4G16760/AT5G65110 17
00790 Folate biosynthesis 8/1836 15/3449 0.601 0.959 0.887 AT1G78660/AT1G78670/AT1G78680/AT2G16370/AT3G07270/AT4G30000/AT5G05980/AT5G62980 8
00710 Carbon fixation by Calvin cycle 40/1836 77/3449 0.635 0.998 0.923 AT1G04410/AT1G17290/AT1G23310/AT1G32440/AT1G53240/AT1G53310/AT1G63290/AT1G71100/AT1G79550/AT1G79750/AT2G01140/AT2G01290/AT2G21170/AT2G22780/AT2G36460/AT2G36580/AT2G42600/AT2G45290/AT3G01850/AT3G04050/... 40
00480 Glutathione metabolism 32/1836 62/3449 0.651 0.999 0.924 AT1G07890/AT1G09420/AT1G24280/AT1G54340/AT1G63460/AT1G63770/AT1G64190/AT1G70310/AT2G21790/AT2G24200/AT2G25080/AT2G30860/AT3G24170/AT3G27060/AT3G43800/AT3G54660/AT3G63080/AT4G02520/AT4G08390/AT4G11600/... 32
00071 Fatty acid degradation 21/1836 41/3449 0.663 0.999 0.924 AT1G04710/AT1G06290/AT1G06310/AT1G22430/AT1G22440/AT1G54100/AT1G77590/AT2G04350/AT2G33150/AT2G35690/AT3G05970/AT3G06810/AT3G23790/AT3G48000/AT4G14070/AT4G16210/AT4G16760/AT5G24760/AT5G43940/AT5G48230/... 21
00400 Phenylalanine, tyrosine and tryptophan biosynthesis 22/1836 43/3449 0.666 0.999 0.924 AT1G25220/AT1G48850/AT1G48860/AT1G69370/AT2G04400/AT2G29690/AT2G30970/AT2G45300/AT3G06350/AT3G29200/AT3G54640/AT4G27070/AT4G31990/AT4G39980/AT5G05590/AT5G05730/AT5G10870/AT5G11520/AT5G17990/AT5G53970/... 22
04626 Plant-pathogen interaction 76/1836 148/3449 0.710 1.000 0.925 AT1G01340/AT1G12220/AT1G12310/AT1G18210/AT1G21550/AT1G32640/AT1G50700/AT1G51660/AT1G62820/AT1G64060/AT1G66400/AT1G72450/AT1G74740/AT1G74950/AT1G80460/AT2G23980/AT2G39940/AT2G41410/AT2G43790/AT2G46430/... 76
00562 Inositol phosphate metabolism 27/1836 54/3449 0.732 1.000 0.925 AT1G07230/AT1G14520/AT1G21980/AT1G49340/AT1G60890/AT1G68000/AT1G77740/AT2G21170/AT2G22240/AT2G26420/AT2G41210/AT3G02870/AT3G03520/AT3G03530/AT3G07960/AT3G08510/AT3G09920/AT3G19420/AT3G50110/AT4G08170/... 27
00300 Lysine biosynthesis 9/1836 19/3449 0.772 1.000 0.925 AT1G14810/AT1G31230/AT1G54100/AT2G44040/AT2G45440/AT3G14390/AT3G57560/AT3G60880/AT5G11880 9
00310 Lysine degradation 7/1836 15/3449 0.779 1.000 0.925 AT1G54100/AT3G48000/AT3G55410/AT4G16210/AT4G26910/AT5G48230/AT5G55070 7
00900 Terpenoid backbone biosynthesis 23/1836 48/3449 0.813 1.000 0.925 AT1G31910/AT1G63970/AT1G74470/AT1G76490/AT1G78510/AT2G17370/AT2G17570/AT2G18620/AT2G18640/AT2G34630/AT2G38700/AT3G02780/AT3G14510/AT3G32040/AT3G54250/AT4G17190/AT4G36810/AT4G38460/AT5G11380/AT5G16440/... 23
00410 beta-Alanine metabolism 14/1836 30/3449 0.818 1.000 0.925 AT1G06550/AT1G54100/AT1G65960/AT1G70310/AT2G42490/AT3G06810/AT3G17810/AT3G48000/AT4G16210/AT5G12200/AT5G17330/AT5G48840/AT5G53120/AT5G65940 14
00330 Arginine and proline metabolism 33/1836 68/3449 0.818 1.000 0.925 AT1G20270/AT1G44820/AT1G51720/AT1G54100/AT1G67550/AT1G70310/AT1G75330/AT1G80600/AT2G16500/AT2G22910/AT2G30970/AT3G02470/AT3G30775/AT3G47450/AT3G48000/AT3G57560/AT4G08900/AT4G17830/AT4G31990/AT4G33910/... 33
00061 Fatty acid biosynthesis 12/1836 27/3449 0.867 1.000 0.925 AT1G62640/AT1G74960/AT2G04540/AT2G30200/AT2G43710/AT3G02610/AT3G02630/AT3G03980/AT3G25110/AT4G13050/AT4G13180/AT5G10160 12
00941 Flavonoid biosynthesis 8/1836 19/3449 0.886 1.000 0.925 AT2G30490/AT2G40890/AT3G51240/AT4G33360/AT4G34050/AT5G07990/AT5G08640/AT5G48930 8
00380 Tryptophan metabolism 17/1836 38/3449 0.888 1.000 0.925 AT1G20620/AT1G24100/AT1G54100/AT1G74100/AT2G20610/AT2G22330/AT2G44490/AT3G04600/AT3G44310/AT3G48000/AT3G55410/AT4G16210/AT4G31500/AT4G35090/AT4G39950/AT5G20960/AT5G48230 17
00640 Propanoate metabolism 13/1836 31/3449 0.926 1.000 0.925 AT1G06550/AT1G54100/AT1G75280/AT2G20420/AT3G06810/AT3G48000/AT4G13660/AT4G16210/AT4G17260/AT5G08300/AT5G23250/AT5G48230/AT5G65940 13
00280 Valine, leucine and isoleucine degradation 20/1836 46/3449 0.931 1.000 0.925 AT1G03090/AT1G04710/AT1G06550/AT1G21400/AT1G48030/AT1G54100/AT1G55510/AT2G33150/AT3G06810/AT3G06850/AT3G17240/AT3G45300/AT3G48000/AT3G49680/AT4G16155/AT4G16210/AT5G48230/AT5G57850/AT5G65780/AT5G65940 20
00053 Ascorbate and aldarate metabolism 14/1836 34/3449 0.944 1.000 0.925 AT1G07890/AT1G14520/AT1G54100/AT3G02870/AT3G09940/AT3G47930/AT3G48000/AT3G52880/AT4G08390/AT4G32320/AT4G35000/AT5G03630/AT5G15490/AT5G55120 14
00630 Glyoxylate and dicarboxylate metabolism 14/1836 34/3449 0.944 1.000 0.925 AT1G04410/AT1G53240/AT2G05710/AT2G22780/AT2G42790/AT3G14150/AT3G15020/AT3G47520/AT3G58750/AT4G18360/AT4G26970/AT4G35830/AT5G47435/AT5G48230 14
03030 DNA replication 19/1836 45/3449 0.950 1.000 0.925 AT1G08130/AT1G08840/AT1G09815/AT1G10590/AT1G21690/AT1G50840/AT1G77470/AT1G78650/AT2G29570/AT2G42120/AT3G02920/AT3G18580/AT4G19130/AT4G28440/AT5G22010/AT5G22110/AT5G27740/AT5G45400/AT5G63960 19
00910 Nitrogen metabolism 18/1836 43/3449 0.951 1.000 0.925 AT1G37130/AT1G51720/AT1G77760/AT2G15620/AT2G37040/AT2G41220/AT3G23490/AT3G44310/AT3G53260/AT3G57050/AT4G33580/AT5G04230/AT5G07440/AT5G10240/AT5G16570/AT5G18170/AT5G37600/AT5G53460 18
00430 Taurine and hypotaurine metabolism 4/1836 12/3449 0.954 1.000 0.925 AT1G65960/AT4G29210/AT4G39640/AT5G17330 4
00270 Cysteine and methionine metabolism 36/1836 84/3449 0.979 1.000 0.925 AT1G02500/AT1G05010/AT1G14810/AT1G16460/AT1G31230/AT1G55880/AT1G55920/AT1G70310/AT1G77330/AT1G79230/AT2G17640/AT2G19590/AT2G30970/AT2G36880/AT2G43750/AT3G01120/AT3G02470/AT3G03780/AT3G13110/AT3G22460/... 36
00460 Cyanoamino acid metabolism 10/1836 28/3449 0.980 1.000 0.925 AT1G22020/AT1G36370/AT3G07990/AT3G44310/AT3G61440/AT4G13930/AT4G29210/AT4G32520/AT4G39640/AT5G26780 10
00908 Zeatin biosynthesis 7/1836 22/3449 0.988 1.000 0.925 AT1G68460/AT2G27760/AT4G29740/AT5G05860/AT5G05870/AT5G19040/AT5G20040 7
04710 Circadian rhythm 6/1836 24/3449 0.999 1.000 0.925 AT1G10230/AT1G20140/AT1G75950/AT3G15620/AT5G20570/AT5G42190 6
00960 Tropane, piperidine and pyridine alkaloid biosynthesis 5/1836 22/3449 0.999 1.000 0.925 AT2G42490/AT4G31990/AT5G06060/AT5G11520/AT5G53970 5
00945 Stilbenoid, diarylheptanoid and gingerol biosynthesis 23/1836 67/3449 0.999 1.000 0.925 AT1G01600/AT1G13090/AT2G30490/AT2G30750/AT2G40890/AT3G26210/AT3G26300/AT3G48360/AT3G56630/AT4G34050/AT4G37310/AT4G37330/AT4G37340/AT4G37400/AT4G37410/AT5G04630/AT5G04660/AT5G25120/AT5G36220/AT5G42590/... 23
04650 Natural killer cell mediated cytotoxicity 3/1836 17/3449 1.000 1.000 0.925 AT1G10210/AT4G01370/AT4G26070 3
00195 Photosynthesis 13/1836 46/3449 1.000 1.000 0.925 AT1G06680/AT1G10960/AT1G15700/AT1G20340/AT1G30510/AT1G32550/AT1G55670/AT1G76100/AT1G79040/AT2G27510/AT4G05390/AT4G28660/AT4G32260 13
00196 Photosynthesis - antenna proteins 4/1836 22/3449 1.000 1.000 0.925 AT2G40100/AT3G47470/AT3G54890/AT5G01530 4
00904 Diterpenoid biosynthesis 1/1836 12/3449 1.000 1.000 0.925 AT1G80340 1
00903 Limonene degradation 22/1836 70/3449 1.000 1.000 0.925 AT1G01600/AT1G13090/AT1G18270/AT2G30750/AT3G26210/AT3G26300/AT3G48000/AT3G48360/AT3G56630/AT4G16210/AT4G37310/AT4G37330/AT4G37340/AT4G37400/AT4G37410/AT5G04630/AT5G04660/AT5G25120/AT5G36220/AT5G42590/... 22
00500 Starch and sucrose metabolism 40/1836 114/3449 1.000 1.000 0.925 AT1G06780/AT1G12240/AT1G18580/AT1G23190/AT1G50460/AT1G68020/AT1G70730/AT1G73370/AT2G20810/AT2G30575/AT2G31390/AT2G38650/AT2G45220/AT3G01040/AT3G01180/AT3G02350/AT3G03250/AT3G13790/AT3G20040/AT3G23820/... 40
00360 Phenylalanine metabolism 28/1836 92/3449 1.000 1.000 0.925 AT1G06570/AT1G51680/AT2G18980/AT2G30490/AT2G30970/AT2G37040/AT2G37130/AT2G40890/AT2G42490/AT3G21240/AT3G28200/AT3G49120/AT3G53260/AT4G05160/AT4G30170/AT4G31990/AT4G34050/AT4G37520/AT4G37530/AT5G04230/... 28
00040 Pentose and glucuronate interconversions 13/1836 59/3449 1.000 1.000 0.925 AT1G63290/AT2G45220/AT3G01850/AT3G03250/AT3G29090/AT3G48000/AT3G55140/AT4G24780/AT5G04310/AT5G04970/AT5G15490/AT5G49650/AT5G61410 13
04075 Plant hormone signal transduction 86/1836 232/3449 1.000 1.000 0.925 AT1G01360/AT1G04310/AT1G04550/AT1G07430/AT1G08320/AT1G10470/AT1G14920/AT1G17550/AT1G19840/AT1G22070/AT1G27320/AT1G28130/AT1G32640/AT1G49720/AT1G51950/AT1G59940/AT1G60940/AT1G66350/AT1G72450/AT1G74950/... 86
00940 Phenylpropanoid biosynthesis 32/1836 109/3449 1.000 1.000 0.925 AT1G15950/AT1G51680/AT1G80820/AT2G18980/AT2G30490/AT2G37040/AT2G37130/AT2G40890/AT3G19450/AT3G21240/AT3G21560/AT3G28200/AT3G49120/AT3G50740/AT3G53260/AT4G05160/AT4G30170/AT4G34050/AT4G34230/AT4G36220/... 32

表7.2 KEGG富集分析部分结果:
ID:KEGG通路标识符,前面省略"map",比如“04120”代表“map04120”;
Description:KEGG通路的文字描述;
GeneRatio:该条目基因比例,分子是富集到这个KEGG通路上的基因的数目,分母是所有peak关联基因的数目;
BgRatio:背景比例,分母是物种全部有KEGG注释的基因的数目,分子是这些基因中注释到这个KEGG通路上面的基因的数目;
RichFactor​​:富集因子(Enrichment Factor)= GeneRatio / BgRatio;
​​FoldEnrichment​:富集倍数(Fold Enrichment)= (富集通路基因数 / 输入基因数) / (背景通路基因数 / 背景总基因数);
​​zScore​:标准化富集得分(基于超几何分布的 Z 值);
pvalue:富集的p值;
p.adjust:使用BH校正之后的p值;
qvalue:q值,使用FDR校正之后的p值,q-value相比于p-value更加严格,表示p-value产生假阳性的概率;
geneID:富集到这个KEGG通路上面的具体的基因ID;
Count:富集到这个KEGG通路上面的基因的数目。



图7.4 Peak关联基因KEGG气泡图。纵坐标是KEGG通路名称,横坐标是对应KEGG通路中检出的基因占背景基因的个数,颜色代表显著性,气泡大小代表该通路基因比例。



图7.5 Peak关联基因KEGG条状图。纵坐标是KEGG通路名称,横坐标是出现在该通路的基因数,颜色代表显著性。






8. Motif分析

对于一些基因元件或peak区域,分析这些区域的序列中是否有频繁出现的一些基序(motif),从而可以进一步分析这些基序相关的转录因子或结合蛋白。各种蛋白通过不同的motif识别蛋白-DNA结合位点,因此我们通过Homer(version 4.11.1)(Heinz S et al., 2010)来提取peak所在区间的序列对peak之间共有的motif进行扫描,查找其共有的motif区域,基于富集分析预测可能与peaks结合的蛋白。对于有组内生物学重复的样本,我们取其交集({组名}_consensus)进行motif分析。各样本分析结果位于report/result/7.motif文件夹中:
homerMotifs.motifs8/10/12:这些是de novo(从头预测)查找motif的输出文件,由motif长度分隔。
homerMotifs.all.motifs:由所有homerMotifs.motifs组成的连接文件。
motifFindingParameters.txt:用于执行findMotifsGenome.pl的命令,包含使用的参数
knownResults.txt:基于已知motifs富集的统计信息的文本文件(在EXCEL/WPS中打开)。
seq.autonorm.tsv:用于lower-order oligo标准化的autonormalization统计。
knownResults.html:基于已知motifs富集的格式化输出。
homerResults.html:de novo预测motif的格式化输出。



8.1 已知Motif分析

基于已知motifs富集的分析结果,请打开下方链接查看,其文件对应在各个文件夹下的“knownResults.html”

结果说明:
Rank(序号):根据显著性q-value排序;
Motif:展示motif的序列特征的logo图,可直观了解motif中各碱基的分布和保守性;
Name(Motif名称):HOMER数据库中motif的名称;
P-value(P值):未校正的显著性(基于超几何分布或泊松分布);
Log P-value(对数P值):P值的对数值,绝对值越大表示显著性越高;
q-value (Benjamini)(q值,Benjamini校正值):通过Benjamini-Hochberg方法进行的多重假设检验校正后的P值;
# Target Sequences with Motif(含有该motif的目标序列数量):包含该motif的基因组序列数量;
% of Targets Sequences with Motif(目标序列中含有该motif的比例):包含该motif的基因组序列占输入序列的百分比;
of Background Sequences with Motif (背景序列中含有该motif序列数量):背景序列(通常是全基因组序列)中包含该motif的序列数量;
% of Background Sequences with Motif(背景序列中含有该motif的比例):背景序列中含有该motif的序列所占的百分比。
Motif File:motif碱基分步矩阵结果;
SVG:motif的svg可视化文件;



8.2 从头预测Motif分析

基于de novo 从头预测的motifs富集的分析结果,请打开下方链接查看,其文件对应在各个文件夹下的“homerResults.html”。

结果说明:
Rank(序号):根据显著性q-value排序;
Motif:展示motif的序列特征的logo图,可直观了解motif中各碱基的分布和保守性;
P-value(P值):未校正的显著性(基于超几何分布或泊松分布);
Log P-value(对数P值):P值的对数值,绝对值越大表示显著性越高;
% of Targets(目标序列中含有该motif的比例):靶标序列占总序列百分比;
% of Background(背景序列中含有该motif的比例):背景序列占总序列百分比;
STD(Bg STD):靶标和背景的序列集出现偏离序列中心200bp的标准偏差;
Best Match/Details:最佳匹配的结果,点击 More information 后会出现更多信息——该motif的一些基本信息,如链接到motfi文件的超链接,下方match查看denovo motif和已知的motif的相似性比对结果打分, score越高代表越相似。;
Motif File:motif碱基分步矩阵结果。





参考文献


Andrews S. FastQC: a quality control tool for high throughput sequence data.https://www.bioinformatics.babraham.ac.uk/projects/fastqc/, 2010.
Chen S, Zhou Y, Chen Y, Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018 Sep 1;34(17):i884-i890.
Heinz S, Benner C, Spann N, Bertolino E, Lin YC, Laslo P, Cheng JX, Murre C, Singh H, Glass CK. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol Cell. 2010 May 28;38(4):576-89.
Kaya-Okur HS, Wu SJ, Codomo CA, Pledger ES, Bryson TD, Henikoff JG, Ahmad K, Henikoff S. CUT&Tag for efficient epigenomic profiling of small samples and single cells. Nat Commun. 2019 Apr 29;10(1):1930. doi: 10.1038/s41467-019-09982-5. PMID: 31036827; PMCID: PMC6488672.
Langmead, B., Salzberg, S. Fast gapped-read alignment with Bowtie 2. Nat Methods 9, 357–359 (2012).
Landt SG, Marinov GK, Kundaje A, Kheradpour P, Pauli F, Batzoglou S, Bernstein BE, Bickel P, Brown JB, Cayting P, Chen Y, DeSalvo G, Epstein C, Fisher-Aylor KI, Euskirchen G, Gerstein M, Gertz J, Hartemink AJ, Hoffman MM, Iyer VR, Jung YL, Karmakar S, Kellis M, Kharchenko PV, Li Q, Liu T, Liu XS, Ma L, Milosavljevic A, Myers RM, Park PJ, Pazin MJ, Perry MD, Raha D, Reddy TE, Rozowsky J, Shoresh N, Sidow A, Slattery M, Stamatoyannopoulos JA, Tolstorukov MY, White KP, Xi S, Farnham PJ, Lieb JD, Wold BJ, Snyder M. ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia. Genome Res. 2012 Sep;22(9):1813-31.
Ramírez F, Ryan DP, Grüning B, Bhardwaj V, Kilpert F, Richter AS, Heyne S, Dündar F, Manke T. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Res. 2016 Jul 8;44(W1):W160-5.
Stark, Rory and Gord Brown. “DiffBind: Differential binding analysis of ChIP-seq peak data.” (2012).
Tarasov A, Vilella AJ, Cuppen E, Nijman IJ, Prins P. Sambamba: fast processing of NGS alignment formats. Bioinformatics. 2015 Jun 15;31(12):2032-4.
Wang, Q., Li, M., Wu, T., Zhan, L., Li, L., Chen, M., Xie, W., Xie, Z., Hu, E., Xu, S., & Yu, G. (2022). Exploring epigenomic datasets by ChIPseeker. Current Protocols, 2, e585.
Wu T, Hu E, Xu S, Chen M, Guo P, Dai Z, Feng T, Zhou L, Tang W, Zhan L, Fu X, Liu S, Bo X, Yu G. clusterProfiler 4.0: A universal enrichment tool for interpreting omics data. Innovation (Camb). 2021 Jul 1;2(3):100141.
Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, Nusbaum C, Myers RM, Brown M, Li W, Liu XS. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 2008;9(9):R137.

联系我们

官网:武汉鸿源韬生物科技有限公司 咨询热线:18086690478

邮箱:sales@hyt-bio.com 地址:湖北省武汉市江夏区郑店街光谷南(郑店)大健康产业园东湖高新国际健康城B地块1号楼7层02号房

报告文件结构

报告文件结构