Upload Files for Analysis
Help & Guidelines
Distance-based Clustering
Groups genes based on their genomic distance. Suitable for identifying local gene clusters.
TAD-based Clustering
Uses Topologically Associating Domains to group genes. Better for capturing 3D genome organization.
Sample Files
Distance-based Example:
- Expression File (Viervaara_HS.hg19.exp.txt)
- Peak File (Viervaara_HS.hg19.HSF1.bed)
- RefSeq File (hg19.refGene.txt.gz)
TAD-based Example:
- Expression File (Viervaara_HS.hg19.exp.txt)
- Peak File (Viervaara_HS.hg19.HSF1.bed)
- TAD File (DomainsK562-rao2014-merged.bed)
- RefSeq File (hg19.refGene.txt.gz)
Recommended Parameters
- RefSeq Version: hg19
- TSS Upstream: 1000 bp
- TSS Downstream: 1000 bp
- Cluster Distance: 100000 bp
- Induced Condition: TNFa
These sample files contain ChIP-seq and expression data from a TNFα stimulation study in endothelial cells. You can use them to test the analysis pipeline with the recommended parameters.
RefSeq File (BED)
chr1 1000 2000 gene1
Expression File (TSV)
geneID log2FC gene1 2.50 gene2 1.21
Peak File (BED/BEDGraph/narrowPeak/broadPeak/GFF/GTF)
chr1 1000 2000 (peak1)
Supports multiple genomic formats including BED, BEDGraph, narrowPeak, broadPeak, GFF, and GTF
TAD File (BED/BEDGraph/GFF/GTF)
chr3 5639 5705 (tad1)
Supports BED, BEDGraph, GFF, and GTF formats
TSS Region
Define the region around the Transcription Start Site (TSS) to be analyzed.
- Upstream: 1000 bp recommended
- Downstream: 1000 bp recommended
Cluster Distance
For distance-based clustering, determines maximum distance between genes in a cluster.
- 100000 bp recommended
- Smaller values: tighter clusters
- Larger values: broader clusters