Human OneGenE - Downloads

The following downloads are related to open data.

Onegene Data

OneGenE is a method to compute ranked candidate gene lists that expands known local gene networks given gene expression data. As its predecessor NES2RA, OneGenE is based on the systematic and iterative application of the skeleton function of the PC algorithm on subsets of the input data. OneGenE aims to overcome the large latency of NES2RA when applied to LGNs, by pre-computing partial results, namely single-gene NES2RA expansions, on the BOINC platform.
(Blanzieri et. al. https://ieeexplore.ieee.org/document/9224179)

The single-gene NES2RA expansions of FANTOM-full were submitted on the BOINC platform, with a tile size of 1000 transcripts, 1000 iterations and 0.05 as alpha threshold. The output as presented here is a list of pair of transcripts with the absolute and relative frequency of detection the latter while expanding the gene regulatory network of the former.

Size: 1.98 GiB

Isoforms list & annotation file

The list of 87554 isoforms present in the transcriptomic data set.

Size: 6.93 MiB

Transcriptomic dataset

The human transcriptome data used in this work have been downloaded from the repository of the FANTOM5 project (http://fantom.gsc.riken.jp/5/). FANTOM is an international research consortium that generates and shares high-quality transcriptome datasets (CC BY 4.0)
The raw FANTOM5 dataset amounts to 1829 samples, encompassing human cell lines (271), primary cells (564) and tissues (188) also part of time course experiments (785) and fractionations/perturbations (21). FANTOM5 identified 201802 distinct genomic transcription start site (TSS) locations, corresponding to bona-fide transcript isoforms. A first filtering has been applied on the dataset to exclude unknown transcripts, i.e. without an annotated HGNC symbol (https://www.genenames.org/). This step resulted in a collection of 87554 transcripts, associated with 18889 genes, constituting our full version of the dataset (FANTOM-full)” (Blanzieri et. al. https://ieeexplore.ieee.org/document/9224179)

Size: 135.59 MiB

Pearson correlations

The values of Pearson correlation computed on the transcriptomic dataset above.

Size: 24.40 GiB

Pearson correlations minimal

The same of Pearson data but limited to the pair who has a non zero relative frequency in the OneGenE data.

Size: 726.14 MiB

Download of specific expansion lists

At the link specified below the user can make a single or multiple queries, using the HGNC gene symbols, to retrieve the corresponding annotated expansion list(s) and a README file as a compressed .zip file.

http://ibdm.disi.unitn.it/onegene/fantom/onegene-fantom.php