flexiMAP: A regression-based method for discovering differential alternative polyadenylation events in standard RNA-seq data
Description
For the "main" dataset, polyadenylation sites splitting each transcript into two isoforms (short and long) were obtained from the poly(A) site atlas (Gruber et al., 2016) for 11000 human transcripts. Each isoform (“short” and “short + long”) was simulated as a different transcript. The expression of the “shot + long” isoform was unchanged between conditions, whereas eleven different fold changes were applied between conditions for the “short” isoform in order to produce a range of different ratios, R. Hence, each fold change is represented by ~ 1000 transcripts in the dataset. Additionally, for each fold change category we assigned 100 different mean expression levels (from 100 to 1000) with the aim of sampling the effect of the expression level on the ability of the method to detect alternative polyadenylation events.
For the "biased" dataset, the aim was to create a scenario where fold changes between two conditions are confounded by the presence of an additional factor. In the specific example set up, we created an imbalanced dataset with 1000 transcripts where male and female-origin samples are present in unequal numbers in the control (7 males and 3 females) and condition (3 males and 7 females) groups. Although the group membership for the factor of interest (condition) plays no role in the choice of polyadenylation site of these transcripts, membership to male or female group does, confounding the outcome of methods that do not take into account additional covariates.
Collection Method
Data Objects
Offline / Analogue Data Records
There are no offline / analogue datasets associated with this recordExternal Data Records
There are no external datasets associated with this recordDigital Data Downloads
To download and items from this dataset, you must agree to abide by the licence attached to the individual items. If you make use of any item you download, you must also cite it in any publication or outputs of your own.
If you have any questions or would like additional information, please contact us at researchdata@bbk.ac.uk.
Additional Metadata
Data
Metadata
Dataset Title: | flexiMAP: A regression-based method for discovering differential alternative polyadenylation events in standard RNA-seq data |
---|---|
Creators: | Szkop, Krzysztof J. and Moss, David S. and Nobeli, Irene |
School/Department: | Birkbeck Schools and Research Centres > School of Science > Biological Sciences |
Data collection method: | Simulated RNA-seq data using polyester package (Frazee et al., 2015) |
Statement on legal, ethical, and access issues: | Not applicable |
Depositing User: | Krzysztof Szkop |
Date Deposited: | 24 Jan 2019 10:47 |
Last Modified: | 01 Jul 2022 16:01 |
Publisher: | Birkbeck College, University of London |
Export / Share Citation
Impact & Reach
Additional statistics for this dataset are available via IRStats2.