tozfcd5d | Knowledge4COVID-19

Property	Value
?:abstract	We construct a simple workflow for fluent genomics data analysis using the R/Bioconductor ecosystem. This involves three core steps: import the data into an appropriate abstraction, model the data with respect to the biological questions of interest, and integrate the results with respect to their underlying genomic coordinates. Here we show how to implement these steps to integrate published RNA-seq and ATAC-seq experiments on macrophage cell lines. Using tximeta, we import RNA-seq transcript quantifications into an analysis-ready data structure, called the SummarizedExperiment, that contains the ranges of the reference transcripts and metadata on their provenance. Using SummarizedExperiments to represent the ATAC-seq and RNA-seq data, we model differentially accessible (DA) chromatin peaks and differentially expressed (DE) genes with existing Bioconductor packages. Using plyranges we then integrate the results to see if there is an enrichment of DA peaks near DE genes by finding overlaps and aggregating over log-fold change thresholds. The combination of these packages and their integration with the Bioconductor ecosystem provide a coherent framework for analysts to iteratively and reproducibly explore their biological data.
is ?:annotates of	<https://research.tib.eu/covid-19/entity/tozfcd5d_hasAnnotation_C0008546> <https://research.tib.eu/covid-19/entity/tozfcd5d_hasAnnotation_C0017337> <https://research.tib.eu/covid-19/entity/tozfcd5d_hasAnnotation_C0017428> <https://research.tib.eu/covid-19/entity/tozfcd5d_hasAnnotation_C0037088>
?:creator	<https://research.tib.eu/covid-19/entity/Lee%2C_Stuart%3B_Lawrence%2C_Michael%3B_Love%2C_Michael_I>
?:doi	10.12688/f1000research.22259.1
?:doi	<https://research.tib.eu/covid-19/entity/10.12688/f1000research.22259.1>
?:journal	F1000Research
?:license	cc-by
?:pmid	<https://research.tib.eu/covid-19/entity/32528659.0>
?:pmid	32528659.0
?:publication_isRelatedTo_Disease	<https://research.tib.eu/covid-19/entity/COVID-19>
?:source	Medline
?:title	Fluent genomics with plyranges and tximeta.
?:type	<https://research.tib.eu/covid-19/vocab/Publication>
?:year	2020

Metadata

Anon_0

<http://www.w3.org/1999/02/22-rdf-syntax-ns#type>

<http://purl.org/net/provenance/ns#DataItem>

<http://www.w3.org/1999/02/22-rdf-syntax-ns#type>

<http://www.w3.org/2004/03/trix/rdfg-1/Graph>

<http://xmlns.com/foaf/0.1/primaryTopic>

<https://research.tib.eu/covid-19/entity/C0035222_COEXISTS_WITH_C0020440>

<http://xmlns.com/foaf/0.1/topic>

Anon_0

<http://www.ontologydesignpatterns.org/cp/owl/informationrealization.owl#realizes>

<https://research.tib.eu/covid-19/data/entity/C0035222_COEXISTS_WITH_C0020440>

<http://purl.org/net/provenance/ns#createdBy>

Anon_1 (more)

expand all