Package 'blaseRdata'

Title: Supporting Data for the blaseRtools Package
Description: What the package does (one paragraph).
Authors: Brad Blaser [aut, cre]
Maintainer: Brad Blaser <[email protected]>
License: MIT + file LICENSE
Version: 0.0.0.9013
Built: 2025-01-22 05:44:28 UTC
Source: https://github.com/blaserlab/blaseRdata

Help Index


Genes to Remove from ScRNA-seq Analysis: Zebrafish

Description

A precompiled list of mitochondrial and ribosomal gene names to remove.

Usage

dr11_remove_genes

Format

A character vector

Details

See data-raw/remove_genes

Source

Ensembl


Zebrafish Gene TSS

Description

From TxDb.Drerio.UCSC.danRer11.refGene

Usage

dr11_tss

Format

A GRanges object:

Details

see data-raw/tss.R


Drerio

Description

This is a copy of the Drerio object from BSgenome.Drerio.UCSC.danRer11. It is here for more convenient use with blaseRtools functions and to reduce dependencies.

Usage

Drerio

Format

A BSgenome object:

Details

see data-raw/grcz11_objects.R

Source

UCSC danRer11 genome build


Human Gene Model

Description

Full Genome Model for humans. Based on GRCh38 genome build.

Usage

hg38_granges

Format

A GRanges object:

Details

see data-raw/hg38_objects.R

Source

http://ftp.ensembl.org/pub/release-104/gff3/homo_sapiens/


Reduced Human Gene Model

Description

Similar to the Human Gene Model "hg38_granges" but with different metadata columns and only with 5', CDS, and 3' UTR ranges. Also has principle isoform data.

Usage

hg38_granges_reduced

Format

A GRanges object:

Details

see data-raw/trace_objects.R

Source

http://ftp.ensembl.org/pub/release-104/gff3/homo_sapiens/


Genes to Remove from ScRNA-seq Analysis: Human

Description

A precompiled list of mitochondrial and ribosomal gene names to remove.

Usage

hg38_remove_genes

Format

A character vector

Details

See data-raw/remove_genes

Source

Ensembl


Human Gene TSS

Description

From TxDb.Hsapiens.UCSC.hg38.knownGene

Usage

hg38_tss

Format

A GRanges object:

Details

see data-raw/tss.R


Hsapiens

Description

This is a copy of the Hsapiens object from BSgenome.Hsapiens.UCSC.hg38. It is here for more convenient use with blaseRtools functions and to reduce dependencies.

Usage

Hsapiens

Format

A BSgenome object:

Details

see data-raw/hg38_objects.R

Source

UCSC hg38 genome build


Jaspar 2020 TF Motifs

Description

JASPAR 2020 TF Motif database in character string format. Mostly for internal use.

Usage

meme

Format

A character vector.

Details

see data-raw/meme.R

Source

JASPAR2020_CORE_vertebrates_non-redundant_pfms_jaspar.txt


Genes to Remove from ScRNA-seq Analysis: Mouse

Description

A precompiled list of mitochondrial and ribosomal gene names to remove.

Usage

mm39_remove_genes

Format

A character vector

Details

See data-raw/remove_genes

Source

Ensembl


Broad Institute MSIGDB Gene Set Metadata

Description

Gene set metadata for the whole MSIGDB. Most gene sets are known by "STANDARD_NAME". You can filter the gene set list by supplying a named filter list to the bb_extract_msig function. The name of each list element should be one of the column names in this metadata table and the list element contents should be the values to filter for. Filtering works in an additive way, meaning if you supply a filter list with two elements it will extract gene sets passing filters 1 AND 2.

Usage

msigdb_geneset_metadata

Format

A data frame with 32796 rows and 18 variables:

STANDARD_NAME

character

SYSTEMATIC_NAME

character

HISTORICAL_NAME

character

ORGANISM

character

PMID

character

AUTHORS

character

GEOID

character

EXACT_SOURCE

character

GENESET_LISTING_URL

character

EXTERNAL_DETAILS_URL

character

CHIP

character

CATEGORY_CODE

character

SUB_CATEGORY_CODE

character

CONTRIBUTOR

character

CONTRIBUTOR_ORG

character

DESCRIPTION_BRIEF

character

DESCRIPTION_FULL

character

TAGS

character

Details

Use with bb_extract_msig

Source

https://www.gsea-msigdb.org/gsea/msigdb/ accessed January 12 2022.


Broad Institute MSIGDB Gene Sets

Description

List of MSIGDB gene sets. Most gene sets are commonly known by "STANDARD_NAME". This is a unique value and is provided as the name for each of the elements in this list. If you want to select a single gene set by name you can use the usual R syntax for selecting named elements: msigdb_genesets$<gene set name>. If you want to select by other metadata features you should use bb_extract_msig.

Usage

msigdb_genesets

Format

A list.

Details

Use with bb_extract_msig

Source

https://www.gsea-msigdb.org/gsea/msigdb/ accessed January 12 2022.


Vignette QC Results

Description

Standard bb_qc function output

Usage

vig_qc_res

Format

a list

Details

See blaseRtools scRNAseq Vignette

Source

vignette_cds


ScRNA-seq Vignette CDS

Description

A small example CDS for the blaseRtools vignette

Usage

vignette_cds

Format

A cellDataSet object

Details

See blaseRtools scRNAseq Vignette

Source

https://www.10xgenomics.com/resources/datasets/500-human-pbm-cs-3-lt-v-3-1-chromium-controller-3-1-low-6-1-0 and https://www.10xgenomics.com/resources/datasets/500-human-pbm-cs-3-lt-v-3-1-chromium-x-3-1-low-6-1-0


BlaseRtools ScRNA-seq Vignette CDS Pseudobulk Results

Description

Pseudobulk header, results and qc plots.

Usage

vignette_pseudobulk_res

Format

A list of pseudobulk results

Details

See blaseRtools scRNAseq Vignette


BlaseRtools ScRNA-seq Vignette CDS Top Markers

Description

Top Markers for the vignette CDS

Usage

vignette_top_markers

Format

A data frame with 1100 rows and 11 variables:

gene_id

character Ensembl gene identifier

gene_short_name

character Gene Name

cluster_method

character Clustering Method

cell_group

character Cell Cluster or Partition

marker_score

double Composite score of fraction expressing and level of expression.

mean_expression

double Mean marker expression

fraction_expressing

double Fraction expressing

specificity

double Marker specificity

pseudo_R2

double Regression pseudo R squared

marker_test_p_value

double Wald test P value

marker_test_q_value

double q value

Details

See blaseRtools scRNAseq Vignette


Wordlist Hashtable

Description

Wordlist from the Grady Parts of Speech database. In order to reuse, 1.) Calculate the md5sum of the object you want to hash, 2.) Take the first 5 or more hex digits and fill them in this expression: as.integer(as.hexmode("<hex digits>") %% 226857) + 1. This will return an integer between 1 and 226857 which is the length of the wordlist hashtable. Then select the word according to the integer/index value using something like: wordlist %>% filter(index == integer) %>% pull(word).

Usage

wordhash

Format

A data frame with 226857 rows and 2 variables:

index

integer

word

character

Details

see data-raw/wordhash.R for construction

Source

https://www.gutenberg.org/ebooks/3201


Zebrafish Gene Model

Description

This is a granges object containing the full genome model from the zfin database. Based on GRCz11 genome build.

Usage

zfin_granges

Format

A GRanges object:

Details

see data-raw/grcz11_objects.R

Source

https://zfin.org/downloads/zfin_genes.gff3 release April 2018


Reduced Zebrafish Gene Model

Description

Similar to the Zebrafish Gene Model but with different metadata columns and only with 5', CDS and 3' UTR ranges. Also has principle isoform data.

Usage

zfin_granges_reduced

Format

A GRanges object:

Details

see data-raw/trace_objects.R

Source

https://zfin.org/downloads/zfin_genes.gff3 release April 2018