Package 'blaseRdata' reference manual

Title:	Supporting Data for the blaseRtools Package
Description:	What the package does (one paragraph).
Authors:	Brad Blaser [aut, cre]
Maintainer:	Brad Blaser <[email protected]>
License:	MIT + file LICENSE
Version:	0.0.0.9013
Built:	2025-02-21 06:04:15 UTC
Source:	https://github.com/blaserlab/blaseRdata

Genes to Remove from ScRNA-seq Analysis: Zebrafish

Description

A precompiled list of mitochondrial and ribosomal gene names to remove.

Usage

dr11_remove_genes
dr11_remove_genes

Format

A character vector

Details

See data-raw/remove_genes

Source

Ensembl

Drerio

Description

This is a copy of the Drerio object from BSgenome.Drerio.UCSC.danRer11. It is here for more convenient use with blaseRtools functions and to reduce dependencies.

Usage

Drerio
Drerio

Format

A BSgenome object:

Details

see data-raw/grcz11_objects.R

Source

UCSC danRer11 genome build

Human Gene Model

Description

Full Genome Model for humans. Based on GRCh38 genome build.

Usage

hg38_granges
hg38_granges

Format

A GRanges object:

Details

see data-raw/hg38_objects.R

Source

http://ftp.ensembl.org/pub/release-104/gff3/homo_sapiens/

Reduced Human Gene Model

Description

Similar to the Human Gene Model "hg38_granges" but with different metadata columns and only with 5', CDS, and 3' UTR ranges. Also has principle isoform data.

Usage

hg38_granges_reduced
hg38_granges_reduced

Format

A GRanges object:

Details

see data-raw/trace_objects.R

Source

http://ftp.ensembl.org/pub/release-104/gff3/homo_sapiens/

Genes to Remove from ScRNA-seq Analysis: Human

Description

A precompiled list of mitochondrial and ribosomal gene names to remove.

Usage

hg38_remove_genes
hg38_remove_genes

Format

A character vector

Details

See data-raw/remove_genes

Source

Ensembl

Hsapiens

Description

This is a copy of the Hsapiens object from BSgenome.Hsapiens.UCSC.hg38. It is here for more convenient use with blaseRtools functions and to reduce dependencies.

Usage

Hsapiens
Hsapiens

Format

A BSgenome object:

Details

see data-raw/hg38_objects.R

Source

UCSC hg38 genome build

Jaspar 2020 TF Motifs

Description

JASPAR 2020 TF Motif database in character string format. Mostly for internal use.

Usage

meme
meme

Format

A character vector.

Details

see data-raw/meme.R

Source

JASPAR2020_CORE_vertebrates_non-redundant_pfms_jaspar.txt

Genes to Remove from ScRNA-seq Analysis: Mouse

Description

A precompiled list of mitochondrial and ribosomal gene names to remove.

Usage

mm39_remove_genes
mm39_remove_genes

Format

A character vector

Details

See data-raw/remove_genes

Source

Ensembl

Broad Institute MSIGDB Gene Set Metadata

Description

Gene set metadata for the whole MSIGDB. Most gene sets are known by "STANDARD_NAME". You can filter the gene set list by supplying a named filter list to the bb_extract_msig function. The name of each list element should be one of the column names in this metadata table and the list element contents should be the values to filter for. Filtering works in an additive way, meaning if you supply a filter list with two elements it will extract gene sets passing filters 1 AND 2.

Usage

msigdb_geneset_metadata
msigdb_geneset_metadata

Format

A data frame with 32796 rows and 18 variables:

STANDARD_NAME: character
SYSTEMATIC_NAME: character
HISTORICAL_NAME: character
ORGANISM: character
PMID: character
AUTHORS: character
GEOID: character
EXACT_SOURCE: character
GENESET_LISTING_URL: character
EXTERNAL_DETAILS_URL: character
CHIP: character
CATEGORY_CODE: character
SUB_CATEGORY_CODE: character
CONTRIBUTOR: character
CONTRIBUTOR_ORG: character
DESCRIPTION_BRIEF: character
DESCRIPTION_FULL: character
TAGS: character

Details

Use with bb_extract_msig

Source

https://www.gsea-msigdb.org/gsea/msigdb/ accessed January 12 2022.

Broad Institute MSIGDB Gene Sets

Description

List of MSIGDB gene sets. Most gene sets are commonly known by "STANDARD_NAME". This is a unique value and is provided as the name for each of the elements in this list. If you want to select a single gene set by name you can use the usual R syntax for selecting named elements: msigdb_genesets$<gene set name>. If you want to select by other metadata features you should use bb_extract_msig.

Usage

msigdb_genesets
msigdb_genesets

Format

A list.

Details

Use with bb_extract_msig

Source

https://www.gsea-msigdb.org/gsea/msigdb/ accessed January 12 2022.

Vignette QC Results

Description

Standard bb_qc function output

Usage

vig_qc_res
vig_qc_res

Format

a list

Details

See blaseRtools scRNAseq Vignette

Source

vignette_cds

ScRNA-seq Vignette CDS

Description

A small example CDS for the blaseRtools vignette

Usage

vignette_cds
vignette_cds

Format

A cellDataSet object

Details

See blaseRtools scRNAseq Vignette

Source

https://www.10xgenomics.com/resources/datasets/500-human-pbm-cs-3-lt-v-3-1-chromium-controller-3-1-low-6-1-0 and https://www.10xgenomics.com/resources/datasets/500-human-pbm-cs-3-lt-v-3-1-chromium-x-3-1-low-6-1-0

BlaseRtools ScRNA-seq Vignette CDS Pseudobulk Results

Description

Pseudobulk header, results and qc plots.

Usage

vignette_pseudobulk_res
vignette_pseudobulk_res

Format

A list of pseudobulk results

Details

See blaseRtools scRNAseq Vignette

BlaseRtools ScRNA-seq Vignette CDS Top Markers

Description

Top Markers for the vignette CDS

Usage

vignette_top_markers
vignette_top_markers

Format

A data frame with 1100 rows and 11 variables:

gene_id: character Ensembl gene identifier
gene_short_name: character Gene Name
cluster_method: character Clustering Method
cell_group: character Cell Cluster or Partition
marker_score: double Composite score of fraction expressing and level of expression.
mean_expression: double Mean marker expression
fraction_expressing: double Fraction expressing
specificity: double Marker specificity
pseudo_R2: double Regression pseudo R squared
marker_test_p_value: double Wald test P value
marker_test_q_value: double q value

Details

See blaseRtools scRNAseq Vignette

Wordlist Hashtable

Description

Wordlist from the Grady Parts of Speech database. In order to reuse, 1.) Calculate the md5sum of the object you want to hash, 2.) Take the first 5 or more hex digits and fill them in this expression: as.integer(as.hexmode("<hex digits>") %% 226857) + 1. This will return an integer between 1 and 226857 which is the length of the wordlist hashtable. Then select the word according to the integer/index value using something like: wordlist %>% filter(index == integer) %>% pull(word).

Usage

wordhash
wordhash

Format

A data frame with 226857 rows and 2 variables:

index: integer
word: character

Details

see data-raw/wordhash.R for construction

Source

https://www.gutenberg.org/ebooks/3201

Zebrafish Gene Model

Description

This is a granges object containing the full genome model from the zfin database. Based on GRCz11 genome build.

Usage

zfin_granges
zfin_granges

Format

A GRanges object:

Details

see data-raw/grcz11_objects.R

Source

https://zfin.org/downloads/zfin_genes.gff3 release April 2018

Reduced Zebrafish Gene Model

Description

Similar to the Zebrafish Gene Model but with different metadata columns and only with 5', CDS and 3' UTR ranges. Also has principle isoform data.

Usage

zfin_granges_reduced
zfin_granges_reduced

Format

A GRanges object:

Details

see data-raw/trace_objects.R

Source

https://zfin.org/downloads/zfin_genes.gff3 release April 2018

Package 'blaseRdata'

Help Index

Genes to Remove from ScRNA-seq Analysis: Zebrafish

Description

Usage

Format

Details

Source

Zebrafish Gene TSS

Description

Usage

Format

Details

Drerio

Description

Usage

Format

Details

Source

Human Gene Model

Description

Usage

Format

Details

Source

Reduced Human Gene Model

Description

Usage

Format

Details

Source

Genes to Remove from ScRNA-seq Analysis: Human

Description

Usage

Format

Details

Source

Human Gene TSS

Description

Usage

Format

Details

Hsapiens

Description

Usage

Format

Details

Source

Jaspar 2020 TF Motifs

Description

Usage

Format

Details

Source

Genes to Remove from ScRNA-seq Analysis: Mouse

Description

Usage

Format

Details

Source

Broad Institute MSIGDB Gene Set Metadata

Description

Usage

Format

Details

Source

Broad Institute MSIGDB Gene Sets

Description

Usage

Format

Details

Source

Vignette QC Results

Description

Usage

Format

Details

Source

ScRNA-seq Vignette CDS

Description