Checks whether an anndata object is in scrattch.taxonomy format and returns a log-file if not
checkTaxonomy.Rd
Any breaking issues will cause this function to return FALSE. And potential issues will still return TRUE but will output a warning to stderr. All messages will get returned to the log file.
Usage
checkTaxonomy(
AIT.anndata,
log.file.path = getwd(),
print.messages = FALSE,
pull_ensembl = FALSE,
validate_percent_ensembl = 60,
pull_assay = FALSE,
pull_cl = FALSE,
validate_percent_cl = 80,
pull_ncbitaxon = FALSE,
pull_uberon = FALSE,
pull_brain_atlases = FALSE,
pull_hancestro = FALSE,
pull_mondo = FALSE,
...
)
Arguments
- AIT.anndata
A reference taxonomy anndata object to be tested
- log.file.path
The directory to output the logfile of errors and warnings (if any; default getwd())
- print.messages
Print messages only to a log file (FALSE; default) or also to the screen (TRUE)
- pull_ensembl
If FALSE (default) loads the list of Ensembl terms from NCBI; otherwise, pulls from NCBI (VERY slow)
- validate_percent_ensembl
Percent of entries that must correspond to valid Ensembl terms to validate
- pull_assay
If FALSE (default) loads the list of EFO terms (assays); otherwise pulls from EBI (VERY slow)
- pull_cl
If FALSE (default) loads a preset list of CL terms from OBO; otherwise pulls from OBO
- validate_percent_cl
Percent of entries that must correspond to valid CL terms to validate
- pull_ncbitaxon
If FALSE (default) loads the list of species with gene information at NCBI; otherwise pulls from OBO (VERY slow)
- pull_uberon
If FALSE (default) loads the list of anatomical regions from UBERON; otherwise pulls from OBO
- pull_brain_atlases
If FALSE (default) loads the list of brain atlas ids; otherwise pulls from brain-bican
- pull_hancestro
If FALSE (default) loads the list of HANCESTRO terms; otherwise, pulls from OBO
- pull_mondo
If FALSE (default) loads the list of MONDO terms; otherwise, pulls from OBO
- ...
Additional parameters for ._validate_schema_elements and .validate_var_elements (can be ignored in most cases)