| scran_variances
    Model per-gene variance in expression | 
Variance modelling for single-cell expression data. More...
| Classes | |
| struct | ChooseHighlyVariableGenesOptions | 
| Options for choose_highly_variable_genes().  More... | |
| struct | FitVarianceTrendOptions | 
| Options for fit_variance_trend().  More... | |
| struct | FitVarianceTrendResults | 
| Results of fit_variance_trend().  More... | |
| struct | FitVarianceTrendWorkspace | 
| Workspace for fit_variance_trend().  More... | |
| struct | ModelGeneVariancesBlockedBuffers | 
| Buffers for model_gene_variances_blocked().  More... | |
| struct | ModelGeneVariancesBlockedResults | 
| Results of model_gene_variances_blocked().  More... | |
| struct | ModelGeneVariancesBuffers | 
| Buffers for model_gene_variances()and friends.  More... | |
| struct | ModelGeneVariancesOptions | 
| Options for model_gene_variances()and friends.  More... | |
| struct | ModelGeneVariancesResults | 
| Results of model_gene_variances().  More... | |
| Functions | |
| template<typename Stat_ , typename Bool_ > | |
| void | choose_highly_variable_genes (const std::size_t n, const Stat_ *const statistic, Bool_ *const output, const ChooseHighlyVariableGenesOptions &options) | 
| template<typename Bool_ = char, typename Stat_ > | |
| std::vector< Bool_ > | choose_highly_variable_genes (const std::size_t n, const Stat_ *const statistic, const ChooseHighlyVariableGenesOptions &options) | 
| template<typename Index_ , typename Stat_ > | |
| std::vector< Index_ > | choose_highly_variable_genes_index (const Index_ n, const Stat_ *const statistic, const ChooseHighlyVariableGenesOptions &options) | 
| template<typename Float_ > | |
| void | fit_variance_trend (const std::size_t n, const Float_ *const mean, const Float_ *const variance, Float_ *const fitted, Float_ *const residuals, FitVarianceTrendWorkspace< Float_ > &workspace, const FitVarianceTrendOptions &options) | 
| template<typename Float_ > | |
| FitVarianceTrendResults< Float_ > | fit_variance_trend (const std::size_t n, const Float_ *const mean, const Float_ *const variance, const FitVarianceTrendOptions &options) | 
| template<typename Value_ , typename Index_ , typename Block_ , typename Stat_ > | |
| void | model_gene_variances_blocked (const tatami::Matrix< Value_, Index_ > &mat, const Block_ *const block, const ModelGeneVariancesBlockedBuffers< Stat_ > &buffers, const ModelGeneVariancesOptions &options) | 
| template<typename Value_ , typename Index_ , typename Stat_ > | |
| void | model_gene_variances (const tatami::Matrix< Value_, Index_ > &mat, ModelGeneVariancesBuffers< Stat_ > buffers, const ModelGeneVariancesOptions &options) | 
| template<typename Stat_ = double, typename Value_ , typename Index_ > | |
| ModelGeneVariancesResults< Stat_ > | model_gene_variances (const tatami::Matrix< Value_, Index_ > &mat, const ModelGeneVariancesOptions &options) | 
| template<typename Stat_ = double, typename Value_ , typename Index_ , typename Block_ > | |
| ModelGeneVariancesBlockedResults< Stat_ > | model_gene_variances_blocked (const tatami::Matrix< Value_, Index_ > &mat, const Block_ *const block, const ModelGeneVariancesOptions &options) | 
Variance modelling for single-cell expression data.
| void scran_variances::choose_highly_variable_genes | ( | const std::size_t | n, | 
| const Stat_ *const | statistic, | ||
| Bool_ *const | output, | ||
| const ChooseHighlyVariableGenesOptions & | options ) | 
| Stat_ | Type of the variance statistic. | 
| Bool_ | Type to be used as a boolean. | 
| n | Number of genes. | |
| [in] | statistic | Pointer to an array of length ncontaining the per-gene variance statistics. This is typically the residuals frommodel_gene_variances(). | 
| [out] | output | Pointer to an array of length n. On output, thei-th entry istrueif thei-th gene is to be retained andfalseotherwise. | 
| options | Further options. | 
| std::vector< Bool_ > scran_variances::choose_highly_variable_genes | ( | const std::size_t | n, | 
| const Stat_ *const | statistic, | ||
| const ChooseHighlyVariableGenesOptions & | options ) | 
| Bool_ | Type to be used as a boolean. | 
| Stat_ | Type of the variance statistic. | 
| n | Number of genes. | |
| [in] | statistic | Pointer to an array of length ncontaining the per-gene variance statistics. This is typically the residuals frommodel_gene_variances(). | 
| options | Further options. | 
n, indicating whether each gene is to be retained. | std::vector< Index_ > scran_variances::choose_highly_variable_genes_index | ( | const Index_ | n, | 
| const Stat_ *const | statistic, | ||
| const ChooseHighlyVariableGenesOptions & | options ) | 
| Index_ | Type of the indices. | 
| Stat_ | Type of the variance statistic. | 
| n | Number of genes. | |
| [in] | statistic | Pointer to an array of length ncontaining the per-gene variance statistics. This is typically the residuals frommodel_gene_variances(). | 
| options | Further options. | 
n. | void scran_variances::fit_variance_trend | ( | const std::size_t | n, | 
| const Float_ *const | mean, | ||
| const Float_ *const | variance, | ||
| Float_ *const | fitted, | ||
| Float_ *const | residuals, | ||
| FitVarianceTrendWorkspace< Float_ > & | workspace, | ||
| const FitVarianceTrendOptions & | options ) | 
Fit a trend to the per-feature variances against the means, both of which are typically computed from log-normalized expression data. We use a LOWESS smoother in several steps:
| Float_ | Floating-point type of the statistics. | 
| n | Number of features. | |
| [in] | mean | Pointer to an array of length n, containing the means for all features. | 
| [in] | variance | Pointer to an array of length n, containing the variances for all features. | 
| [out] | fitted | Pointer to an array of length n, to store the fitted values. | 
| [out] | residuals | Pointer to an array of length n, to store the residuals. | 
| workspace | Collection of temporary data structures. This can be re-used across multiple fit_variance_trend()calls. | |
| options | Further options. | 
| FitVarianceTrendResults< Float_ > scran_variances::fit_variance_trend | ( | const std::size_t | n, | 
| const Float_ *const | mean, | ||
| const Float_ *const | variance, | ||
| const FitVarianceTrendOptions & | options ) | 
Overload of fit_variance_trend() that allocates the output vectors.
| Float_ | Floating-point type of the statistics. | 
| n | Number of features. | |
| [in] | mean | Pointer to an array of length n, containing the means for all features. | 
| [in] | variance | Pointer to an array of length n, containing the variances for all features. | 
| options | Further options. | 
| void scran_variances::model_gene_variances_blocked | ( | const tatami::Matrix< Value_, Index_ > & | mat, | 
| const Block_ *const | block, | ||
| const ModelGeneVariancesBlockedBuffers< Stat_ > & | buffers, | ||
| const ModelGeneVariancesOptions & | options ) | 
Model the per-feature variances from a log-expression matrix with blocking. The mean and variance of each gene is computed separately for all cells in each block, and a separate trend is fitted to each block to obtain residuals (see model_gene_variances()). This ensures that sample and batch effects do not confound the variance estimates.
We also compute the average of each statistic across blocks, using the weighting strategy specified in ModelGeneVariancesOptions::block_weight_policy. The average residual is particularly useful for feature selection with choose_highly_variable_genes().
| Value_ | Data type of the matrix. | 
| Index_ | Integer type of the row/column indices. | 
| Block_ | Integer type of the block IDs. | 
| Stat_ | Floating-point type of the output statistics. | 
| mat | Matrix of expression values, typically after normalization and log-transformation. Rows should be genes while columns should be cells. | |
| [in] | block | Pointer to an array of length equal to the number of cells. Each entry should be a 0-based block identifier in \([0, B)\) where \(B\) is the total number of blocks. blockcan also be anullptr, in which case all cells are assumed to belong to the same block. | 
| [out] | buffers | Collection of pointers of arrays in which to store the output statistics. The length of ModelGeneVariancesBlockedResults::per_blockshould be equal to the number of blocks. | 
| options | Further options. | 
| void scran_variances::model_gene_variances | ( | const tatami::Matrix< Value_, Index_ > & | mat, | 
| ModelGeneVariancesBuffers< Stat_ > | buffers, | ||
| const ModelGeneVariancesOptions & | options ) | 
Model the per-gene variances as a function of the mean in single-cell expression data. We compute the mean and variance for each gene and fit a trend to the variances with respect to the means using fit_variance_trend(). We assume that most genes at any given abundance are not highly variable, such that the fitted value of the trend is interpreted as the "uninteresting" variance - this is mostly attributed to technical variation like sequencing noise, but can also represent constitutive biological noise like transcriptional bursting. Under this assumption, the residual can be treated as a measure of biologically interesting variation. Genes with large residuals can then be selected for downstream analyses, e.g., with choose_highly_variable_genes().
| Value_ | Data type of the matrix. | 
| Index_ | Integer type of the row/column indices. | 
| Stat_ | Floating-point type of the output statistics. | 
| mat | Matrix of expression values, typically after normalization and log-transformation. Rows should be genes while columns should be cells. | 
| buffers | Collection of buffers in which to store the computed statistics. | 
| options | Further options. | 
| ModelGeneVariancesResults< Stat_ > scran_variances::model_gene_variances | ( | const tatami::Matrix< Value_, Index_ > & | mat, | 
| const ModelGeneVariancesOptions & | options ) | 
Overload of model_gene_variances() that allocates space for the output statistics.
| Stat_ | Floating-point type of the output statistics. | 
| Value_ | Data type of the matrix. | 
| Index_ | Integer type of the row/column indices. | 
| mat | Matrix of expression values, typically after normalization and log-transformation. Rows should be genes while columns should be cells. | 
| options | Further options. | 
| ModelGeneVariancesBlockedResults< Stat_ > scran_variances::model_gene_variances_blocked | ( | const tatami::Matrix< Value_, Index_ > & | mat, | 
| const Block_ *const | block, | ||
| const ModelGeneVariancesOptions & | options ) | 
Overload of model_gene_variances_blocked() that allocates space for the output statistics.
| Stat_ | Floating-point type of the output statistics. | 
| Value_ | Data type of the matrix. | 
| Index_ | Integer type of the row/column indices. | 
| Block_ | Integer type of the block IDs. | 
| mat | Matrix of expression values, typically after normalization and log-transformation. Rows should be genes while columns should be cells. | |
| [in] | block | Pointer to an array of length equal to the number of cells, containing 0-based block identifiers. This may also be a nullptrin which case all cells are assumed to belong to the same block. | 
| options | Further options. | 
ModelGeneVariancesOptions::compute_average = true.