tidyms.filter.VariationFilter¶
- class VariationFilter(lb=0, ub=0.25, process_classes=None, robust=False, intraclass=True, verbose=False)¶
Remove features with low reproducibility.
The reproducibility of the features is evaluated using the Relative standard deviation of each feature in samples of a specific class or classes. By default, the QC samples are analyzed.
- Parameters:
- lbnumber between 0 and 1
Lower bound of acceptance
- ubnumber between 0 and 1
Upper bound of acceptance. Must be greater than lb.
- process_classes: List[str], optional
Classes used to evaluate the coefficient of variation. If None, list of classes is taken from the qc sample type from the DataContainer mapping attribute.
- robust: bool
If false uses the mean and standard deviation to compute the cv. Else, the cv is estimated using the MAD and the median of the feature, assuming a normal distribution.
- intraclass: bool
If True, the cv is computed for each class in process_classes and then the maximum value is compared against lb and ub. Else a global cv is computed for all classes in process_classes.
- verbose: bool
If True, prints a message
Constructor of the VariationFilter.