Ocean Color Science Software

ocssw V2022
MDN.transformers.utils Namespace Reference

Functions

def get_unique_features (features, threshold=0.025, chunksize=10, _bottom=False)
 
def spearmanr (x, y)
 

Function Documentation

get_unique_features()

def MDN.transformers.utils.get_unique_features(features, threshold=0.025, chunksize=10, _bottom=False)
Mask out features that are near duplicates of one or more other
    features, based on correlation. The threshold parameter
    controls how similar two features may be while both are still kept.
    Roughly based on https://stackoverflow.com/a/66326102

Definition at line 7 of file utils.py.
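
The general idea of correlation-based masking can be illustrated with a minimal sketch. This is not the code in utils.py: the function name mask_near_duplicates is hypothetical, the chunksize and _bottom parameters are left out, and reading threshold as a distance from perfect correlation (1 - |r|) is an assumption made only for this example.

```python
import numpy as np

def mask_near_duplicates(features, threshold=0.025):
    """Hypothetical helper: boolean mask over the columns of `features`.

    False marks a column that is a near duplicate of an earlier kept column.
    Assumption: two columns count as near duplicates when their absolute
    Pearson correlation exceeds 1 - threshold.
    """
    features = np.asarray(features, dtype=float)
    # Absolute column-by-column correlation; atleast_2d covers the one-column case.
    corr = np.atleast_2d(np.abs(np.corrcoef(features, rowvar=False)))
    n = corr.shape[0]
    keep = np.ones(n, dtype=bool)
    for i in range(n):
        if not keep[i]:
            continue  # already masked as a duplicate of an earlier column
        # Mask every later column that is almost perfectly correlated with column i.
        dup = corr[i, i + 1:] > 1.0 - threshold
        keep[i + 1:][dup] = False
    return keep

# Usage: the third column is a linear rescaling of the first.
a = np.arange(50.0)
features = np.column_stack([a, np.sin(a), 2.0 * a + 1.0])
mask = mask_near_duplicates(features)
unique_features = features[:, mask]
```

The greedy pass keeps the first column of each correlated group, so the result depends on column order. Constant columns produce NaN correlations, which never exceed the cutoff and are therefore kept.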

spearmanr()

def MDN.transformers.utils.spearmanr(x, y)
SciPy's spearmanr is too slow for large matrices, so a custom implementation is used instead.
    Source: https://stackoverflow.com/questions/52371329/fast-spearman-correlation-between-two-pandas-dataframes/59072032#59072032

Definition at line 58 of file utils.py.
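
A common way to speed up Spearman correlation on matrices is to rank each column once and then compute Pearson correlation on the ranks with a single matrix product. The sketch below shows that approach under stated assumptions: the names average_ranks and spearman_matrix are hypothetical, inputs are assumed to be 2-D arrays with samples in rows, and this is not necessarily how utils.py implements it.

```python
import numpy as np

def average_ranks(a):
    """Rank each column of a 2-D array (1-based); ties share their average rank."""
    a = np.asarray(a, dtype=float)
    ranks = np.empty_like(a)
    for j in range(a.shape[1]):
        _, inverse, counts = np.unique(a[:, j], return_inverse=True, return_counts=True)
        # Last rank of each tie group minus half the group's extra width.
        group_rank = np.cumsum(counts) - (counts - 1) / 2.0
        ranks[:, j] = group_rank[inverse.reshape(-1)]
    return ranks

def spearman_matrix(x, y):
    """Spearman correlation of every column of x with every column of y.

    Returns an array of shape (x.shape[1], y.shape[1]).
    """
    rx = average_ranks(x)
    ry = average_ranks(y)
    # Pearson correlation on the ranks: center, then normalize the cross product.
    rx -= rx.mean(axis=0)
    ry -= ry.mean(axis=0)
    cross = rx.T @ ry
    norms = np.sqrt((rx ** 2).sum(axis=0))[:, None] * np.sqrt((ry ** 2).sum(axis=0))[None, :]
    return cross / norms

# Usage: one monotonically increasing and one monotonically decreasing column.
x = np.array([[1.0], [2.0], [3.0], [4.0]])
y = np.array([[1.0, 4.0], [4.0, 3.0], [9.0, 2.0], [16.0, 1.0]])
rho = spearman_matrix(x, y)
```

Ranking is done once per column rather than once per column pair, which is where the saving comes from. A constant column has zero rank variance, so its correlations come out as NaN.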