openclean_metanome.config module¶
Configuration variables and helper methods for running Metanome algorihms in openclean.
- openclean_metanome.config.CONTAINER(env: Optional[Dict] = None) → str¶
Get the identifier of the Metanome container image from the environment variable
- Parameters
env (dict, default=None) – Optional environment variables that override the system-wide settings, default=None
- Returns
- Return type
string
- openclean_metanome.config.JARFILE(env: Optional[Dict] = None) → str¶
Get path to the Metanome.jar file from the environment.
By default, the jar file is expected to be in the OS-specific user cache directory.
- Parameters
env (dict, default=None) – Optional environment variables that override the system-wide settings, default=None
- Returns
- Return type
string
- openclean_metanome.config.VOLUME(env: Optional[Dict] = None) → Dict¶
Get specification for the volume that is associated with the worker that is used to execute the main algorithm step.
- Parameters
env (dict, default=None) – Optional environment variables that override the system-wide settings, default=None
- Returns
- Return type
dict
- openclean_metanome.config.WORKER(env: Optional[Dict] = None) → Dict¶
Get specification for the worker that is used to execute the main algorithm step using the metanome wrapper Jar-file.
- Parameters
env (dict, default=None) – Optional environment variables that override the system-wide settings, default=None
- Returns
- Return type
dict
- openclean_metanome.config.read_config_obj(var: str, env: Dict) → Dict¶
Read configuration object from a given environment variables.
If the variable is set and contains a dictionary as value that value is returned. Otherwise, it is assumed that the variable references a Json or Yaml file that contains the configuration object.
- Parameters
var (string) – Name of the environment variable.
env (dict) – Dictionary representing the current environment settings.
- Returns
- Return type
dict