openclean_metanome.config module

Configuration variables and helper methods for running Metanome algorihms in openclean.

openclean_metanome.config.CONTAINER(env: Optional[Dict] = None)str

Get the identifier of the Metanome container image from the environment variable

Parameters

env (dict, default=None) – Optional environment variables that override the system-wide settings, default=None

Returns

Return type

string

openclean_metanome.config.JARFILE(env: Optional[Dict] = None)str

Get path to the Metanome.jar file from the environment.

By default, the jar file is expected to be in the OS-specific user cache directory.

Parameters

env (dict, default=None) – Optional environment variables that override the system-wide settings, default=None

Returns

Return type

string

openclean_metanome.config.VOLUME(env: Optional[Dict] = None)Dict

Get specification for the volume that is associated with the worker that is used to execute the main algorithm step.

Parameters

env (dict, default=None) – Optional environment variables that override the system-wide settings, default=None

Returns

Return type

dict

openclean_metanome.config.WORKER(env: Optional[Dict] = None)Dict

Get specification for the worker that is used to execute the main algorithm step using the metanome wrapper Jar-file.

Parameters

env (dict, default=None) – Optional environment variables that override the system-wide settings, default=None

Returns

Return type

dict

openclean_metanome.config.read_config_obj(var: str, env: Dict)Dict

Read configuration object from a given environment variables.

If the variable is set and contains a dictionary as value that value is returned. Otherwise, it is assumed that the variable references a Json or Yaml file that contains the configuration object.

Parameters
  • var (string) – Name of the environment variable.

  • env (dict) – Dictionary representing the current environment settings.

Returns

Return type

dict