openclean_metanome.algorithm.base module

openclean_metanome.algorithm.base.run_workflow(workflow: flowserv.controller.serial.workflow.base.SerialWorkflow, arguments: Dict, df: pandas.core.frame.DataFrame, worker: Optional[Dict] = None, volume: Optional[Dict] = None, managers: Optional[Dict] = None, verbose: Optional[bool] = True)flowserv.controller.serial.workflow.result.RunResult

Run a given workflow representing a Metanome profiling algorithm on the given data frame.

Returns the run result. If execution of the Metanome algorithm fails a RuntimeError will be raised.

This implementation assumes that all algorithms operate on a single input file that contains a serialization of the data frame and that they all produce a single output file in Json format.

Parameters
  • workflow (flowserv.controller.serial.workflow.base.SerialWorkflow) – Serial workflow to run a Metanome profiling algorithm on a given data frame.

  • arguments (dict) – Dictionary of algorithm-specific input arguments.

  • df (pd.DataFrame) – Input data frame.

  • worker (dict, default=None) – Optional configuration for the main worker.

  • volume (dict, default=None) – Optional configuration for the volume that is associated with the main worker.

  • managers (dict, default=None) – Mapping of workflow step identifier to the worker that is used to execute them.

  • verbose (bool, default=True) – Output run logs if True.

Returns

Return type

flowserv.controller.serial.workflow.result.RunResult