geostep.analyzer.DiDAnalyzer

class geostep.analyzer.DiDAnalyzer(config: AnalyzerConfig | None = None, performance_config: ParallelConfig | None = None)[source]

Analyzer for Difference-in-Differences (DiD) analysis.

Methods

__init__([config, performance_config])

Initialize DiDAnalyzer with configuration.

analyze(df, **kwargs)

Template method implementing the analysis workflow.

check_balance(**kwargs)

Perform advanced balance check on covariates for DiD analysis.

enable_monitoring([enabled])

Enable or disable performance monitoring.

get_metrics()

Get performance and execution metrics.

monitor_operation(operation_name[, ...])

Context manager for monitoring operations.

post_process_result(result)

Post-process analysis results.

prepare_data(**kwargs)

Prepare data for DiD analysis with performance optimizations.

set_metrics_collector(collector)

Set the metrics collector for this instance.

validate_inputs(df, **kwargs)

Validate inputs for DiD analysis.

Attributes

__init__(config: AnalyzerConfig | None = None, performance_config: ParallelConfig | None = None)[source]

Initialize DiDAnalyzer with configuration.

Parameters:
  • config (AnalyzerConfig, optional) – Configuration for analyzer behavior.

  • performance_config (ParallelConfig, optional) – Configuration for performance optimizations.

validate_inputs(df: DataFrame, **kwargs: Any) None[source]

Validate inputs for DiD analysis.

prepare_data(**kwargs)

Prepare data for DiD analysis with performance optimizations.

check_balance(**kwargs)

Perform advanced balance check on covariates for DiD analysis.

Parameters:
  • df (pd.DataFrame) – Full experimental dataframe.

  • geo_col (str) – Geographic identifier column.

  • assignment_col (str) – Assignment column name.

  • covariates (List[str]) – List of covariate columns to check.

  • **kwargs (Any) – Additional parameters passed to advanced_balance_check.

Returns:

Comprehensive balance assessment.

Return type:

BalanceCheckResult