Trace-Bisimulation Equivalence for Regression Detection in Non Deterministic Code-Generating Analytics Agents
Keywords:
Bisimulation Equivalence, Code-Generating Analytics Agents, Regression Testing, Analytical Dataflow Graphs, Non-Deterministic Program EquivalenceAbstract
Code-generating analytics agents, systems that translate natural language queries into executable analyticalprograms, have become essential components of enterprise data infrastructure. However, their non deterministic nature, producing structurally different but semantically equivalent programs across runs,renders conventional regression testing methods unreliable
References
Mark Chen et al., "Evaluating Large Language Models Trained on Code," arXiv preprint, 2021. Available: https://arxiv.org/pdf/2107.03374
Yujia Li et al., "Competition-Level Code Generation with AlphaCode," Science, 2022. Available: https://www.science.org/doi/10.1126/science.abq1158


