Skip to content

feat: add Cloud Build pipeline and evaluation configuration for Cloud SQL MySQL extension#119

Merged
omkargaikwad23 merged 4 commits intomainfrom
release-please-evalbench-ci
May 4, 2026
Merged

feat: add Cloud Build pipeline and evaluation configuration for Cloud SQL MySQL extension#119
omkargaikwad23 merged 4 commits intomainfrom
release-please-evalbench-ci

Conversation

@omkargaikwad23
Copy link
Copy Markdown
Contributor

Description

Automated multi-turn EvalBench testing pipeline and Cloud Build CI/CD configuration for the Cloud SQL MySQL extension repository.

Key Changes

  • cloudbuild.yaml: Implements dynamic PR fetching; limits test execution to PRs with autorelease: pending or evals labels; configures dynamic, traceable version mapping (pr-$_PR_NUMBER-evals) for BigQuery metrics.
  • evals/dataset.json: Defines 4 multi-turn scenarios tailored to MySQL-specific tools (list_databases, list_tables, list_table_fragmentation, list_table_stats, get_system_metrics) using dynamic environment placeholders.
  • Core Configs (evals/): Establishes run_config.yaml, model_config.yaml, and gemini_2.5_pro_model.yaml to track token count consumption, turn count, and end-to-end latency using Vertex AI judges and centralized BigQuery reporting.
  • substitute_env.py: Dynamically injects environment variables into model, run, and dataset configuration files at runtime.

@omkargaikwad23 omkargaikwad23 added the autorelease: pending Release please needs to do its work on this. label May 4, 2026
@omkargaikwad23 omkargaikwad23 requested review from a team as code owners May 4, 2026 06:26
@github-actions github-actions Bot requested a review from dolphin1999 May 4, 2026 06:26
@omkargaikwad23 omkargaikwad23 merged commit fdf3cc6 into main May 4, 2026
10 checks passed
@omkargaikwad23 omkargaikwad23 deleted the release-please-evalbench-ci branch May 4, 2026 08:52
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

autorelease: pending Release please needs to do its work on this.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants