Skip to content

Add E2E homogeneous graph store training example#514

Open
kmontemayor2-sc wants to merge 45 commits intomainfrom
kmonte/setup-gs-hom-trainer
Open

Add E2E homogeneous graph store training example#514
kmontemayor2-sc wants to merge 45 commits intomainfrom
kmonte/setup-gs-hom-trainer

Conversation

@kmontemayor2-sc
Copy link
Collaborator

@kmontemayor2-sc kmontemayor2-sc commented Feb 25, 2026

Scope of work done

Add example for graph store homogeneous training, and update splitter slightly to allow tuple-edge types to be passed in.

Will follow up with heterogeneous loop when ready :)

Adding new deployment/configs/e2e_glt_gs_train_resource_config.yaml so we can have GS trainer as a temp workaround

Where is the documentation for this feature?: N/A

Did you add automated tests or write a test plan?

Updated Changelog.md? NO

Ready for code review?: NO

@kmontemayor2-sc
Copy link
Collaborator Author

/e2e_test

@github-actions
Copy link
Contributor

github-actions bot commented Mar 3, 2026

GiGL Automation

@ 16:26:40UTC : 🔄 E2E Test started.

@ 18:01:01UTC : ❌ Workflow failed.
Please check the logs for more details.

@kmontemayor2-sc
Copy link
Collaborator Author

Sorry, can be hard to see pure GH comments if they're not on a file :/

Let's look at https://console.cloud.google.com/vertex-ai/pipelines/locations/us-central1/runs/hom-cora-sup-test-on-20260303-010325?project=external-snap-ci-github-gigl for a colocated test, and https://console.cloud.google.com/vertex-ai/pipelines/locations/us-central1/runs/hom-cora-sup-gs-test-on-20260303-010325?project=external-snap-ci-github-gigl for a graph store test, both on cora.

They take ~ the same time (43 minutes and change). But I don't see model metrics for either run, see the below for the logs I see in the colocated pipeline/

[KFP Executor 2026-03-03 01:37:09,114 INFO]: Fetching eval metrics from: gs://gigl-cicd-perm/hom_cora_sup_test_on_20260303_010325/trainer/models/trainer_eval_metrics.json

[KFP Executor 2026-03-03 01:37:09,192 WARNING]: Error loading metrics file: 404 GET https://storage.googleapis.com/download/storage/v1/b/gigl-cicd-perm/o/hom_cora_sup_test_on_20260303_010325%2Ftrainer%2Fmodels%2Ftrainer_eval_metrics.json?alt=media: No such object: gigl-cicd-perm/hom_cora_sup_test_on_20260303_010325/trainer/models/trainer_eval_metrics.json: ('Request failed with status code', 404, 'Expected one of', <HTTPStatus.OK: 200>, <HTTPStatus.PARTIAL_CONTENT: 206>), evaluation could have been skipped

is it possible we broke the metrics for these pipelines at some point? I'd rather have that be a separate fix if possible.

@kmontemayor2-sc
Copy link
Collaborator Author

/e2e_test

@github-actions
Copy link
Contributor

github-actions bot commented Mar 3, 2026

GiGL Automation

@ 18:07:23UTC : 🔄 E2E Test started.

@ 19:34:50UTC : ❌ Workflow failed.
Please check the logs for more details.

@kmontemayor2-sc
Copy link
Collaborator Author

/e2e_test

@github-actions
Copy link
Contributor

github-actions bot commented Mar 3, 2026

GiGL Automation

@ 19:49:01UTC : 🔄 E2E Test started.

@ 21:12:51UTC : ❌ Workflow failed.
Please check the logs for more details.

@kmontemayor2-sc
Copy link
Collaborator Author

/e2e_test

@github-actions
Copy link
Contributor

github-actions bot commented Mar 3, 2026

GiGL Automation

@ 21:44:39UTC : 🔄 E2E Test started.

@ 23:12:47UTC : ❌ Workflow failed.
Please check the logs for more details.

@kmontemayor2-sc kmontemayor2-sc marked this pull request as ready for review March 4, 2026 00:11
@kmontemayor2-sc kmontemayor2-sc added this pull request to the merge queue Mar 4, 2026
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Mar 4, 2026
@kmontemayor2-sc kmontemayor2-sc added this pull request to the merge queue Mar 4, 2026
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Mar 4, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants