Scope gemini runs by dataset and enforce output_path format #1843

rohitkumarbhagat · 2026-01-19T13:07:32Z

Prefix gemini run folders with the dataset output basename and scope run artifacts (logs, counters)
under .datacommons/runs/<gemini_run_id>/.
Update prompts and docs to reference run-scoped logs and backups.
Require output_path to be non-empty and in <dir>/<prefix> format, with tests covering run id pathing and
invalid output paths.
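
As a rough illustration of the run-scoped layout described above, the sketch below derives a run directory from an output_path. The helper name `build_run_dir`, the `working_dir` parameter, and the timestamp format are assumptions made for this example and are not taken from the PR diff; only the `.datacommons/runs/<gemini_run_id>/` location and the basename prefix come from the description.

```python
import os
from datetime import datetime, timezone


def build_run_dir(output_path: str, working_dir: str = ".") -> str:
    """Return a run-scoped artifact directory for the given output_path.

    Hypothetical helper: the name, the working_dir parameter, and the
    timestamp format are assumptions, not the PR's actual code.
    """
    # "out/india_census" -> "india_census"
    dataset_prefix = os.path.basename(output_path)
    timestamp = datetime.now(timezone.utc).strftime("%Y%m%d_%H%M%S")
    # The run id starts with the dataset output basename.
    gemini_run_id = f"{dataset_prefix}_gemini_{timestamp}"
    # Logs and counters for this run would live under this directory.
    return os.path.join(working_dir, ".datacommons", "runs", gemini_run_id)
```

With this shape, two datasets processed from the same working directory end up in separate, easily identifiable run folders.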

Scope logs to per-run directory

Add coverage for run id prefix

gemini-code-assist · 2026-01-19T13:07:53Z

Summary of Changes

Hello @rohitkumarbhagat, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly enhances the organization and robustness of the Gemini import process. By introducing run-specific directories for all generated artifacts and enforcing a structured format for output paths, it aims to simplify debugging, improve traceability of runs, and prevent common configuration errors.

Highlights

Artifact Organization: Gemini run artifacts (logs, counters) are now organized into run-specific directories under .datacommons/runs/<gemini_run_id>/, prefixed by the dataset output basename.
Output Path Validation: The output_path configuration must now be non-empty and follow the <dir>/<prefix> format, which catches malformed paths before a run starts.
Documentation and Prompts Update: Documentation and internal prompts have been updated to reflect the new structured logging and backup locations, guiding users to the correct places for debugging.
New Unit Tests: New unit tests have been added to cover the run ID pathing and the validation of output_path formats, ensuring the robustness of the changes.
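
The output_path rule in the highlights above could be checked along these lines. This is a minimal sketch, not the validation code from pvmap_generator.py: the function name `validate_output_path` and the error messages are hypothetical.

```python
import os
from typing import Optional


def validate_output_path(output_path: Optional[str]) -> str:
    """Check that output_path is non-empty and shaped like <dir>/<prefix>.

    Hypothetical helper written for illustration only.
    """
    if not output_path or not output_path.strip():
        raise ValueError("output_path must be a non-empty string")
    cleaned = output_path.strip()
    # os.path.split("out/india") -> ("out", "india")
    directory, prefix = os.path.split(cleaned)
    if not directory or not prefix:
        raise ValueError(
            f"output_path must look like <dir>/<prefix>, got {output_path!r}")
    return cleaned
```

Under these assumptions, "out/india" is accepted, while None, "", "india" (no directory) and "out/" (no prefix) are rejected with a ValueError.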

¹ Review the Privacy Notices, Generative AI Prohibited Use Policy, and Terms of Service, and learn how to configure Gemini Code Assist in GitHub. Gemini can make mistakes, so double check it and use code with caution.

gemini-code-assist

Code Review

This pull request effectively scopes Gemini runs by dataset and enforces a stricter format for output_path. The changes are well-structured, and the updates to documentation and prompts are consistent with the new functionality. The addition of tests for the new validation logic is also a great improvement.

I've found one potential bug that could lead to a TypeError and have suggested a fix. I've also recommended an additional test case to cover this scenario. Overall, this is a solid contribution that improves the robustness and organization of the agentic import tool.
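
The review does not quote the failing line, so the following is only a guess at the failure mode: standard os.path functions raise TypeError when handed None, and a guard placed before them turns that into a clearer error. The helper `dataset_prefix` and the test class are hypothetical names used for this sketch.

```python
import os
import unittest


def dataset_prefix(output_path):
    """Hypothetical helper: validate before calling any os.path function."""
    if not output_path:
        raise ValueError("output_path must be set")
    return os.path.basename(output_path)


class DatasetPrefixTest(unittest.TestCase):

    def test_unguarded_none_raises_type_error(self):
        # Without a guard, os.path functions reject None outright.
        with self.assertRaises(TypeError):
            os.path.basename(None)

    def test_guarded_none_raises_value_error(self):
        # With the guard, callers get an explicit validation error instead.
        with self.assertRaises(ValueError):
            dataset_prefix(None)
```

A test of this kind documents the None case explicitly, which is what the recommended additional test case is meant to cover.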

tools/agentic_import/pvmap_generator.py

tools/agentic_import/pvmap_generator_test.py

Add test for None output_path

rohitkumarbhagat · 2026-01-19T13:35:43Z

/gemini review

gemini-code-assist

Code Review

This pull request effectively scopes Gemini run artifacts by dataset, prefixing run folders with the dataset's output basename. The changes are consistently applied across the codebase, including updates to prompts, documentation, and shell scripts to reference the new run-scoped paths. A key improvement is the enforcement of a <dir>/<prefix> format for output_path, which is backed by solid validation and new unit tests.

My review includes one suggestion to combine validation checks for the output_path to improve code conciseness. Overall, this is a well-executed change that enhances the organization and traceability of run artifacts.

tools/agentic_import/pvmap_generator.py

rohitkumarbhagat added 2 commits January 19, 2026 12:32

Prefix gemini run folders

8dcb869

Scope logs to per-run directory

Validate output path format

a8ff236

Add coverage for run id prefix

rohitkumarbhagat requested a review from ajaits January 19, 2026 13:07

gemini-code-assist bot reviewed Jan 19, 2026

tools/agentic_import/pvmap_generator.py (outdated, resolved)

tools/agentic_import/pvmap_generator_test.py (resolved)

Validate output_path null

6232967

Add test for None output_path

gemini-code-assist bot reviewed Jan 19, 2026

tools/agentic_import/pvmap_generator.py (outdated, resolved)

Simplify output path validation

65767df

ajaits reviewed Jan 20, 2026

tools/agentic_import/pvmap_generator.py (outdated, resolved)

ajaits approved these changes Jan 20, 2026

rohitkumarbhagat added 2 commits January 20, 2026 08:54

Add random suffix to gemini run id

5c8b2a5

Merge remote-tracking branch 'upstream/master' into gemini-run-folder…

e4743fd

…-dataset-prefix

rohitkumarbhagat enabled auto-merge (squash) January 20, 2026 08:57

rohitkumarbhagat merged commit d36dbf2 into datacommonsorg:master Jan 20, 2026
9 checks passed

rohitkumarbhagat deleted the gemini-run-folder-dataset-prefix branch January 20, 2026 09:10
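
The commit "Add random suffix to gemini run id" (5c8b2a5) is listed in the timeline without its diff. One plausible way to build such an id is sketched below; the function name `make_gemini_run_id`, the timestamp format, and the six-character hex suffix are assumptions for illustration rather than the merged implementation.

```python
import secrets
from datetime import datetime, timezone


def make_gemini_run_id(dataset_prefix: str) -> str:
    """Build a run id: <dataset_prefix>_<UTC timestamp>_<random hex>.

    Hypothetical format; the real id layout is not shown in the PR page.
    """
    timestamp = datetime.now(timezone.utc).strftime("%Y%m%d_%H%M%S")
    # token_hex(3) yields 6 hex characters, so two runs started in the
    # same second are very unlikely to share an id.
    suffix = secrets.token_hex(3)
    return f"{dataset_prefix}_{timestamp}_{suffix}"
```

A random suffix of this kind keeps run folders distinct when several runs for the same dataset start at nearly the same time.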
