feat: Mutations Batcher shim by gkevinzheng · Pull Request #1309 · googleapis/python-bigtable

gkevinzheng · 2026-03-16T16:22:21Z

Changes Made:

Replaced mutations batcher implementation with one based off of the data client.
Reworked unit tests.
Added additional system tests.

gemini-code-assist · 2026-03-16T16:22:46Z

Summary of Changes

Hello, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request refactors the MutationsBatcher to act as a shim, delegating its core batching responsibilities to an internal batcher provided by the data client. This change streamlines the batching mechanism, leveraging the data client's capabilities for more robust and efficient mutation operations, while maintaining the existing public interface.

Highlights

Mutations Batcher Implementation: The underlying implementation of the Mutations Batcher has been replaced with one based on the new data client, simplifying the batching logic.
Unit Tests Reworked: Existing unit tests have been refactored to align with the new batcher implementation and improve testability.
System Tests Added: New system tests have been introduced to cover exception handling and manual flushing scenarios for the Mutations Batcher.

Changelog

google/cloud/bigtable/batcher.py
- Removed deprecated internal batching classes and methods, including _MutationsBatchQueue, _BatchInfo, _FlowControl, _flush_async, and related helpers.
- Updated imports to utilize MutationsExceptionGroup and RowMutationEntry from the google.cloud.bigtable.data module.
- Modified the MutationsBatcher constructor to initialize and manage an internal batcher instance from table._table_impl.mutations_batcher.
- Implemented _init_batcher and _close_batcher methods to control the lifecycle and exception handling of the internal batcher.
- Updated the mutate method to append mutations directly to the new internal batcher.
- Refactored the flush method to close and re-initialize the internal batcher, ensuring pending mutations are processed.
- Adjusted the close method to properly shut down the internal batcher and collect any exceptions.
google/cloud/bigtable/data/_async/client.py
- Removed unused Callable and status_pb2 imports.
google/cloud/bigtable/data/_async/mutations_batcher.py
- Removed unused Callable and Optional imports.
tests/system/v2_client/test_data_api.py
- Added test_mutations_batcher_exceptions to verify the batcher's error propagation and handling.
- Added test_mutations_batcher_manual_flush to confirm correct behavior when manually flushing mutations.
tests/unit/v2_client/test_batcher.py
- Refactored unit tests to use _setup_batcher and _atexit_mock fixtures for improved test setup and isolation.
- Updated constructor tests to verify the correct initialization parameters are passed to the internal batcher.
- Modified tests for mutate_rows, mutate, flush, and close to assert interactions with the mocked internal batcher operations.
- Updated exception handling tests to reflect the new MutationsExceptionGroup and FailedMutationEntryError types.
- Removed obsolete test helper classes and functions related to the old batcher implementation.

Activity

No specific activity has been recorded for this pull request yet.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for GitHub and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

The pull request successfully refactors the MutationsBatcher to leverage the new data client's batching capabilities. This significantly simplifies the batcher.py file by removing a substantial amount of custom queueing, flow control, and asynchronous flushing logic. The updated unit and system tests adequately cover the new implementation, including exception handling and manual flushing behavior. The changes align with the goal of migrating to the new data client.

google/cloud/bigtable/batcher.py

daniel-sanche · 2026-03-18T22:40:26Z

google/cloud/bigtable/batcher.py

-        self.flow_control = _FlowControl(
-            max_mutations=MAX_OUTSTANDING_ELEMENTS,
-            max_mutation_bytes=MAX_OUTSTANDING_BYTES,
-        )


It looks like you're discarding the flow control? You should be able to pass these through to the data client batcher

daniel-sanche · 2026-03-18T22:42:47Z

google/cloud/bigtable/batcher.py

+        )
+        self._batcher._user_batch_completed_callback = (
+            self._user_batch_completed_callback
+        )


It looks like you're only storing self._flush_interval, self._flush_count, and self._max_row_bytes so you have the context to re-build a new batcher later?

This pattern is fine, but I think functool.partial can make this kind of thing cleaner, letting you condensing all the state into single nullable function

self._batcher_build_fn = partial(self._build_batcher, callback, interval=flush_interval, ...) def _build_batcher(self, callback, **kwargs): batcher = self.table._table_impl.mutations_batcher(**kwargs) batcher._user_batch_completed_callback = callback return batcher

And then every time you need a new batcher, you just call self._batcher = self._batcher_build_fn()

(It could be simplified more to get rid of the extra _build_batcher function if we make _user_batch_completed_callback into an unadvertised kwarg in the data client, but I'm not sure if that's worth it)

daniel-sanche · 2026-03-18T22:57:31Z

google/cloud/bigtable/data/_async/mutations_batcher.py

            maxlen=self._exception_list_limit
        )
-        self._user_batch_completed_callback = None
+        self._user_batch_completed_callback: Optional[


we should probably add a comment here describing that this is currently just used by the shim

daniel-sanche · 2026-03-18T23:05:13Z

google/cloud/bigtable/batcher.py

@@ -406,9 +194,8 @@ def close(self):
        :raises:
            * :exc:`.batcherMutationsBatchError` if there's any error in the mutations.


I think the exc name in the docstrings might be broken?

daniel-sanche · 2026-03-18T23:10:25Z

google/cloud/bigtable/batcher.py

+            for error in exc_group.exceptions:
+                # Return the cause of the FailedMutationEntryError to the user,
+                # as this might be more what they're expecting.
+                self._exceptions.put(error.__cause__)


This might be a good time to address this TODO?

The code seems to assume it will be FailedMutationEntryError, so we should make the types agree

daniel-sanche · 2026-03-18T23:12:06Z

google/cloud/bigtable/batcher.py

+        except MutationsExceptionGroup as exc_group:
+            for error in exc_group.exceptions:
+                # Return the cause of the FailedMutationEntryError to the user,
+                # as this might be more what they're expecting.


Can this comment be improved, to be more definitive? Maybe something like # Unpack the root cause from FailedMutationEntryError wrapper

feat: Mutations Batcher shim

cd2cc45

gkevinzheng requested a review from a team as a code owner March 16, 2026 16:22

product-auto-label bot added size: l Pull request size is large. api: bigtable Issues related to the googleapis/python-bigtable API. labels Mar 16, 2026

gkevinzheng requested a review from daniel-sanche March 16, 2026 16:22

gemini-code-assist bot reviewed Mar 16, 2026

View reviewed changes

google/cloud/bigtable/batcher.py Show resolved Hide resolved

mypy

f47e606

daniel-sanche requested changes Mar 18, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Mutations Batcher shim#1309

feat: Mutations Batcher shim#1309
gkevinzheng wants to merge 2 commits intov3_stagingfrom
mutations-batcher

gkevinzheng commented Mar 16, 2026

Uh oh!

gemini-code-assist bot commented Mar 16, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

daniel-sanche Mar 18, 2026

Uh oh!

daniel-sanche Mar 18, 2026

Uh oh!

daniel-sanche Mar 18, 2026

Uh oh!

daniel-sanche Mar 18, 2026

Uh oh!

daniel-sanche Mar 18, 2026

Uh oh!

daniel-sanche Mar 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		@@ -406,9 +194,8 @@ def close(self):
		:raises:
		* :exc:`.batcherMutationsBatchError` if there's any error in the mutations.

Conversation

gkevinzheng commented Mar 16, 2026

Uh oh!

gemini-code-assist bot commented Mar 16, 2026

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

daniel-sanche Mar 18, 2026

Choose a reason for hiding this comment

Uh oh!

daniel-sanche Mar 18, 2026

Choose a reason for hiding this comment

Uh oh!

daniel-sanche Mar 18, 2026

Choose a reason for hiding this comment

Uh oh!

daniel-sanche Mar 18, 2026

Choose a reason for hiding this comment

Uh oh!

daniel-sanche Mar 18, 2026

Choose a reason for hiding this comment

Uh oh!

daniel-sanche Mar 18, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants