Refine physical backup and restore documentation#346
Conversation
Clarify the definition of physical backup and restore processes, update warnings regarding encryption key management, and improve overall readability.
Pull request overview
Refines the “Physical backups and restores” documentation to better explain the physical restore flow and strengthen guidance around encryption key management during restores.
Changes:
- Rewords the physical backup/restore definitions and removes duplicated content.
- Clarifies that the restore involves temporary/intermediate ``mongod`` starts and updates the log-file behavior wording.
- Updates the encryption admonition from "important" to "warning" and expands key-rotation / key-retention guidance.
| During the restore, the ``pbm-agents`` temporarily start the ``mongod`` nodes using the WiredTiger storage options retrieved from the backup's metadata. The logs for these starts are saved to the ``pbm.restore.log`` file inside the ``dbPath``. Upon successful restore, this file is deleted. However, it remains for debugging if the restore were to fail. |
| During the restore, the ``pbm-agents`` start a temporary non-user reachable instance of each ``mongod`` nodes using the WiredTiger storage options retrieved from the backup's metadata. The logs for these "intermediate" starts are saved to the ``pbm.restore.log`` file inside the ``dbPath``. Upon successful restore, these files are deleted. However, they remain for debugging if the restore were to fail. |
Grammar/clarity issues in this sentence make the restore flow hard to follow: "instance of each mongod nodes" mixes singular/plural, and the text says logs are saved to a single "pbm.restore.log" file but later refers to "these files" being deleted. Consider rephrasing to consistently describe whether there is one temporary instance/log per node and use consistent singular/plural (e.g., "each mongod node" and "log file(s)").
| During the restore, the ``pbm-agents`` start a temporary non-user reachable instance of each ``mongod`` nodes using the WiredTiger storage options retrieved from the backup's metadata. The logs for these "intermediate" starts are saved to the ``pbm.restore.log`` file inside the ``dbPath``. Upon successful restore, these files are deleted. However, they remain for debugging if the restore were to fail. |
| During the restore, the ``pbm-agents`` start a temporary, non-user-reachable instance for each ``mongod`` node using the WiredTiger storage options retrieved from the backup's metadata. The logs for these "intermediate" starts are saved to a ``pbm.restore.log`` file inside each node's ``dbPath``. Upon successful restore, these log files are deleted. However, they remain for debugging if the restore fails. |
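The log-file lifecycle described in the suggested wording can be sketched as follows. This is a toy illustration of the documented behavior, not PBM's actual code; the function name `cleanup_restore_log` is hypothetical:

```python
import os
import tempfile

def cleanup_restore_log(db_path: str, restore_ok: bool) -> bool:
    """Sketch of the documented behavior: the per-node pbm.restore.log
    is deleted after a successful restore but kept for debugging when
    the restore fails. Returns True if the log file remains."""
    log_file = os.path.join(db_path, "pbm.restore.log")
    if restore_ok and os.path.exists(log_file):
        os.remove(log_file)          # successful restore: log removed
    return os.path.exists(log_file)  # failed restore: log kept

# Simulate both outcomes in a throwaway dbPath.
with tempfile.TemporaryDirectory() as db_path:
    open(os.path.join(db_path, "pbm.restore.log"), "w").close()
    print(cleanup_restore_log(db_path, restore_ok=True))   # False: deleted
with tempfile.TemporaryDirectory() as db_path:
    open(os.path.join(db_path, "pbm.restore.log"), "w").close()
    print(cleanup_restore_log(db_path, restore_ok=False))  # True: kept
```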
| | Advantages | Disadvantages | |
| | ------------------------------ | ------------------------------- | |
| |- Faster backup and restore speed <br> - Recommended for big, multi-terabyte datasets <br> - No database overhead | - The backup size is bigger than for logical backups due to data fragmentation extra cost of keeping data and indexes in appropriate data structures <br> - Extra manual operations are required after the restore <br> - Point-in-time recovery requires manual operations | Sharded clusters and non-sharded replica sets | |
| |- Faster backup and restore speed <br> - Recommended for big, multi-TB datasets <br> - No overhead at database level | - The backup size could be bigger than for logical backups due to data fragmentation, and the cost of storing the files of each index <br> - Extra manual post-restore steps are required | |
The doc uses both "multi-terabyte" (line 25) and the abbreviated "multi-TB" (line 29) to describe the same scale. Standardizing on one term in this section would improve readability and avoid potential ambiguity for readers unfamiliar with the abbreviation.
| |- Faster backup and restore speed <br> - Recommended for big, multi-TB datasets <br> - No overhead at database level | - The backup size could be bigger than for logical backups due to data fragmentation, and the cost of storing the files of each index <br> - Extra manual post-restore steps are required | |
| |- Faster backup and restore speed <br> - Recommended for big, multi-terabyte datasets <br> - No overhead at database level | - The backup size could be bigger than for logical backups due to data fragmentation, and the cost of storing the files of each index <br> - Extra manual post-restore steps are required | |
| !!! warning |
| Make sure that you know which master encryption key was used and keep it safe, as this key is required for the restore. |
| The master encryption key is not stored as part of the backup. Make sure that you know which encryption key was used at the time of each backup (and back it up separately). If using a key management system, make sure to enable key versioning (Vault does this by default). Do NOT delete old key versions as these are still required for restoring "old" backups that happened before key rotation. |
In this warning, the terminology switches from "master encryption key" to the more generic "encryption key", which can be ambiguous (e.g., readers may not realize you mean the master key and/or its specific version at backup time). Consider using consistent terminology throughout and explicitly calling out that the restore requires the same master key version/ID that was active when the backup was taken.
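The key-retention point the warning makes can be shown with a toy model. This is not PBM's or any KMS's implementation; the dictionaries and the function `key_for_restore` are hypothetical. Each backup records which master-key version was active when it was taken, so deleting an old key version makes every backup taken under it unrestorable:

```python
# Toy model: a versioned KMS keeps every master-key version,
# and each backup is tied to the version active at backup time.
key_versions = {1: "old-master-key", 2: "rotated-master-key"}
backups = {"2024-01-05T12:00:00Z": 1, "2024-06-01T12:00:00Z": 2}

def key_for_restore(backup_name: str) -> str:
    """Return the master-key version required to restore this backup."""
    version = backups[backup_name]
    if version not in key_versions:
        # This is what happens if old key versions are deleted after rotation.
        raise KeyError(f"key version {version} was deleted; backup unrestorable")
    return key_versions[version]

# The pre-rotation backup still needs the pre-rotation key version.
print(key_for_restore("2024-01-05T12:00:00Z"))  # old-master-key
print(key_for_restore("2024-06-01T12:00:00Z"))  # rotated-master-key
```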