Zebra

Commit Graph

Author	SHA1	Message	Date
teor	188d06e7a1	change(state): Add state requests and support code for the `z_getsubtreesbyindex` RPC (#7408 ) * Make NoteCommitmentSubtreeIndex compatible with serde-based RPCs * Add a stub for z_getsubtreesbyindex * Define a GetSubtrees RPC response type * Reject invalid shielded pool names * Make limit optional * Define state request and response types for subtrees * Implement FromDisk for NoteCommitmentSubtreeIndex and add a round-trip test * Make subtrees compatible with round-trip proptests * Add finalized state subtree list methods and delete unused methods * Remove Arc from subtrees in zebra-chain * Remove Arc from subtrees in zebra-state and use BTreeMap * Implement subtree list lookups in the non-finalized state and delete unused methods * Implement consistent concurrent subtree read requests * Implement ToHex for sapling::Node * Implement ToHex for orchard::Node * Implement z_get_subtrees_by_index RPC * Check for the start_index from the non-finalized state * Remove an unused mut * Fix missing doc links * Fix RPC comments * Temporarily remove the z_get_subtrees_by_index RPC method	2023-09-03 22:18:41 +00:00
Marek	2ea994a19e	fix(state): Fix the deduplication of note commitment trees (#7379 ) * Log errors and panic if duplicate trees are found after the de-duplicate upgrade * Always check for duplicates, even if the state is already marked as upgraded * Minor doc fixes * Document ranges for `zs_delete_range` * Revert the comment for `sapling_tree` * Rearrange tree methods & fix their docs * Bump DATABASE_FORMAT_PATCH_VERSION from 0 to 1 * Remove the manual tree deletion at early heights * Add `skip_while` to `zs_range_iter` * Refactor the tree deduplication * Add comments to the pruning * Turn warnings into panics * Remove redundant checks These checks are superseded by `check_for_duplicate_trees` * Remove an edge case that ignored the last tree * Suggestion for Fix the deduplication of note commitment trees (#7391) Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com> --------- Co-authored-by: teor <teor@riseup.net> Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>	2023-08-28 22:59:07 +00:00
Arya	94d9155adb	change(state): Add note subtree index handling to zebra-state, but don't write them to the finalized state yet (#7334 ) * zebra-chain changes from the subtree-boundaries branch ```sh git checkout -b subtree-boundaries-zebra-chain main git checkout origin/subtree-boundaries zebra-chain git commit ``` * Temporarily populate new subtree fields with None - for revert This temporary commit needs to be reverted in the next PR. * Applies suggestions from code review * removes from_repr_unchecked methods * simplifies loop * adds subtrees to zebra-state * uses split_at, from_repr, & updates state-db-upgrades.md * Update book/src/dev/state-db-upgrades.md Co-authored-by: teor <teor@riseup.net> * renames partial_subtree to subtree_data * tests that subtree serialization format * adds raw data format serialization round-trip test * decrements minor version and skips inserting subtrees in db --------- Co-authored-by: teor <teor@riseup.net>	2023-08-28 08:50:31 +00:00
teor	62258d51da	0. Add note commitment subtree types to zebra-chain (#7371 ) * zebra-chain changes from the subtree-boundaries branch ```sh git checkout -b subtree-boundaries-zebra-chain main git checkout origin/subtree-boundaries zebra-chain git commit ``` * Temporarily populate new subtree fields with None - for revert This temporary commit needs to be reverted in the next PR. * Applies suggestions from code review * removes from_repr_unchecked methods * simplifies loop --------- Co-authored-by: arya2 <aryasolhi@gmail.com>	2023-08-28 00:48:16 +00:00
Marek	d8f5d6b6f1	change(state): Deduplicate note commitment trees stored in the finalized state (#7312 ) * Add support for deleting the trees * Prune the trees * Remove `Network` from `DiskWriteBatch` Removing the `Network` from `DiskWriteBatch` makes it easy to instantiate `DiskWriteBatch`es in `ZebraDb` that remove individual note commitment trees. The `Network` from `DiskWriteBatch` was used only for transparent addresses, so the refactor isn't large. After removing it from `DiskWriteBatch`, I passed it as a function argument instead. However, we should simplify the parameter lists because at least two functions have more than seven parameters now. * Support individual tree removal in `ZebraDb` * Refactor the tree removal task * Prune old comments * Remove redundant code * Batch the removals * delete ranges before relevant network upgrades * moves prev_tree inits * add iterator methods for reading note commitment trees * Sets up skeleton of sapling pipeline * Replaces .filter with .take_while Fills in pipeline Reuses zs_range_iter instead of repeating that code Updates logic to stop at initial tip height * uses std threads * delete_range excludes end key * fixes off by one bugs * Log warning when a send fails * Removes progress logs * Log join errors instead of panicking * Revert: Make the `db` field of `ZebraDb` private * Move `delete_range_sapling_tree` * Remove a redundant `else if` branch Rationale: The condition `n == 1` for the removed branch is true for a subset of values of `n` in the preceding condition `n >= 1`. * Use more specific error messages * Revert: Remove redundant methods for tree removal * Suggestions for Deduplicate note commitment trees stored in the finalized state (#7330) * Add TODOs to some `Height` methods * Add methods for deleting individual trees * Refactor the tasks for deleting trees --------- Co-authored-by: arya2 <aryasolhi@gmail.com>	2023-08-17 00:41:11 +00:00
Marek	57c9249141	change(state): Insert only the first tree in each series of identical trees into finalized state (#7266 ) * Pass ZebraDB to batch preparation * Dedup the insertion of Sapling trees into database * Dedup the insertion of Orchard trees into database * Update snapshots * Rename batch preparation of trees * Simplify the naming of note commitment trees * Correctly retrieve Sapling trees from fin state * Correctly retrieve Orchard trees from fin state * Simplify the naming of methods for Sprout trees * Simplify the naming of methods for Sapling trees * Simplify the naming of methods for Orchard trees * Reduce disk reads by caching trees. (#7276) * Bump the state minor version * Reset the state patch version * Simplify the preparation of genesis trees * Store the roots of the trees of the genesis block * Add the genesis roots to snapshots * fix(test): Don't include shielded data in genesis blocks (#7302) * fix(state): Fix marking format upgrades (#7304) --------- Co-authored-by: Arya <aryasolhi@gmail.com>	2023-08-09 00:32:27 +00:00
teor	5324e5afd2	add(tests): Add snapshot tests for sprout database formats (#7057 ) * Add methods for loading entire column families from the database * Add a method that loads all the sprout trees from the database * Add snapshot tests for sprout note commitment trees * Add round-trip proptests for tree root database serialization * Add a manual sprout note commitment tree database serialization snapshot test * Add tests for 1,2,4,8 note commitments in a tree * Remove redundant "rand" package rename in dependencies * Randomly cache roots rather than only caching even roots --------- Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>	2023-06-27 15:32:30 +00:00
Marek	1f1d04b547	change(state): Refactor the structure of finalizable blocks (#7035 ) * Add and use `FinalizableBlock` This commit adds `FinalizableBlock`, and uses it instead of `ContextuallyVerifiedBlockWithTrees` in `commit_finalized_direct()` * Use `ContextuallyVerifiedBlockWithTrees` This commit passes `ContextuallyVerifiedBlockWithTrees` instead of passing separate `finalized`, `history_tree` and `note_commitment_trees` when storing blocks in the finalized state. * Apply suggestions from code review Co-authored-by: teor <teor@riseup.net> * add docs to new methods * fix existing doc * rename `ContextuallyVerifiedBlockWithTrees` to `SemanticallyVerifiedBlockWithTrees` * Refactor docs * Refactor comments * Add missing docs, fix typo * Fix rustfmt --------- Co-authored-by: teor <teor@riseup.net> Co-authored-by: Alfredo Garcia <oxarbitrage@gmail.com> Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>	2023-06-27 08:58:14 +00:00
Marek	006c2ae42b	change(state): Refactor the structure of verified blocks (#7025 ) * Refactor `CheckpointVerifiedBlock` This commit turns `CheckpointVerifiedBlock` into a wrapper of `SemanticallyVerifiedBlock` since both structs have the same fields. * Refactor `ContextuallyVerifiedBlockWithTrees` This commit uses `SemanticallyVerifiedBlock` in `ContextuallyVerifiedBlockWithTrees` instead of `CheckpointVerifiedBlock`.	2023-06-21 16:58:11 +00:00
teor	355f1233f5	change(db): Make the first stable release forward-compatible with planned state changes (#6813 ) * Implement minor and patch database format versions * Log and update database format versions when opening database * Refactor the current list of column families into a constant * Open all available column families, including from future Zebra versions * Refactor note commitment tree lookups to go through the height methods * Make Sapling/Orchard note commitment tree lookup forwards compatible * Ignore errors reading column family lists from disk * Update format version comments and TODOs * Correctly log newly created database formats --------- Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>	2023-06-06 21:18:57 +00:00
Alfredo Garcia	eb07bb31d6	rename(state): Rename state verifiers and related code (#6762 ) * rename verifiers * rename `PreparedBlock` to `SemanticallyVerifiedBlock` * rename `CommitBlock` to `SemanticallyVerifiedBlock` * rename `FinalizedBlock` to `CheckpointVerifiedBlock` * rename `CommitFinalizedBlock` to `CommitCheckpointVerifiedBlock` * rename `FinalizedWithTrees` to `ContextuallyVerifiedBlockWithTrees` * rename `ContextuallyValidBlock` to `ContextuallyVerifiedBlock` * change some `finalized` variables or function arguments to `checkpoint_verified` * fix docs * document the difference between `CheckpointVerifiedBlock` and `ContextuallyVerifiedBlock` * fix doc links * apply suggestions to request Co-authored-by: Marek <mail@marek.onl> * apply suggestions to service Co-authored-by: Marek <mail@marek.onl> * apply suggestions to finalized_state.rs and write.rs Co-authored-by: Marek <mail@marek.onl> * fmt * change some more variable names * change a few missing generics * fix checkpoint log issue * rename more `prepared` vars `semantically_verified` * fix test regex * fix test regex 2 --------- Co-authored-by: Marek <mail@marek.onl>	2023-06-01 12:29:03 +00:00
Marek	b8712d9a1e	feat(state): Send treestate from non-finalized state to finalized state (#4721 ) * Add history trees for each height in non-fin state * Refactor formatting * Pass the treestate to the finalized state I created a new structure `FinalizedBlockWithTrees` that wraps the treestate and the finalized block. I did that because the original `FinalizedBlock` is `Eq`, but `HistoryTree` can't be `Eq`. This makes Zebra faster because: 1. The finalized state doesn't retrieve the treestate from the disk if the non-finalized state supplies it. 2.The finalized state doesn't recompute the treestate if the non-finalized state supplies it. * Check block commitment before updating hist tree * Store Sprout commitment trees in non-fin state * Send trees for the root block to fin-state When committing a block and sending the treestate from the non-finalized state to the finalized state, Zebra was sending trees that correspond to the tip block instead of trees that correspond to the root block of the best chain. This commit fixes that. * Refactor doc comments * Refactor block finalization Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>	2022-09-06 09:32:54 +00:00
teor	6ad445eb97	1. fix(perf): Run CPU-intensive state updates in parallel rayon threads (#4802 ) * Split disk reads from CPU-heavy Sprout interstitial tree cryptography * Improve anchor validation debugging and error messages * Work around a test data bug, and save some CPU * Remove redundant checks for empty shielded data * Skip generating unused interstitial treestates * Do disk fetches and quick checks, then CPU-heavy cryptography * Wrap HistoryTree in an Arc in the state * Run CPU-intensive chain validation and updates in parallel rayon threads * Refactor to prepare for parallel tree root calculations * Run finalized state note commitment tree root updates in parallel rayon threads * Update finalized state note commitment trees using parallel rayon threads * Fix a comment typo and add a TODO * Split sprout treestate fetch into its own function * Move parallel note commitment trees to zebra-chain * Re-calculate the tree roots in the same parallel batches * Do non-finalized note commitment tree updates in parallel threads * Update comments about note commitment tree rebuilds * Do post-fork tree updates in parallel threads * Add a TODO for parallel tree updates in tests * Fix broken intra-doc links * Clarify documentation for sprout treestates * Sort Cargo.toml dependencies	2022-07-22 12:19:11 -04:00
Marek	485bac819d	change(state): Wrap commitment trees into `Arc` (#4757 ) * Wrap Sprout note commitment trees into `Arc` * Remove a redundant comment * Rephrase a comment about chain forking * Remove a redundant comment The comment is not valid because Zebra uses `bridgetree::Frontier`s from the `incrementalmerkletree` crate to represent its note commitment trees. This `struct` does not support popping elements from the tree. * Wrap Sapling commitment trees into `Arc` * Remove unnecessary `as_ref`s * Wrap Orchard commitment trees into `Arc`	2022-07-15 10:39:41 +10:00
Alfredo Garcia	97fb85dca9	lint(clippy): add `unwrap_in_result` lint (#4667 ) * `unwrap_in_result` in zebra-chain crate * `unwrap_in_result` in zebra-script crate * `unwrap_in_result` in zebra-state crate * `unwrap_in_result` in zebra-consensus crate * `unwrap_in_result` in zebra-test crate * `unwrap_in_result` in zebra-network crate * `unwrap_in_result` in zebra-rpc crate * `unwrap_in_result` in zebrad crate * rustfmt * revert `?` and add exceptions * explain some panics better * move some lint positions * replace a panic with error * Fix rustfmt? Co-authored-by: teor <teor@riseup.net>	2022-06-28 06:22:07 +00:00
teor	2439bed3d2	2. fix(state): index spending transaction IDs for each address (#4355 ) * Make jobs that use cached state wait for state rebuilds * Run jobs that need cached state even if the rebuild was skipped * Fix missing dependencies And update a TODO * Split writing transaction indexes into transparent and shielded * Split writing transparent indexes into created and spent * Correctly populate spending address transaction ID indexes * Increment the database format to rebuild address tx ID indexes * Update non-finalized docs to prevent similar bugs * Fix a comment * Make jobs that use cached state wait for state rebuilds * Run jobs that need cached state even if the rebuild was skipped * Fix missing dependencies And update a TODO * refactor(ci): look for available disks instead of files changed This ensure that if the constants.rs file was changed, we search for disks available in the whole repository with the same state. If there's no disk available a rebuild is triggered depending the missing disk. And if there's a disk available, tests are run with this one. * fix(ci): lwd syncs needs to wait for zebra disk rebuild * docs(ci): use better comments on integration tests * fix(ci): we must authenticate to GCP to find disks * fix(ci): add needed permissions for google auth * fix(ci): the output needs to be echoed * imp(ci): reduce diff with main * fix(ci): remove redundant dependency Co-authored-by: teor <teor@riseup.net> * fix(ci): also add `false` to the JSON object output * fix(ci): hasty copy/paste * force a push event * fix(ci): standardize comments * fix(ci): run disk rebuilds if no disk was found * fix(ci): do not restrict on push * fix(ci): build on any event if a cached disk is not found * fix(ci): sync .patch file with changes on the workflow Co-authored-by: Gustavo Valverde <gustavo@iterativo.do>	2022-05-20 02:22:01 +00:00
teor	7faa6a26c5	2. feat(db): Add address balance indexes to the finalized state (#3963 ) * Add an empty balance_by_transparent_addr column family * Add an AddressBalanceLocation type for balance_by_transparent_addr * Add serialization for balance_by_transparent_addr types * Add round-trip tests for the new serialized types * Add missing round-trip and serialized equality tests * Add a network field to DiskWriteBatch * Refactor confusing all_utxos_spent_by_block argument It was actually just the UTXOs from the state spent by the block, excluding the UTXOs created and spent within the block. But now we need it to contain all the spent outputs, including the ones created by the block. * Read and update address balances in the finalized state * Update raw data snapshots for transparent address balances * Add test-only deserialization for transparent addresses * Add high-level snapshot test code for address balances * Add high-level snapshots for address balances * Increment the state version after NU5 testnet 2 rollback	2022-04-07 23:15:17 +00:00
teor	6aba60d657	1. feat(db): Store transactions in a separate database index, to improve query speed (#3934 ) * Implement disk serialization for block headers and transactions * Re-order column family initialization to match the design * Add new empty transaction column families * Split writing block header and transaction data * Re-order column families for consistency * Update write snapshots for transaction split * Use split block and transaction data when reading * Update snapshots to include genesis transaction hash location * Filter all prefix iterators to make sure they return the correct values * Test that the new transaction indexes are consistent * Add some cleanup TODOs * Increment the database format to version 15 * Remove unused fisk format impls for Block * Add a missing prefix extractor for transaction locations * Make the database generic over the thread mode * Replace prefix iteration with iteration from a key, and a filter Prefix iteration caused database hangs. * Manually iterate through transaction locations to re-create blocks Also: - re-write disk read API to avoid iterator hangs - move disk read API to ReadDisk - re-write impl rocksdb::AsColumnFamilyRef to a where clause, for consistency * Update the database version so it's larger than the NU5 testnet 2 version	2022-04-07 08:30:50 +00:00
Marek	38a2bcb042	feat(shielded): Store Sapling & Orchard note commitment trees in finalized and non-finalized state (#3818 ) * Query Sapling & Orchard trees by height in the finalized state * Add Sapling & Orchard trees to the non-finalized state * Add a TODO about concurrent read-only access to Sprout tree Co-authored-by: teor <teor@riseup.net> * Update the database format version * Keep only the most recent Sprout tree in the database * Check that the database returns empty trees for the genesis block * Assert that the database returns the highest trees * Document how to update insta snapshots * Add note commitment tree insta snapshot tests * Add comments about cached tree roots in snapshots * Add snapshot data for sapling and orchard trees Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com> Co-authored-by: teor <teor@riseup.net>	2022-03-15 05:18:18 +00:00
teor	6fb426ef93	8. refactor(state): allow shared read access to the finalized state database (#3846 ) * Move database read methods to a new ZebraDb wrapper type * Rename struct fields	2022-03-11 20:23:32 +00:00
teor	f8a4021c07	refactor(state): split database access into modules by Zebra types (#3617 ) Also split the genesis block check from the genesis note commitment trees.	2022-02-28 22:21:03 +00:00

21 Commits