Zebra

Commit Graph

Author	SHA1	Message	Date
teor	6f8f4d8987	Provide recent syncer response lengths as a watch channel (#2602 ) * Minimal recent sync lengths implementation Also includes metrics and logging, to make diagnosing bugs easier. * Add logging to check what happens when Zebra reaches the chain tip * Add tests for recent sync lengths - initially empty - pruned to correct length - newest entries go first * Drop a redundant `/` from a Cargo.lock URL This seems to be a nightly or beta Rust change, but hopefully stable just accepts it. * Use metrics histograms to avoid overwriting values * Add detailed syncer monitoring dashboard * Increase the recent sync length to 4 This length makes it easier to distinguish between temporary and sustained errors/syncs. Co-authored-by: Janito Vaqueiro Ferreira Filho <janito.vff@gmail.com>	2021-08-19 23:16:16 +00:00
Janito Vaqueiro Ferreira Filho	4c4dbfe7cd	Reject connections from outdated peers (#2519 ) * Simplify state service initialization in test Use the test helper function to remove redundant code. * Create `BestTipHeight` helper type This type abstracts away the calculation of the best tip height based on the finalized block height and the best non-finalized chain's tip. * Add `best_tip_height` field to `StateService` The receiver endpoint is currently ignored. * Return receiver endpoint from service constructor Make it available so that the best tip height can be watched. * Update finalized height after finalizing blocks After blocks from the queue are finalized and committed to disk, update the finalized block height. * Update best non-finalized height after validation Update the value of the best non-finalized chain tip block height after a new block is committed to the non-finalized state. * Update finalized height after loading from disk When `FinalizedState` is first created, it loads the state from persistent storage, and the finalized tip height is updated. Therefore, the `best_tip_height` must be notified of the initial value. * Update the finalized height on checkpoint commit When a checkpointed block is commited, it bypasses the non-finalized state, so there's an extra place where the finalized height has to be updated. * Add `best_tip_height` to `Handshake` service It can be configured using the `Builder::with_best_tip_height`. It's currently not used, but it will be used to determine if a connection to a remote peer should be rejected or not based on that peer's protocol version. * Require best tip height to init. `zebra_network` Without it the handshake service can't properly enforce the minimum network protocol version from peers. Zebrad obtains the best tip height endpoint from `zebra_state`, and the test vectors simply use a dummy endpoint that's fixed at the genesis height. * Pass `best_tip_height` to proto. ver. negotiation The protocol version negotiation code will reject connections to peers if they are using an old protocol version. An old version is determined based on the current known best chain tip height. * Handle an optional height in `Version` Fallback to the genesis height in `None` is specified. * Reject connections to peers on old proto. versions Avoid connecting to peers that are on protocol versions that don't recognize a network update. * Document why peers on old versions are rejected Describe why it's a security issue above the check. * Test if `BestTipHeight` starts with `None` Check if initially there is no best tip height. * Test if best tip height is max. of latest values After applying a list of random updates where each one either sets the finalized height or the non-finalized height, check that the best tip height is the maximum of the most recently set finalized height and the most recently set non-finalized height. * Add `queue_and_commit_finalized` method A small refactor to make testing easier. The handling of requests for committing non-finalized and finalized blocks is now more consistent. * Add `assert_block_can_be_validated` helper Refactor to move into a separate method some assertions that are done before a block is validated. This is to allow moving these assertions more easily to simplify testing. * Remove redundant PoW block assertion It's also checked in `zebra_state::service::check::block_is_contextually_valid`, and it was getting in the way of tests that received a gossiped block before finalizing enough blocks. * Create a test strategy for test vector chain Splits a chain loaded from the test vectors in two parts, containing the blocks to finalize and the blocks to keep in the non-finalized state. * Test committing blocks update best tip height Create a mock blockchain state, with a chain of finalized blocks and a chain of non-finalized blocks. Commit all the blocks appropriately, and verify that the best tip height is updated. Co-authored-by: teor <teor@riseup.net>	2021-08-08 23:52:52 +00:00
teor	74e155ff9f	Spelling: gossipped -> gossiped (#2119 )	2021-05-07 13:01:11 +02:00
teor	7e2c3a2fc7	Clarify a duplicate log message	2021-04-21 23:59:29 -04:00
teor	24f1b9bad1	Document the Inbound service in the start module (#1653 )	2021-01-29 22:19:06 +10:00
Alfredo Garcia	486e55104a	create Downloads for Inbound	2020-11-25 10:55:44 -08:00
Henry de Valence	e9c847bbd7	zebrad: avoid a borrow in the ChainSync future	2020-11-17 14:56:27 -08:00
Henry de Valence	253bab042e	sync: add a concurrency limit for block downloads	2020-10-26 12:05:35 -07:00
Henry de Valence	65e0c22fbe	state: don't pre-buffer the service There's no reason to return a pre-Buffer'd service (there's no need for internal access to the state service, as in zebra-network), but wrapping it internally removes control of the buffer size from the caller.	2020-10-26 12:05:35 -07:00
Henry de Valence	cab96aa1a8	zebrad: clarify config help text (#1194 )	2020-10-22 15:03:01 +10:00
Alfredo Garcia	21ad6ffc47	Reverse displayed endianness of transaction and block hashes (#1171 ) * Reverse displayed endianness of transaction and block hashes * fix zebra-checkpoints utility for new hash order * Stop using "zebrad revhex" in zebrad-hash-lookup * Rebuild checkpoint lists in new hash order This change also adds additional checkpoints to the end of each list. * Replace TransactionHash with transaction::Hash This change should have been made in #905, but we missed Debug impls and some docs. Co-authored-by: Ramana Venkata <vramana@users.noreply.github.com> Co-authored-by: teor <teor@riseup.net>	2020-10-22 07:54:02 +10:00
Henry de Valence	55f46967b2	zebrad: serve blocks from Inbound service The original version of this commit ran into https://github.com/rust-lang/rust/issues/64552 again. Thanks to @yaahc for suggesting a workaround (using futures combinators to avoid writing an async block).	2020-09-18 18:34:25 -07:00
Henry de Valence	170f588ffb	network: document load-shedding behavior This was part of the original design and is described in the Connection internals, but we never documented it externally.	2020-09-18 18:34:25 -07:00
Henry de Valence	1d0ebf89c6	zebrad: move seed command into inbound component Remove the seed command entirely, and make the behavior it provided (responding to `Request::Peers`) part of the ordinary functioning of the start command. The new `Inbound` service should be expanded to handle all request types.	2020-09-18 18:34:25 -07:00
Henry de Valence	1d3892e1dc	network: rename alias to BoxError This is shorter and consistent with Tower (which is why we use it in the first place).	2020-09-18 18:34:25 -07:00
teor	b1e1291f45	Log inbound peer requests at debug Logging at info was a bit too verbose. Also add a short log message.	2020-09-10 09:46:53 -07:00
Henry de Valence	9b6e66c1b9	zebrad: rename Syncer to ChainSync This name clarifies what is being synced and avoids an agent-noun construction.	2020-09-10 09:45:52 -07:00
Henry de Valence	0bc79686b8	zebrad: move sync into components module. Part of #1030.	2020-09-10 09:45:52 -07:00
teor	adafe1d189	Restart sync after the first failed ObtainTips The ObtainTips retry was redundant. The timeout wasn't much shorter, but it made the code and sync logic more complicated.	2020-09-09 15:35:09 -07:00
teor	2a68ef5acb	Update the peerset buffer size and sync timeout Also add a bunch of comments and documentation for network-constrained nodes, and for testnet.	2020-09-08 12:44:33 -07:00
teor	b062a682b0	Refactor "waiting for pending blocks" log	2020-09-08 12:44:33 -07:00
teor	e6e859dce2	Tweak sync timeouts * increase the EWMA default and decay * increase the block download retries * increase the request and block download timeouts * increase the sync timeout	2020-09-08 12:44:33 -07:00
teor	ce12d4dadc	Add timeouts for tip responses and block verify tasks	2020-09-08 12:44:33 -07:00
teor	379ce5c1b8	Retry obtain and extend tips on failure	2020-09-08 12:44:33 -07:00
teor	48497d4857	Ignore sync errors when the block is already verified (#980 ) * Ignore sync errors when the block is already verified If we get an error for a block that is already in our state, we don't need to restart the sync. It was probably a duplicate download. Also: Process any ready tasks before reset, so the logs and metrics are up to date. (But ignore the errors, because we're about to reset.) Improve sync logging and metrics during the download and verify task. * Remove duplicate hashes in logs Co-authored-by: Jane Lusby <jlusby42@gmail.com> * Log the sync hash span at warn level Co-authored-by: Jane Lusby <jlusby42@gmail.com>	2020-09-04 08:13:00 +10:00
teor	437549d8e9	Always drop the final hash in peer responses (#991 ) To workaround a zcashd bug that squashes responses together.	2020-09-04 08:09:34 +10:00
teor	c770daa51f	If the first ExtendTips hash is bad, discard it and re-check (#992 )	2020-09-04 08:08:19 +10:00
Alfredo Garcia	5485f4429a	Add config path to acceptance tests (#946 ) * add and apply config mode to get_child * remove option to read config from current directory * remove argument from get_child	2020-09-03 13:13:23 -07:00
Jane Lusby	ffdec0cb23	Remove in-memory state service (#974 ) * Remove in-memory state service * make the config compatible with toml again * checkpoint commit to see how much I still have to revert * back to the starting point... * remove unused dependency * reorganize error handling a bit * need to make a new color-eyre release now * reorder again because I have problems * remove unnecessary helpers * revert changes to config loading * add back missing space * Switch to released color-eyre version * add back missing newline again... * improve error message on unix when terminated by signal * add context to last few asserts in acceptance tests * instrument some of the helpers * remove accidental extra space * try to make this compile on windows * reorg platform specific code * hide on_disk module and fix broken link	2020-09-01 12:39:04 -07:00
teor	3fdfcb3179	fix: remove old tips that are behind new tips This change makes sync less reliant on the exact order of ObtainTips and ExtendTips responses.	2020-09-01 11:42:48 -04:00
teor	78201b456d	feature: Implement checkpoint_sync for checkpoint verification * add CheckpointList::new_up_to(limit: NetworkUpgrade) * if checkpoint_sync is false, limit checkpoints to Sapling * update tests for CheckpointList and chain::init	2020-08-24 15:34:46 +10:00
teor	b8e8d4f548	fix: Remove some deeply-nested instrument spans Closes #923.	2020-08-20 14:52:39 -04:00
Henry de Valence	103b663c40	chain: rename BlockHeight to block::Height	2020-08-17 11:46:34 -07:00
Henry de Valence	61dea90e2f	chain: rename BlockHeaderHash to block::Hash This is the first in a sequence of changes that change the block:: items to not include Block as a prefix in their name, in accordance with the Rust API guidelines.	2020-08-17 11:46:34 -07:00
Henry de Valence	948b067808	chain: move Network, NetworkUpgrade to parameters Also, avoid using star-imports of the enum variants, which pollutes the namespace.	2020-08-17 11:46:34 -07:00
Henry de Valence	0d1f56ad2f	chain: remove utils module A catch-all utils module can really easily slip into being a place to stash miscellaneous functions that don't really belong anywhere in particular.	2020-08-17 11:46:34 -07:00
Henry de Valence	a79ce97957	Fix sync algorithm. (#887 ) * checkpoint: reject older of duplicate verification requests. If we get a duplicate block verification request, we should drop the older one in favor of the newer one, because the older request is likely to have been canceled. Previously, this code would accept up to four duplicate verification requests, then fail all subsequent ones. * sync: add a timeout layer to block requests. Note that if this timeout is too short, we'll bring down the peer set in a retry storm. * sync: restart syncing on error Restart the syncing process when an error occurs, rather than ignoring it. Restarting means we discard all tips and start over with a new block locator, so we can have another chance to "unstuck" ourselves. * sync: additional debug info * sync: handle lookahead limit correctly. Instead of extracting all the completed task results, the previous code pulled results out until there were fewer tasks than the lookahead limit, then stopped. This meant that completed tasks could be left until the limit was exceeded again. Instead, extract all completed results, and use the number of pending tasks to decide whether to extend the tip or wait for blocks to finish. * network: add debug instrumentation to retry policy * sync: instrument the spawned task * sync: streamline ObtainTips/ExtendTips logic & tracing This change does three things: 1. It aligns the implementation of ObtainTips and ExtendTips so that they use the same deduplication method. This means that when debugging we only have one deduplication algorithm to focus on. 2. It streamlines the tracing output to not include information already included in spans. Both obtain_tips and extend_tips have their own spans attached to the events, so it's not necessary to add Scope: prefixes in messages. 3. It changes the messages to be focused on reporting the actual events rather than the interpretation of the events (e.g., "got genesis hash in response" rather than "peer could not extend tip"). The motivation for this change is that when debugging, the interpretation of events is already known to be incorrect, in the sense that the mental model of the code (no bug) does not match its behavior (has bug), so presenting minimally-interpreted events forces interpretation relative to the actual code. * sync: hack to work around zcashd behavior * sync: localize debug statement in extend_tips * sync: change algorithm to define tips as pairs of hashes. This is different enough from the existing description that its comments no longer apply, so I removed them. A further chunk of work is to change the sync RFC to document this algorithm. * sync: reduce block timeout * state: add resource limits for sled Closes #888 * sync: add a restart timeout constant * sync: de-pub constants	2020-08-12 16:48:01 -07:00
Henry de Valence	299afe13df	zebra-network tweaks. (#877 ) * network: move gossiped peer selection logic into address book. * network: return BoxService from init. * zebrad: add note on why we truncate thegossiped peer list Co-authored-by: Jane Lusby <jlusby42@gmail.com> * Remove unused .rustfmt.toml Many of these options are never actually loaded by our CI because of a channel mismatch, where they're not applied on stable but only on nightly (see the logs from a rustfmt job). This means that we can get different settings when running `cargo fmt` on the nightly and stable channels, which was causing a CI failure on this PR. Reverting back to the default rustfmt settings avoids this problem and keeps us in line with upstream rustfmt. There's no loss to us since we were using the defaults anyways. Co-authored-by: Jane Lusby <jlusby42@gmail.com>	2020-08-11 13:07:44 -07:00
teor	2550c44d48	Make sync ignore known hashes (#853 ) * fix: Handle known ObtainTips correctly enumerate never returns a value beyond the end of the vector. * fix: Ignore known tips in ExtendTips Some peers send us known tips when we try to extend. * fix: Ignore known hashes when downloading Despite all our other checks, we still end up downloading some hashes multiple times. * fix: Increase the number of retries The old sync code relied on duplicate block fetches to make progress, but the last few commits have removed some of those duplicates. Instead, just retry the fetches that fail. * fix: Tweak comments Co-authored-by: Jane Lusby <jlusby42@gmail.com> * fix: Cleanup the state_contains interface in Sync * Fix brackets Oops Co-authored-by: Jane Lusby <jlusby42@gmail.com>	2020-08-10 16:17:50 -07:00
teor	e95358dbe3	fix: Increase the number of retries The old sync code relied on duplicate block fetches to make progress, but the last few commits have removed some of those duplicates. Instead, just retry the fetches that fail.	2020-08-10 18:58:21 +10:00
teor	faac50697c	feature: Add a verified blocks metrics counter We have a counter for pending "download and verify" futures. But these futures are spawned, so they can complete in any order. They can also complete before we receive their results.	2020-08-10 15:12:08 +10:00
teor	6aeefcee8b	fix: Improve sync diagnostics	2020-08-10 15:12:08 +10:00
Henry de Valence	a77328ad7c	Refactor tracing components (#834 ) * Split tracing component code into modules. * Repatriate Tracing and simplify config handling. We upstreamed our Tracing component, expecting not to have to exert fine control over the tracing settings. But this turned out not to be the case, and now that we want to do other things (flamegraphs, journalctl, opentelemetry, etc), we end up with really awkward code (as in the current flamegraph handling). This also makes use of the changes to `init()` to load the config early to pass configuration data into the components, which avoids the need for the refactoring in #775. Finally, we restore support for the `-v` flag when the filter is unset. Closes #831. * Disable tracing and metrics endpoints by default. Closes #660. * Switch back to upstream Abscissa. * Integrate flamegraph support into the new Tracing component. * Pass -v in acceptance tests to get info-level output. * Clean up acceptance test code.	2020-08-06 10:29:31 -07:00
Jane Lusby	867dd0b475	Setup tracing-flame for use profiling zebrad (#436 ) * Setup tracing-flame for use profiling zebrad * start work on conditional flamegraph generation * review time! * update comments * Update Cargo.toml * disable default features for inferno * reorganize * missing one trait * Apply suggestions from code review * graceful shutdown! * remove special case handling on ctrlc for cleanup * rename signal fn to better represent its responsibility * remove unused global hook for flushing flamegraph * move tracing logic to the right file * just copy linkerd's signal handling logic * update book * make zebrad app drop on shutdown normally * Update zebrad/src/components/tokio.rs Co-authored-by: teor <teor@riseup.net> * Update zebrad/src/application.rs Co-authored-by: teor <teor@riseup.net> * Apply suggestions from code review Co-authored-by: teor <teor@riseup.net> * cleanup a little * ooh yea there's an API for that * setup env-filter for backup subscriber * document env filter * document return codes * forgot to save * Update book/src/applications/zebrad.md Co-authored-by: teor <teor@riseup.net> Co-authored-by: teor <teor@riseup.net>	2020-08-05 16:35:56 -07:00
Henry de Valence	82da4a5326	Remove connect command.	2020-08-04 23:34:45 -07:00
Alfredo Garcia	f2d7bb3177	Command execution tests (#690 ) * add zebrad acceptance tests * add custom command test helpers that work with kill * add and use info event for start and seed commands * combine conflicting tests into one test case Co-authored-by: Jane Lusby <jane@zfnd.org>	2020-08-01 16:15:26 +10:00
teor	11090dbf91	feature: Separate Mainnet and Testnet state	2020-07-29 01:45:19 -04:00
Alfredo Garcia	5b3c6e4c6c	Port bash checkpoint scripts to zebra-checkpoints single rust binary (#740 ) * make zebra-checkpoints * fix LOOKAHEAD_LIMIT scope * add a default cli path * change doc usage text * add tracing * move MAX_CHECKPOINT_HEIGHT_GAP to zebra-consensus * do byte_reverse_hex in a map	2020-07-25 17:53:00 +10:00
Henry de Valence	b59cfc49b7	sync: create requests sequentially to respect backpressure. This seems like a better design on principle but also appears to give a much nicer sawtooth pattern of queued blocks in the checkpointer and a much smoother pattern of block requests.	2020-07-24 18:36:00 -04:00
teor	2acfcf3a90	Make the CheckpointVerifier handle partial restarts (#736 ) Also put generic bounds on the BlockVerifier struct, so we get better compilation errors.	2020-07-24 11:47:48 +10:00
teor	77a1fefa1e	Download genesis (#731 ) * feature: Add more CheckpointVerifier tracing * fix: Download the genesis block	2020-07-23 10:56:52 -07:00
teor	c95c825707	fix: Lookup the genesis hash based on the network	2020-07-23 03:46:24 -04:00
Henry de Valence	4a98b8fa0d	Add basic metrics to the syncer.	2020-07-22 21:59:00 -07:00
Henry de Valence	c2c2a28e8b	Improve tracing output in chain verifier	2020-07-22 21:59:00 -07:00
Jane Lusby	7d4e717182	Add block locator request to state layer (#712 ) * Add block locator request to state layer * pass genesis in request * Update zebrad/src/commands/start/sync.rs * fix errors	2020-07-22 18:01:31 -07:00
Henry de Valence	49aa41544d	sync: try to ignore spurious inv messages. Closes #697. per https://github.com/ZcashFoundation/zebra/issues/697#issuecomment-662742971 The response to a getblocks message is an inv message with the hashes of the following blocks. However, inv messages are also sent unsolicited to gossip new blocks across the network. Normally, this wouldn't be a problem, because for every other request we filter only for the messages that are relevant to us. But because the response to a getblocks message is an inv, the network layer doesn't (and can't) distinguish between the response inv and the unsolicited inv. But there is a mitigation we can do. In our sync algorithm we have two phases: (1) "ObtainTips" to get a set of tips to chase down, (2) repeatedly call "ExtendTips" to extend those as far as possible. The unsolicited inv messages have length 1, but when extending tips we expect to get more than one hash. So we could reject responses in ExtendTips that have length 1 in order to ignore these messages. This way we automatically ignore gossip messages during initial block sync (while we're extending a tip) but we don't ignore length-1 responses while trying to obtain tips (while querying the network for new tips).	2020-07-22 17:55:52 -07:00
teor	9b97ebbd61	feature: Choose checkpoints based on the config	2020-07-23 10:26:25 +10:00
teor	3d721a96a5	feature: Add the state config to the config file	2020-07-23 10:26:25 +10:00
teor	89ac2793d6	feature: Use ChainVerifier in the sync service	2020-07-23 10:26:25 +10:00
Henry de Valence	928b0beb5d	sync: unindent fetch task	2020-07-21 20:16:23 -07:00
Henry de Valence	b722818e02	sync: remove redundant tracing specifier Co-authored-by: Jane Lusby <jlusby42@gmail.com>	2020-07-21 20:16:23 -07:00
Henry de Valence	1047d2f690	sync: add backpressure to syncer Closes #617. Closes #698. The remaining work on the syncer is alluded to in a new comment: 1. Correctly constructing a block locator object 2. Detecting when we've stopped making progress syncing and restarting obtain_tips.	2020-07-21 20:16:23 -07:00
Alfredo Garcia	db2eb80b3e	Create consensus utils and move byte_reverse_hex function to it (#705 ) * move byte_reverse_hex function	2020-07-22 12:29:14 +10:00
teor	e5bb96715f	fix: Reduce sync error logs to info or warn Network issues are very common.	2020-07-21 10:13:03 -07:00
teor	a0dbe85acd	fix: Rewrite the config usage comment	2020-07-21 12:58:55 -04:00
teor	851afad01f	fix: Resist CheckpointVerifier memory DoS attacks (#635 ) * fix: Resist CheckpointVerifier memory DoS attacks Allow a maximum of 2 queued blocks at each height, as a tradeoff between efficient bad block rejection, and memory usage. Closes #628. * fix: Make max queued blocks at height equal to fanout * fix: Just allocate all the capacity upfront * fix: Use with_capacity(1) and reserve_exact(1)	2020-07-15 13:27:10 -07:00
teor	78459afe97	fix: Stop revhex on EOF	2020-07-15 19:19:02 +10:00
teor	12b9fa8ae2	Let zebrad revhex read from stdin (#648 ) * Log at warn level for commands that use stdout * Let zebrad revhex read from stdin Most unix tools support reading from stdin, so they can be used in pipelines. Part of #564.	2020-07-15 16:16:07 +10:00
teor	8b5ec155f0	Consensus refactor (#629 ) * Flatten consensus::verify::* to consensus::* * Move consensus::::tests into their own files Move CheckpointList into its own file * Move Progress and Target into a types module QueuedBlock and QueuedBlockList can stay in checkpoint.rs, because they are tightly coupled to CheckpointVerifier.	2020-07-10 16:51:01 +10:00
Henry de Valence	ff4e722cd7	sync: touch up tracing output.	2020-07-09 11:15:06 -07:00
Dimitris Apostolou	ba81d7d4c0	Fix typos	2020-07-07 11:13:49 -07:00
Jane Lusby	51f6ce86ff	Implement retry policy for syncer (#551 )	2020-07-01 13:35:01 -07:00
Jane Lusby	7245d91fe9	fix block downloading to be parallelized and commited via the verifier (#540 )	2020-06-30 09:42:09 -07:00
Henry de Valence	21bf913b48	Revert "correctly trim and download tips (#531 )" This reverts commit `e102bd5e34`.	2020-06-24 12:24:37 -07:00
Jane Lusby	e102bd5e34	correctly trim and download tips (#531 ) * also download tips and filter tips * dispatch all block downloads together * tweek to match henry's changes * switch to more intuitive match Co-authored-by: Jane Lusby <jane@zfnd.org>	2020-06-24 15:19:34 -04:00
Alfredo Garcia	67718898c5	add usage help to generated config (#527 )	2020-06-23 11:56:00 -07:00
Henry de Valence	a453edd91c	Put type definitions back at the bottom of the file.	2020-06-23 10:16:27 -07:00
Henry de Valence	18eb212d8e	Set the new tips to be the last, not first, hash.	2020-06-23 10:16:27 -07:00
Jane Lusby	1c42b66a4f	Implement sync component for start subcommand (#506 )	2020-06-22 19:24:53 -07:00
Jane Lusby	246e7cd2a9	Start testing out new version of `eyre` and `color-eyre` in zebra (#526 ) * port to new version of eyre without generics * correctly setup color_eyre hooks Co-authored-by: Jane Lusby <jane@zfnd.org>	2020-06-22 15:36:23 -07:00
Deirdre Connolly	05316dee21	Listen on 0.0.0.0, not 127.0.0.1 Turns out when your node faces the internet directly, it has to listen to those addresses directly.	2020-06-19 03:46:09 -04:00
Henry de Valence	4b8f07ebb2	zebrad: Add reference to config docs.	2020-06-18 17:43:36 -04:00
Alfredo Garcia	b8f174ee3a	change config module to generate	2020-06-18 12:44:02 -07:00
Jane Lusby	7f8a336b69	switch to on_disk state service for start cmd	2020-06-17 23:30:50 -07:00
Jane Lusby	df18ac72c5	fix sharedpeererror to propagate tracing context	2020-06-17 14:38:26 -07:00
Jane Lusby	06fd3b2503	be more explicit with pattern in drain_requests	2020-06-16 12:04:45 -07:00
Jane Lusby	b0ecd019b6	apply comments from code review	2020-06-16 12:04:45 -07:00
Jane Lusby	d09c339dc5	little more cleaning	2020-06-16 12:04:45 -07:00
Jane Lusby	528fd2b5b1	add an outline of the structure of the node	2020-06-16 12:04:45 -07:00
Jane Lusby	fc96a41b18	copy connect command into start command	2020-06-16 12:04:45 -07:00
Jane Lusby	df656a8bf0	Reorganize `connect` subcommand for readibility (#450 )	2020-06-12 09:20:58 -07:00
Jane Lusby	431f194c0f	propagate errors out of zebra_network::init (#435 ) Prior to this change, the service returned by `zebra_network::init` would spawn background tasks that could silently fail, causing unexpected errors in the zebra_network service. This change modifies the `PeerSet` that backs `zebra_network::init` to store all of the `JoinHandle`s for each background task it depends on. The `PeerSet` then checks this set of futures to see if any of them have exited with an error or a panic, and if they have it returns the error as part of `poll_ready`.	2020-06-09 12:24:28 -07:00
Deirdre Connolly	42cc55b0bb	Remove testing tokio task That fires 'GetPeers' requests at our running 'zebra seed'.	2020-06-08 19:26:23 -04:00
Deirdre Connolly	43b77b080e	Fix 'dos' feature for seed command, and Buffer the seed service	2020-06-08 19:26:23 -04:00
Deirdre Connolly	8f5e7c268b	Request::Peers not GetPeers	2020-06-08 19:26:23 -04:00
Jane Lusby	9bcda0f9c7	Wrap Blocks in Arc throughout codebase	2020-06-05 00:36:55 -04:00
Jane Lusby	18b4dbc16c	fix tracing configuration issues (#432 )	2020-06-04 19:34:06 -07:00
Jane Lusby	e9af80b875	Add initial version of zebra-state (#414 ) * rename zebra-storage to zebra-state * Setup initial skeleton for zebra-state * add test * Apply suggestions from code review Co-authored-by: Henry de Valence <hdevalence@hdevalence.ca> * move shared test vectors to a common crate Co-authored-by: Jane Lusby <jane@zfnd.org> Co-authored-by: Henry de Valence <hdevalence@hdevalence.ca>	2020-06-02 16:16:17 -07:00
Jane Lusby	da72c5a86a	switch from abscissa::Context to color-eyre (#409 ) Co-authored-by: Jane Lusby <jane@zfnd.org>	2020-05-28 23:01:24 -04:00
Jane Lusby	8c178c3ee4	fix panic in seed subcommand (#401 ) Co-authored-by: Jane Lusby <jane@zfnd.org> Prior to this change, the seed subcommand would consistently encounter a panic in one of the background tasks, but would continue running after the panic. This is indicative of two bugs. First, zebrad was not configured to treat panics as non recoverable and instead defaulted to the tokio defaults, which are to catch panics in tasks and return them via the join handle if available, or to print them if the join handle has been discarded. This is likely a poor fit for zebrad as an application, we do not need to maximize uptime or minimize the extent of an outage should one of our tasks / services start encountering panics. Ignoring a panic increases our risk of observing invalid state, causing all sorts of wild and bad bugs. To deal with this we've switched the default panic behavior from `unwind` to `abort`. This makes panics fail immediately and take down the entire application, regardless of where they occur, which is consistent with our treatment of misbehaving connections. The second bug is the panic itself. This was triggered by a duplicate entry in the initial_peers set. To fix this we've switched the storage for the peers from a `Vec` to a `HashSet`, which has similar properties but guarantees uniqueness of its keys.	2020-05-27 17:40:12 -07:00

1 2 3 4 5

211 Commits