Zebra

Commit Graph

Author	SHA1	Message	Date
teor	381c20b6af	Security: change the GetAddr fanout to 3 Zebra avoids having a majority of addresses from a single peer by asking 3 peers for new addresses. Also update a bunch of security comments and related documentation.	2021-04-15 13:09:14 -04:00
teor	59aa04c9b9	Stop panicking when Zebra sends a reject without extra data Also add round-trip unit tests for extra data and no extra data.	2021-04-15 12:20:33 -04:00
teor	a417c7c8c7	Use meaningful names for select! variables	2021-04-13 23:56:16 -04:00
teor	fb95de99a6	Refactor the dial result into a From impl	2021-04-13 18:52:49 -04:00
Alfredo Garcia	5ec05e91e1	update version strings for v1.0.0-alpha.6	2021-04-08 18:48:34 -04:00
teor	1626ec383a	Add InventoryHash and MetaAddr proptests (#1985 ) * Make proptest dependencies consistent between chain and network * Implement Arbitrary for InventoryHash and use it in tests * Impl Arbitrary for MetaAddr and use it in tests Also test some extreme times in MetaAddr sanitization.	2021-04-07 14:13:52 -03:00
teor	375c8d8700	Fix a deadlock between the crawler and dialer, and other hangs (#1950 ) * Stop ignoring inbound message errors and handshake timeouts To avoid hangs, Zebra needs to maintain the following invariants in the handshake and heartbeat code: - each handshake should run in a separate spawned task (not yet implemented) - every message, error, timeout, and shutdown must update the peer address state - every await that depends on the network must have a timeout Once the Connection is created, it should handle timeouts. But we need to handle timeouts during handshake setup. * Avoid hangs by adding a timeout to the candidate set update Also increase the fanout from 1 to 2, to increase address diversity. But only return permanent errors from `CandidateSet::update`, because the crawler task exits if `update` returns an error. Also log Peers response errors in the CandidateSet. * Use the select macro in the crawler to reduce hangs The `select` function is biased towards its first argument, risking starvation. As a side-benefit, this change also makes the code a lot easier to read and maintain. * Split CrawlerAction::Demand into separate actions This refactor makes the code a bit easier to read, at the cost of sometimes blocking the crawler on `candidates.next()`. That's ok, because `next` only has a short (< 100 ms) delay. And we're just about to spawn a separate task for each handshake. * Spawn a separate task for each handshake This change avoids deadlocks by letting each handshake make progress independently. * Move the dial task into a separate function This refactor improves readability. * Fix buggy future::select function usage And document the correctness of the new code.	2021-04-07 10:25:10 -03:00
teor	de6d1c93f3	Clarify a comment	2021-04-07 18:56:38 +10:00
teor	64662a758d	Move the preallocate tests into their own files (#1977 ) * Move the preallocate tests into their own files And move the MetaAddr proptest into its own file. Also do some minor formatting and cleanups. Co-authored-by: Deirdre Connolly <durumcrustulum@gmail.com>	2021-04-07 12:32:27 +10:00
Preston Evans	0daaf582e2	Implement Trusted Vector Preallocation (#1920 ) * Implement SafePreallocate. Resolves #1880 * Add proptests for SafePreallocate * Apply suggestions from code review Comments which did not include replacement code will be addressed in a follow-up commit. Co-authored-by: teor <teor@riseup.net> * Rename [Safe-> Trusted]Allocate. Add doc and tests Add tests to show that the largest allowed vec under TrustedPreallocate is small enough to fit in a Zcash block/message (depending on type). Add doc comments to all TrustedPreallocate test cases. Tighten bounds on max_trusted_alloc for some types. Note - this commit does NOT include TrustedPreallocate impls for JoinSplitData, String, and Script. These impls will be added in a follow up commit * Implement SafePreallocate. Resolves #1880 * Add proptests for SafePreallocate * Apply suggestions from code review Comments which did not include replacement code will be addressed in a follow-up commit. Co-authored-by: teor <teor@riseup.net> * Rename [Safe-> Trusted]Allocate. Add doc and tests Add tests to show that the largest allowed vec under TrustedPreallocate is small enough to fit in a Zcash block/message (depending on type). Add doc comments to all TrustedPreallocate test cases. Tighten bounds on max_trusted_alloc for some types. Note - this commit does NOT include TrustedPreallocate impls for JoinSplitData, String, and Script. These impls will be added in a follow up commit * Impl TrustedPreallocate for Joinsplit * Impl ZcashDeserialize for Vec<u8> * Arbitrary, TrustedPreallocate, Serialize, and tests for Spend<SharedAnchor> Co-authored-by: teor <teor@riseup.net>	2021-04-06 09:49:42 +10:00
teor	83b88f5b7a	Merge pull request #1972 from ZcashFoundation/peer-set-demand-deadlock-doc Document peer set deadlock resistance	2021-04-01 22:50:17 -04:00
teor	306fa88214	Document the correctness of Poll::Pending wakeups	2021-03-27 08:55:49 -04:00
teor	b329892665	Add a comment about a zcashd inv message bug	2021-03-26 11:26:59 -04:00
teor	1a159dfcb6	Add more methods for creating MetaAddrs This refactor lets us remove `MetaAddr::update_last_seen()`.	2021-03-26 07:23:49 +10:00
teor	6fe81d8992	Make MetaAddr.last_seen into a private field	2021-03-26 07:23:49 +10:00
teor	eae59de1e8	use PeerAddrState::*	2021-03-26 07:23:49 +10:00
teor	e9cdc224a2	Rewrite MetaAddr::sanitize so it's harder to misuse `sanitize` could be misused in two ways: * accidentally modifying the addresses in the address book itself * forgetting to sanitize new fields added to `MetaAddr` This change prevents accidental modification by taking `&self`, and explicitly creates a new sanitized `MetaAddr` with all fields listed.	2021-03-26 07:23:49 +10:00
Deirdre Connolly	c5bad9fac2	Rename NU5 to Nu5 to appease newly stable clippy::upper-case-acronyms (#1945 )	2021-03-26 07:22:50 +10:00
Deirdre Connolly	7efc700aca	Merge pull request #1713 from ZcashFoundation/use-groth16-batch-math Use batch optimizations, load params in groth16::Verifier, verify Spend & Output descriptions in transaction verifier	2021-03-24 12:28:25 -04:00
Deirdre Connolly	ca1d2de87d	Bump versions for v1.0.0-alpha.5 (#1932 ) Zebra's latest alpha checkpoints on Canopy activation, continues our work on NU5, and fixes a security issue. Some notable changes include: ## Added - Log address book metrics when PeerSet or CandidateSet don't have many peers (#1906) - Document test coverage workflow (#1919) - Add a final job to CI, so we can easily require all the CI jobs to pass (#1927) ## Changed - Zebra has moved its mandatory checkpoint from Sapling to Canopy (#1898, #1926) - This is a breaking change for users that depend on the exact height of the mandatory checkpoint. ## Fixed - tower-batch: wake waiting workers on close to avoid hangs (#1908) - Assert that pre-Canopy blocks use checkpointing (#1909) - Fix CI disk space usage by disabling incremental compilation in coverage builds (#1923) ## Security - Stop relying on unchecked length fields when preallocating vectors (#1925)	2021-03-22 22:05:01 -04:00
Alfredo Garcia	c5b1d0deee	move consts to start of the function	2021-03-22 11:54:31 -04:00
teor	b623acc945	Add memory DoS prevention comments	2021-03-22 11:54:31 -04:00
teor	8e18c99cdc	Avoid risky use of Read::take with untrusted lengths Zebra already uses `Read::take` to enforce message, body, and block maximum sizes. So using `Read::take` on untrusted sizes can result in short reads, without a corresponding `UnexpectedEof` error. (The old code was correct, but copying it elsewhere would have been risky.)	2021-03-22 11:54:31 -04:00
teor	609d70ae53	Stop untrusted preallocation during string deserialization This is an easy memory denial of service attack.	2021-03-22 11:54:31 -04:00
teor	4f923b90ea	Log address book metrics when peers aren't responding	2021-03-17 10:47:04 +10:00
teor	5a30268d7a	Log address metrics when the peer set has no ready peers	2021-03-17 10:47:04 +10:00
teor	6a342e93ca	Refactor AddressBook metrics into their own struct And provide an accessor function for address book metrics.	2021-03-17 10:47:04 +10:00
Alfredo Garcia	d49eaab68e	Bump versions for zebrad 1.0.0-alpha.4 (#1913 ) * Bump versions for zebrad 1.0.0-alpha.4 * add Cargo.lock	2021-03-16 21:12:37 -03:00
Jack Grigg	7a8cae9321	Tag message metrics by type	2021-03-17 09:38:07 +10:00
Jack Grigg	e51f33a4b9	Use interoperable names for common metrics These names match the equivalent metrics in zcashd, enabling common metrics to be collected across both node types.	2021-03-17 09:38:07 +10:00
teor	8fabbce037	Document and log trailing message bytes (#1888 ) * Rename a variable for consistency * Log extra trailing message bytes at debug level	2021-03-15 08:25:27 +10:00
teor	976ec912db	Document that the listed address is also advertised to peers (#1891 ) Documents a potential privacy leak, and a missing feature.	2021-03-15 08:25:07 +10:00
teor	e50692bd51	CandidateSet: Add Listener Port Connections Inbound connections on the Zcash protocol listener port perform a handshake. If the handshake is successful, it adds the peer to the AddressBook.	2021-03-09 23:05:18 -05:00
Jane Lusby	03aa6f671f	Implement outbound connection rate limiting - includes config rename with alias (#1855 ) * Implement outbound connection rate limiting * fix breaking change on config Co-authored-by: teor <teor@riseup.net>	2021-03-10 01:36:05 +00:00
Jane Lusby	e541746a50	Add initial support for NU5 to zebra (#1823 ) * Add NU5 variant to NetworkUpgrade * Add consensus branch ID for NU5 * Add network protocol versions for NU5 * Add NU5 to the protocol::version_consistent test * Make unimplemented panic messages more specific * Block target spacing doesn't change in NU5 * add comments for future updates for NU5 Co-authored-by: teor <teor@riseup.net>	2021-03-03 06:22:11 +10:00
teor	895bb43ead	Clippy: Fix inconsistent struct member orders lint	2021-03-01 23:31:18 -05:00
teor	2587a4e272	Fix a peer DNS resolution edge case (#1796 ) * Retry each peer DNS a few times individually We retry each peer individually, as well as retrying if there are no peers in the combined list. DNS failures are correlated, so all peers can fail DNS, leaving Zebra with a small list of custom-configured IP address peers. Individual retries avoid this issue. * Rename parse_peers to resolve_peers Co-authored-by: Deirdre Connolly <durumcrustulum@gmail.com>	2021-02-26 09:06:27 +10:00
teor	9c3f236075	Stop sending blocks and transactions on error	2021-02-25 08:44:57 -08:00
teor	78f162733d	Revert "leverage return value for propagating errors" This reverts commit `e6cb20e13f`.	2021-02-24 13:07:31 -08:00
teor	72e2e83828	Revert "introduce Transition enum" This reverts commit `6906f87ead`.	2021-02-24 13:07:31 -08:00
teor	a5e89f4f2b	Revert "accidental drop on mustusesender" This reverts commit `5ec8d09e0d`.	2021-02-24 13:07:31 -08:00
teor	d60226a3cf	Revert "rustfmt" This reverts commit `9d9734ea81`.	2021-02-24 13:07:31 -08:00
teor	359015b2be	Revert "Only reject pending client requests when the peer has errored" This reverts commit `e06705ed81`.	2021-02-24 13:07:31 -08:00
teor	663ed6c842	Revert "Remove remaining references to fail_with" This reverts commit `5e4bf804aa`.	2021-02-24 13:07:31 -08:00
teor	3c225550ee	Revert "rename transitions from Exit to Close" This reverts commit `cfc4717b98`.	2021-02-24 13:07:31 -08:00
teor	86dc66dfa9	Revert "deduplicate match arms in handle_client_request" This reverts commit `2adee7b31a`.	2021-02-24 13:07:31 -08:00
teor	292a4391e2	Revert "update comments throughout connection.rs" This reverts commit `651d352ce1`.	2021-02-24 13:07:31 -08:00
teor	fc44a97925	Revert "remove unnecessary Option around request timeout" This reverts commit `c3724031df`.	2021-02-24 13:07:31 -08:00
teor	e06120cd36	Revert "ensure peer/client.rs comments are up to date" This reverts commit `2266886a53`.	2021-02-24 13:07:31 -08:00
teor	1a70d807b6	Revert "make sure peer/error.s comments are up to date" This reverts commit `6f205a1812`.	2021-02-24 13:07:31 -08:00
teor	3b2077fcfd	Revert "Apply suggestions from code review" This reverts commit `736092abb8`.	2021-02-24 13:07:31 -08:00
teor	7558f74c78	Bump versions for zebrad 1.0.0-alpha.3	2021-02-23 10:39:13 -05:00
dependabot[bot]	b578d1ff2e	build(deps): bump proptest-derive from 0.2.0 to 0.3.0 Bumps [proptest-derive](https://github.com/AltSysrq/proptest) from 0.2.0 to 0.3.0. - [Release notes](https://github.com/AltSysrq/proptest/releases) - [Changelog](https://github.com/AltSysrq/proptest/blob/master/CHANGELOG.md) - [Commits](https://github.com/AltSysrq/proptest/compare/proptest-derive-0.2.0...proptest-derive-0.3.0) Signed-off-by: dependabot[bot] <support@github.com>	2021-02-22 01:33:54 -05:00
teor	d4f2f27218	Add global span to spawned network tasks (#1761 ) Closes #1575	2021-02-20 08:36:50 +10:00
ebfull	b7fddbde94	Compute the expected body length to reduce heap allocations (#1773 ) * Compute the expected body length to reduce heap allocations	2021-02-19 22:18:57 +00:00
Jane Lusby	736092abb8	Apply suggestions from code review Co-authored-by: teor <teor@riseup.net>	2021-02-19 14:11:35 -08:00
Jane Lusby	6f205a1812	make sure peer/error.s comments are up to date	2021-02-19 14:11:35 -08:00
Jane Lusby	2266886a53	ensure peer/client.rs comments are up to date	2021-02-19 14:11:35 -08:00
Jane Lusby	c3724031df	remove unnecessary Option around request timeout	2021-02-19 14:11:35 -08:00
Jane Lusby	651d352ce1	update comments throughout connection.rs	2021-02-19 14:11:35 -08:00
Jane Lusby	2adee7b31a	deduplicate match arms in handle_client_request	2021-02-19 14:11:35 -08:00
Jane Lusby	cfc4717b98	rename transitions from Exit to Close	2021-02-19 14:11:35 -08:00
teor	5e4bf804aa	Remove remaining references to fail_with	2021-02-19 14:11:35 -08:00
teor	e06705ed81	Only reject pending client requests when the peer has errored - Add an `ExitClient` transition, used when the internal client channel is closed or dropped, and there are no more pending requests - Ignore pending requests after an `ExitClient` transition - Reject pending requests when the peer has caused an error (the `Exit` and `ExitRequest` transitions) - Remove `PeerError::ConnectionDropped`, because it is now handled by `ExitClient`. (Which is an internal error, not a peer error.)	2021-02-19 14:11:35 -08:00
teor	9d9734ea81	rustfmt	2021-02-19 14:11:35 -08:00
Jane Lusby	5ec8d09e0d	accidental drop on mustusesender	2021-02-19 14:11:35 -08:00
Jane Lusby	6906f87ead	introduce Transition enum	2021-02-19 14:11:35 -08:00
Jane Lusby	e6cb20e13f	leverage return value for propagating errors	2021-02-19 14:11:35 -08:00
teor	e61b5e50a2	Diagnostics for CI port conflict failures (#1766 ) Log a "Trying..." message before each listener opens, to see if the delay is inside Zebra, or in the test harness or OS. Also report the configured and actual ports where possible, for better diagnostics.	2021-02-18 12:15:09 -03:00
teor	5424e1d8ba	Fix candidate set address state handling (#1709 ) Design: - Add a `PeerAddrState` to each `MetaAddr` - Use a single peer set for all peers, regardless of state - Implement time-based liveness as an `AddressBook` method, rather than a `PeerAddrState` variant - Delete `AddressBook.by_state` Implementation: - Simplify `AddressBook` changes using `update` and `take` modifier methods - Simplify the `AddressBook` iterator implementation, replacing it with methods that are more obviously correct - Consistently collect peer set metrics Documentation: - Expand and update the peer set documentation We can optimise later, but for now we want simple code that is more obviously correct.	2021-02-18 11:18:32 +10:00
teor	579bd4a368	Retry DNS resolution on failure (#1762 ) Otherwise, a transient DNS failure makes the node hang.	2021-02-18 07:09:02 +10:00
teor	86169f6412	Update PeerSet metrics after every change (#1727 )	2021-02-18 07:06:59 +10:00
teor	8d1c498234	Log initial peer connection failures And standardise another log message	2021-02-17 09:21:53 -05:00
teor	e85441c914	Add a correctness comment to justify the revert	2021-02-16 05:52:54 +10:00
teor	a02a00a3f5	Revert "Stop using CallAllUnordered in peer_set::add_initial_peers (#1705 )" This reverts commit `241c7ad849`.	2021-02-16 05:52:54 +10:00
teor	e7176b86da	Clarify the Response::Nil documentation	2021-02-11 09:45:42 -05:00
Deirdre Connolly	0c5daa8410	Bump versions for zebrad 1.0.0-alpha.2 Including tower-batch bump to 0.2.0, tower-fallback to 0.2.0, zebra-script to 1.0.0-alpha.3	2021-02-09 16:14:29 -05:00
Alfredo Garcia	241c7ad849	Stop using CallAllUnordered in peer_set::add_initial_peers (#1705 ) * use ServiceExt::oneshot and FuturesUnordered Co-authored-by: teor <teor@riseup.net>	2021-02-09 08:16:02 +10:00
teor	1e156a5d60	Document that connect_isolated only works on mainnet Document that connect_isolated only works on mainnet. See #1687.	2021-02-04 17:32:00 -05:00
Alfredo Garcia	d7c40af2a8	Fix shutdown panics (#1637 ) * add a shutdown flag in zebra_chain::shutdown * fix network panic on shutdown * fix checkpoint panic on shutdown	2021-02-03 19:03:28 +10:00
Alfredo Garcia	221512c733	Async DNS seeder lookups (#1662 ) * replace to_socket_addrs * refactor `resolve()` into `resolve_host()` * use `resolve_host()` to resolve config peers * add DNS_LOOKUP_TIMEOUT constant * don't block the main thread in initialize	2021-02-03 12:20:26 +10:00
teor	983e94f9e4	Add a TODO for inbound error handling cleanup	2021-02-03 08:32:10 +10:00
Alfredo Garcia	4b34482264	Add hints to port conflict and lock file panics (#1535 ) * add hint for port error * add issue filter for port panic * add lock file hint * add metrics endpoint port conflict hint * add hint for tracing endpoint port conflict * add acceptance test for resource conflics * Split out common conflict test code into a function * Add state, metrics, and tracing conflict tests * Add a full set of stderr acceptance test functions This change makes the stdout and stderr acceptance test interfaces identical. * move Zcash listener opening * add todo about hint for disk full * add constant for lock file * match path in state cache * don't match windows cache path * Use Display for state path logs Avoids weird escaping on Windows when using Debug * Add Windows conflict error messages * Turn PORT_IN_USE_ERROR into a regex And add another alternative Windows-specific port error Co-authored-by: teor <teor@riseup.net> Co-authored-by: Jane Lusby <jane@zfnd.org>	2021-01-29 22:36:33 +10:00
Deirdre Connolly	1b09538277	Bump versions for zebrad 1.0.0-alpha.1 (#1646 ) * Bump versions where appropriate Tested with cargo install --locked --path etc * Remove fixed panics from 'Known Issues' * Change to alpha release series in the README Co-authored-by: teor <teor@riseup.net>	2021-01-27 20:31:39 -05:00
teor	b551d81f8d	Explain why we stay connected on Inbound errors We might be syncing using this peer, so it's ok to just ignore any internal errors in their Inbound requests, and drop the request.	2021-01-27 12:08:49 -08:00
teor	258789ed9b	Use the rustc unknown lints attribute The clippy unknown lints attribute was deprecated in nightly in rust-lang/rust#80524. The old lint name now produces a warning. Since we're using `allow(unknown_lints)` to suppress warnings, we need to add the canonical name, so we can continue to build without warnings on nightly. But we also need to keep the old name, so we can continue to build without warnings on stable. And therefore, we also need to disable the "removed lints" warning, otherwise we'll get warnings about the old name on nightly. We'll need to keep this transitional clippy config until rustc 1.51 is stable.	2021-01-19 11:02:20 -05:00
teor	05fff8e6f7	Revert "Stop panicking when fail_with is called twice on a connection" But keep the extra error information.	2021-01-18 00:23:36 -05:00
teor	4fe81da953	Improve logging for connection state errors	2021-01-18 00:23:36 -05:00
teor	a6c1cd3c35	Stop panicking when fail_with is called twice on a connection We can't rule out the connection state changing between the state checks and any eventual failures, particularly in the presence of async code. So we turn this panic into a warning.	2021-01-18 00:23:36 -05:00
teor	44c8fafc29	Stop processing the request after failing an overloaded connection zebra-network's Connection expects that `fail_with` is only called once per connection, but the overload handling code continues to process the current request after an overload error, potentially leading to further failures. Closes #1599	2021-01-18 00:23:36 -05:00
teor	0f0fb93b5c	Update some comments in zebra-network Add ticket numbers, and update based on design decisions and new code.	2021-01-15 09:02:10 -05:00
teor	730910cd99	Upgrade to tokio 0.3.6 from crates.io And remove the tokio git dependency patch	2021-01-12 15:37:27 -05:00
Jane Lusby	15698245e1	Deduplicate metrics dependencies (#1561 ) ## Motivation This PR is motivated by the regression identified in https://github.com/ZcashFoundation/zebra/issues/1349. That PR notes that the metrics stopped working for most of the crates other than `zebrad`. ## Solution This PR resolves the regression by deduplicating the `metrics` crate dependency. During a recent change we upgraded the metrics version in `zebrad` and a couple other of our crates, but we never updated the dependencies in `zebra-state`, `zebra-consensus`, or `zebra-network`. This caused the metrics macros to attempt to retrieve the current metrics exporter through the wrong function. We would install the metrics exporter in `0.13`, but then attempt to look it up through the `0.12` crate, which contains a different instance of the metrics exporter static variable which is unset. Doing this causes the metrics macros to return `None` for the current exporter after which they just silently give up. ## Related Issues closes https://github.com/ZcashFoundation/zebra/issues/1349 ## Follow Up Work I noticed we have quite a few duplicate dependencies in our tree. We might be able to save some compilation time by auditing those and deduplicating them as much as possible. - https://github.com/ZcashFoundation/zebra/issues/1582 Co-authored-by: teor <teor@riseup.net>	2021-01-12 12:28:56 +10:00
dependabot[bot]	38ac869f57	build(deps): bump byteorder from 1.3.4 to 1.4.2 Bumps [byteorder](https://github.com/BurntSushi/byteorder) from 1.3.4 to 1.4.2. - [Release notes](https://github.com/BurntSushi/byteorder/releases) - [Changelog](https://github.com/BurntSushi/byteorder/blob/master/CHANGELOG.md) - [Commits](https://github.com/BurntSushi/byteorder/compare/1.3.4...1.4.2) Signed-off-by: dependabot[bot] <support@github.com>	2021-01-11 18:45:49 -05:00
teor	b7d0a40ee1	Revert unused instrument macros Reverts most of "Instrument some functions to try to locate the panic"	2021-01-06 13:07:23 -08:00
teor	6d3aa0002c	Ensure received client request oneshots are used via the type system The `peer::Client` translates `Request`s into `ClientRequest`s, which it sends to a background task. If the send is `Ok(())`, it will assume that it is safe to unconditionally poll the `Receiver` tied to the `Sender` used to create the `ClientRequest`. We enforce this invariant via the type system, by converting `ClientRequest`s to `InProgressClientRequest`s when they are received by the background task. These conversions are implemented by `ClientRequestReceiver`. Changes: * Revert `ClientRequest` so it uses a `oneshot::Sender` * Add `InProgressClientRequest`, which is the same as `ClientRequest`, but has a `MustUseOneshotSender` * `impl From<ClientRequest> for InProgressClientRequest` * Add a new `ClientRequestReceiver` type that wraps a `mpsc::Receiver<ClientRequest>` * `impl Stream<InProgressClientRequest> for ClientRequestReceiver`, converting the successful result of `inner.poll_next_unpin` into an `InProgressClientRequest` * Replace `client_rx: mpsc::Receiver<ClientRequest>` in `Connection` with the new `ClientRequestReceiver` type * `impl From<mpsc::Receiver<ClientRequest>> for ClientRequestReceiver`	2021-01-06 13:07:23 -08:00
teor	df1b0c8d58	Defer a timeout fix until later	2021-01-06 13:07:23 -08:00
teor	d5cfd5ad5f	Clarify the ClientRequest invariant Co-authored-by: Jane Lusby <jlusby42@gmail.com>	2021-01-06 13:07:23 -08:00
teor	f8ff2e9c0b	Add more sends before dropping ClientRequests This fix also changes heartbeat behaviour in the following ways: * if the queue is full, the connection is closed. Previously, the sender would wait until the queue had emptied * if the queue flush fails, Zebra panics, because it can't send an error on the ClientRequest sender, so the invariant is broken	2021-01-06 13:07:23 -08:00
teor	3e711ccc8a	Instrument some functions to try to locate the panic	2021-01-06 13:07:23 -08:00

1 2 3 4 5 ...

515 Commits