Errors when requesting signatures on testnet #1260
Comments
@ameba23 correction: I am finding that it is reproducible in a dev environment at a loop of 20, but it's not consistent. Side note: are there docs for this error? [{"Err":"Oneshot timeout error: channel closed"},{"Err":"Subscribe message rejected: Decryption(\"Public key does not match any of those expected for this protocol session\")"}]
For context, this error is coming from the response body of the So one TSS node is responding with The other error comes from the key-value db getting a entropy-core/crates/kvdb/src/kv_manager/kv.rs Line 196 in 84d7014
In the error case we should still be sending the result on the channel, not just logging and dropping the sender. To be clear, this is not the cause of the problem, but it's a reason why we cannot see the underlying db error without reading the logs, and it for sure needs fixing. I have not looked at the logs from the testnet TSS nodes, and I have not yet tried to reproduce this using the Rust test-cli.
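To illustrate the point about dropping the sender, here is a minimal sketch using only the standard library's `std::sync::mpsc` channel, not the actual kvdb code. The names `lookup`, `handle_get_dropping` and `handle_get_forwarding` are hypothetical. It shows why the caller only sees a generic "channel closed" style receive error when the handler logs and returns, and how forwarding the `Err` makes the underlying error visible to the caller.

```rust
use std::sync::mpsc::{channel, Sender};

// Hypothetical stand-in for a key-value lookup that can fail.
fn lookup(key: &str) -> Result<Vec<u8>, String> {
    if key == "present" {
        Ok(vec![1, 2, 3])
    } else {
        Err(format!("no value for key {key}"))
    }
}

// Problematic shape: on error we only log. `resp` is dropped when the
// function returns, so the receiver just sees a closed channel.
fn handle_get_dropping(key: &str, resp: Sender<Result<Vec<u8>, String>>) {
    match lookup(key) {
        Ok(value) => {
            let _ = resp.send(Ok(value));
        }
        Err(e) => eprintln!("kv error: {e}"),
    }
}

// Suggested shape: log the error, then send the result either way.
fn handle_get_forwarding(key: &str, resp: Sender<Result<Vec<u8>, String>>) {
    let result = lookup(key);
    if let Err(e) = &result {
        eprintln!("kv error: {e}");
    }
    let _ = resp.send(result);
}

fn main() {
    let (tx, rx) = channel();
    handle_get_dropping("missing", tx);
    // The receive itself fails, and the real cause is only in the log.
    println!("dropping:   {:?}", rx.recv());

    let (tx, rx) = channel();
    handle_get_forwarding("missing", tx);
    // The receive succeeds and carries the underlying error.
    println!("forwarding: {:?}", rx.recv());
}
```

With the second shape, the caller can report the db error in the response body instead of only a receive error.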
I am also seeing this happen quite a bit now too:
[{"Err":"User Error: Kv error: Recv Error: channel closed"},
{"Ok":["OM6uWOmEbr2++85oarrq56uepXEzz4WpiQx66aFAjDZ8EgjgwvBfSVZGx/CIGCBn7nJjZnAS+h8Da6GHQaoORwE=","f8c973f7f4fa9287c6d1328439c1f3c539e51d97e9708fd32e2b51b0e5c7a44ca51e0f31bad339608632624823d63f6a54e96d98c784c517dcaf89655780138b"]}]
Should I be concerned? This is in a 4 node dev network, so that is a valid signature, right? Or am I misunderstanding something?
I can recreate this error on testnet using When i attempt to sign three messages one after the other, one time i get a signature and the other two times i get a channel closed error:
Here are some logs (from all TSS nodes) pasted from grafana. TLDR: a subscribe message is getting rejected, but I'm not sure why. I'm pretty sure this was not the case when we first deployed testnet v0.3.0, as I signed a bunch of messages and never got any errors.
I cannot re-create this bug with the docker-compose setup on head of master. But it's hard to say if that's because of changes in master, or because the network has just been started, or because of something particular to the docker deployment. The next step would be to try with the docker-compose setup at release/v0.3.0.
Did you try looping? If so, how many times, @ameba23?
Like 10 times. Having looked at this a bit more, I think this is caused by a mismatch between which nodes the chain thinks the current signers are and which nodes actually hold keyshares. The reason I think this is:
The one thing that speaks against this is that @frankiebee has seen a similar problem in a dev environment (comment above: #1260 (comment)). If this is the problem, it should (hopefully) be fixed in the next deployment, as resharing has been fixed since the last release.
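As a rough way to picture the suspected mismatch, here is a sketch that compares two sets of node identifiers and lists the nodes the chain considers signers but which hold no keyshare. This is not how entropy-core tracks signers. The function name `signers_without_keyshares` and the node names are hypothetical, and how the two sets would be obtained (chain state, TSS node storage) is left out.

```rust
use std::collections::BTreeSet;

// Returns the nodes the chain lists as current signers that are not
// in the set of nodes actually holding keyshares.
fn signers_without_keyshares<'a>(
    chain_signers: &BTreeSet<&'a str>,
    keyshare_holders: &BTreeSet<&'a str>,
) -> Vec<&'a str> {
    chain_signers.difference(keyshare_holders).copied().collect()
}

fn main() {
    // Hypothetical example data, not taken from testnet.
    let chain_signers = BTreeSet::from(["node-a", "node-b", "node-c"]);
    let keyshare_holders = BTreeSet::from(["node-a", "node-b", "node-d"]);

    let mismatched = signers_without_keyshares(&chain_signers, &keyshare_holders);
    if mismatched.is_empty() {
        println!("chain signers and keyshare holders agree");
    } else {
        println!("signers without keyshares: {mismatched:?}");
    }
}
```

A signing request routed to a node in the returned list would be expected to fail under this hypothesis, which would fit the intermittent errors if the selected nodes vary per request.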
See this for a reproduction on master: entropyxyz/sdk#461. Side note: the error is still not consistent.
We are seeing this happen more often than not in the js cli
i tried this:
related to this issue in the sdk