Akash Provider Fails to Start #69
Comments
Are you on the latest v0.5.2 build?
On Mar 12, 2022, at 7:53 AM, Douglas Bebber ***@***.***> wrote:
Everything appears to be working except that the Provider is offline and will not successfully start.
I keep seeing the following in the /var/log/handyhost.log:
AKT: get aggregates query error Error: post failed: Post "http://rpc-1.handyhost.computer/": dial tcp: address rpc-1.handyhost.computer: missing port in address
AKT: get aggregates query error Error: post failed: Post "http://rpc-1.handyhost.computer/": dial tcp: address rpc-1.handyhost.computer: missing port in address
AKT: get aggregates query error Error: post failed: Post "http://rpc-1.handyhost.computer/": dial tcp: address rpc-1.handyhost.computer: missing port in address
AKT: get aggregates query error Error: post failed: Post "http://rpc-1.handyhost.computer/": dial tcp: address rpc-1.handyhost.computer: missing port in address
AKT: get aggregates query error Error: post failed: Post "http://rpc-1.handyhost.computer/": dial tcp: address rpc-1.handyhost.computer: missing port in address
error calling http://rpc-1.handyhost.computer:26659/akash1ghhkmp9c0zwynvktefh2uu0saam2dzw04pf52m Error: connect ECONNREFUSED 72.8.228.138:26659
at TCPConnectWrap.afterConnect [as oncomplete] (net.js:1146:16) {
errno: -111,
code: 'ECONNREFUSED',
syscall: 'connect',
address: '72.8.228.138',
port: 26659
}
Any known solution?
Yes, I installed handyhost_v0.5.2.1.deb on Ubuntu 20.04 LTS. Handy Host shows:
I think the errors you were seeing in the logs were likely due to an RPC outage on our aggregates node yesterday morning.
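The two errors in the log are different in kind: the first complains that the address has no port, which is reported before any connection is attempted, while the second shows a connection to port 26659 being refused. A minimal Python sketch for telling the two apart from the provider machine follows. The helper names `split_endpoint`, `can_connect`, and `diagnose` are hypothetical and not part of Handy Host or Akash; the sketch only tests whether a TCP connection can be opened and says nothing about whether the RPC service behind the port is healthy.

```python
import socket
from urllib.parse import urlsplit


def split_endpoint(url):
    """Return (host, port) for a URL; port is None when the URL names no explicit port."""
    parts = urlsplit(url)
    return parts.hostname, parts.port


def can_connect(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port can be opened within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and DNS failures alike.
        return False


def diagnose(url, timeout=3.0):
    """Describe an endpoint in one line, without connecting if the URL has no port."""
    host, port = split_endpoint(url)
    if port is None:
        return f"{host}: no explicit port in URL"
    state = "accepting connections" if can_connect(host, port, timeout) else "not reachable"
    return f"{host}:{port}: {state}"
```

Calling `diagnose("http://rpc-1.handyhost.computer:26659/")` after an outage should show whether the node is accepting connections again, assuming the endpoint from the log is still the one in use.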
Any chance you've tried a system reboot (on the provider system)? Sometimes, as in the case you ran into where the RPC node was down, a provider can get stuck in an endless restart loop, which I suspect may be happening here.
Rebooted several times. Actually, the machine locks up after running for several hours and I am forced to reboot. The provider is running on the Handy Host machine, not on any of the 3 Akash cluster nodes.
Is there an Akash provider-specific log I can inspect to try to discover the details of the problem?
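The thread does not name a provider-specific log file, but the `/var/log/handyhost.log` file mentioned at the top can at least be condensed so that repeated errors do not bury the rare ones. A rough Python sketch, assuming the log is plain text and that interesting lines contain the word "error"; the function names are hypothetical:

```python
from collections import Counter


def summarize_errors(lines, keyword="error"):
    """Count distinct lines containing the keyword (case-insensitive), most frequent first."""
    needle = keyword.lower()
    counts = Counter(line.strip() for line in lines if needle in line.lower())
    return counts.most_common()


def summarize_file(path="/var/log/handyhost.log", keyword="error"):
    """Summarize one log file; the default path is the one quoted in this issue."""
    with open(path, errors="replace") as fh:
        return summarize_errors(fh, keyword)
```

Printing the result of `summarize_file()` gives one row per distinct error message with its count, which makes it easier to see whether anything other than the aggregates query error is being logged.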
Not sure if it's the same thing, but this happened to me when I built my first cluster. Try this: start your provider, but uncheck "start at boot". It will probably fail if you had already attempted to start the provider multiple times, so reboot your host after a minute. (This way the provider does not try to start on its own, and you have a better point of reference for when you actually started it.) When your host is back online, start your provider once more, this time with "start provider at boot" checked. The dot will go yellow and then gray, but just give it time (go make some coffee and come back in a few minutes). It will go online. As long as you keep seeing the "Provider Logs" menu on the left, DO NOT TRY TO START IT AGAIN. If you don't see the "Provider Logs" menu on the left, the provider is not running. When I kept trying to start it right after it went gray the first time, it would never come online. It might be something totally unrelated, but that's my two cents!
I tried your suggestion. However, the provider never started. I eventually clicked the "Provider logs" link and no information was presented. I'd like to be able to view some Akash provider log information to attempt to debug what's happening on my machine.
What does your networking look like? I know it's probably the obvious question... just trying to help :)
Port 80 is forwarded to the Ingress Controller machine on one of my cluster nodes. Port 8443 is forwarded to my Provider node (the machine running Handy Host). I have tried a number of things with no success getting the Provider running.
You must open & forward ports 80 and 30000-32767 to your Ingress Controller.
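One way to sanity-check the forwarding described above is to probe the ports from a machine outside the local network. The Python sketch below is only an illustration: `probe_ports` is a hypothetical helper, and the host you pass in must be your own public address. Note that a port in the 30000-32767 range only answers while some service is actually listening on it, so a closed result there does not by itself prove that forwarding is broken.

```python
import socket


def probe_ports(host, ports, timeout=2.0):
    """Return {port: True/False} depending on whether a TCP connection could be opened."""
    results = {}
    for port in ports:
        try:
            with socket.create_connection((host, port), timeout=timeout):
                results[port] = True
        except OSError:
            # Refused, filtered (timed out), or unresolvable host.
            results[port] = False
    return results
```

For the setup in this thread, something like `probe_ports("your.public.address", [80, 8443, 30000, 32767])` would cover the ingress port, the provider port, and both ends of the forwarded range.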
What you could try that helped on my side sometimes: do a |