-
Notifications
You must be signed in to change notification settings - Fork 154
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Guacamole connections start to fail #3641
Comments
The error:
Suggest the user has not accessed the TRE workspace portal or the application registration has been incorrectly provisioned. Can you confirm the user in question can access the workspace portal, and no pop ups appear or are blocked to request consent? If that seems ok, post a screen shot of the API permissions for the |
Hi Marcus. It's all users including me as the Admin. We have rerun the build pipeline too trying to see if anything has changed but it had no effect on the symptoms |
Thats users in the Enterprise App, can you get API permissions in the app registration. Thanks. |
Can you confirm you see the error ` The user or administrator has not consented to use the application with ID '*********************************' named 'tre-ws-f4eb'. each time? There is no reason it would work for some VMs and not others, are you 100% sure that is the case. Can you provide the logs before that error? |
ok i will connect to a working VM in the WS and then a failing WS, then capture the logs. bear with me |
Logs sent over privately |
Can you check the cloud init logs on the failing VM? The logs show:
Which makes me think RDP hasn't configured correctly. Recommend using a prebaked image with everything configured to remove the risk of these transient issues. |
these VM's where built using the out of the box images. Looks like it there are failures around nexus in the logs I've checked and nexus is running on that IP too |
from looking at the logs of both servers it seems to go wrong here Cloud-init v. 22.2-0ubuntu1~18.04.2 running 'modules:config' at Fri, 28 Jul 2023 14:02:26 +0000. Up 570.95 seconds. i wonder if this is an issue with the nexus ssl certs ? |
Looks like could be SSL cert issue. Strange that previous Linux VMs worked though. What do you get at https://nexus-tre.uksouth.cloudapp.azure.com ? Is the certificate valid? |
I doubt it will have ever worked if always been like this. Can you confirm the certs service is installed and the certificate name, along with the certificate name specified in the nexus deployment, and that the two match? Again worth checking the nexus cloud init logs. Full details can be found here - https://microsoft.github.io/AzureTRE/v0.12.0/tre-templates/shared-services/nexus/ |
we have 2 environments currently. both with the same cert message but only this one has started failing. I am still deploying successfully into the other environment fine this morning. |
There is a renew action in the UI for Cert Service - https://microsoft.github.io/AzureTRE/v0.12.0/tre-templates/shared-services/nexus/#renewing-certificates-for-nexus |
In an existing workspace we have Linux and WIndows machines which can be connected to fine. However since yesterday morning things have changed.
Connections to existing Windows machines successful
Connections to existing Linux machines successful
Connections to New WIndows machines successful
Connection to New Linux machines FAILED
We found errors in the Guacamole logs ; -
Unable to refresh session: error refreshing tokens: unable to redeem refresh token: failed to get token: oauth2: cannot fetch token: 400 Bad Request
"error":"invalid_grant","error_description":"AADSTS65001: The user or administrator has not consented to use the application with ID '*********************************' named 'tre-ws-f4eb'. Send an interactive authorization request for this user and resource
2023-07-28T16:13:14.120090970Z 16:13:14.119 [http-nio-8080-exec-10] DEBUG c.a.identity.EnvironmentCredential - Azure Identity => ERROR in EnvironmentCredential: Missing required environment variable AZURE_CLIENT_ID
2023-07-28T16:13:14.126404402Z 16:13:14.123 [http-nio-8080-exec-10] DEBUG c.a.i.ManagedIdentityCredential - Azure Identity => Found the following environment variables: MSI_ENDPOINT, MSI_SECRET
2023-07-28T16:13:14.130749393Z 16:13:14.127 [http-nio-8080-exec-10] DEBUG c.a.i.SharedTokenCacheCredential - Azure Identity => Found the following environment variables: MSI_ENDPOINT, MSI_SECRET
2023-07-28T16:13:14.143955970Z 16:13:14.141 [http-nio-8080-exec-10] DEBUG i.o.j.i.a.v.s.c.a.c.t.o.OpenTelemetryTracer - Could not extract key 'trace-context' of type 'interface io.opentelemetry.javaagent.shaded.io.opentelemetry.context.Context' from context.
2023-07-28T16:13:14.143978270Z 16:13:14.141 [http-nio-8080-exec-10] DEBUG i.o.j.i.a.v.s.c.a.c.t.o.OpenTelemetryTracer - Could not extract key 'parent-span' of type 'interface io.opentelemetry.javaagent.shaded.io.opentelemetry.api.trace.Span' from context.
2023-07-28T16:13:14.143985371Z 16:13:14.141 [http-nio-8080-exec-10] DEBUG i.o.j.i.a.v.s.c.a.c.t.o.OpenTelemetryTracer - Could not extract key 'trace-context' of type 'interface io.opentelemetry.javaagent.shaded.io.opentelemetry.context.Context' from context.
2023-07-28T16:13:14.143990371Z 16:13:14.141 [http-nio-8080-exec-10] DEBUG i.o.j.i.a.v.s.c.a.c.t.o.OpenTelemetryTracer - Could not extract key 'parent-span' of type 'interface io.opentelemetry.javaagent.shaded.io.opentelemetry.api.trace.Span' from context.
Its strange that this would only fail with New Linux machines but work fine with new linux and also existing machines?
we've tried creating a new Guacamole server but the problem just continues there.
Pipelines have been rerun all the way through to reinstate anything missing like AZURE_CLIENT_ID but no success.
(Famous last words but) nothing has changed in this environment so why something has become missing is a mystery
The text was updated successfully, but these errors were encountered: