Unify the Elastic Agent and the Horde agent emulator used in scale testing #2169

cmacknz · 2023-01-24T19:55:27Z

Horde is our internal framework used for scale testing the agent and Fleet. It currently implements a much more lightweight emulated agent. This allows us to spin up many thousands of agents on a single host, but has the major drawback of not actually testing the agent itself.

We should work towards making it possible to share code between the agent itself and Horde. This could be done by extracting internal packages of the agent and reusing them in Horde, but it is more appealing to produce a distribution of the agent specifically meant for scale testing, to ensure the code is always in sync and used in the same way. We are most interested in directly testing the Fleet gateway code, action handling, and upgrades.

The initial idea for this would be to build a lightweight variant of the Elastic Agent that does not need to be installed to run (does not depend on any internal directory structure), can be started multiple times on the same host, and is not actually capable of starting and managing subprocesses. Possibly we produce a single binary that can automatically launch many instances of this simple agent as goroutines.

joshdover · 2023-05-05T15:04:54Z

elastic/fleet-server#2519 (comment) is another case where this is biting us. We cannot reasonably rely on Horde to emulate Agent successfully while we maintain two implementations of the Fleet Server client code. Any small difference in behavior has quality and scalability implications, as we've now seen several times.

We are going to be relying on the scaling suite to verify agent scale as part of every release to Serverless, making this even more critical.

@jlind23 @amitkanfer I'd like to consider putting this in sprint 13.

amitkanfer · 2023-05-05T16:51:24Z

fine by me.

cmacknz · 2023-05-08T15:17:08Z

I'm setting @pchila as the preliminary assignee here; we just spoke about this one. Paolo has recently been working with the code that needs to be shared, and he has some ideas about how to improve the testability of the agent in general with the changes that will be needed here.

juliaElastic · 2024-01-11T14:18:48Z

@pierrehilbert @cmacknz Is this going to be a priority anytime soon?

pierrehilbert · 2024-01-17T13:21:53Z

Sorry, I missed this ping.
This is still among our technical priorities but won't be addressed soon.

elasticmachine · 2024-06-03T15:58:33Z

Pinging @elastic/elastic-agent-control-plane (Team:Elastic-Agent-Control-Plane)

cmacknz added the Team:Elastic-Agent label (Label for the Agent team) Jan 24, 2023

cmacknz mentioned this issue Jan 24, 2023

Investigate allowing the agent to check in more frequently when the agent status changes #1946

Open

cmacknz assigned pchila May 8, 2023

cmacknz mentioned this issue Oct 3, 2023

Track and report upgrade details #3119

Closed

3 tasks

pierrehilbert added the Team:Elastic-Agent-Control-Plane label (Label for the Agent Control Plane team) Jun 3, 2024

jlind23 unassigned pchila Jun 13, 2024
