Memory increase due to too many traces #590

Closed
ChrisTerBeke opened this issue Apr 1, 2019 · 4 comments

ChrisTerBeke commented Apr 1, 2019

Hi,

We're using this lib in several APIs running on GKE, exporting to StackDriver. We noticed that when sending all traces to StackDriver, traces are collected faster than they can be exported. As a result, the list of trace spans still waiting to be sent grows over time, to the point where Kubernetes kills our pod for using too much memory.

While this is expected behavior given the current code, I wonder if there is a cleaner way to handle it. For example, by dropping spans once the list reaches a maximum size and logging a warning or error (see the sketch at the end of this comment)? This would prevent apps from 'leaking' memory.

What are your thoughts about this?

P.S. This is probably the same issue as reported in #334, but it's not really resolved there.
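
To make the proposal concrete, here is a minimal sketch of a bounded span buffer that drops new spans and logs a warning once it is full. The class and parameter names are hypothetical, not part of the library:

```python
import logging
from collections import deque

logger = logging.getLogger(__name__)


class BoundedSpanBuffer:
    """Hypothetical buffer that stops growing once max_spans is reached."""

    def __init__(self, max_spans=10000):
        self._max_spans = max_spans
        self._spans = deque()
        self._dropped = 0

    def add(self, span):
        if len(self._spans) >= self._max_spans:
            self._dropped += 1
            logger.warning(
                "Span buffer full (%d spans), dropping span (%d dropped so far)",
                self._max_spans, self._dropped)
            return
        self._spans.append(span)

    def drain(self, batch_size=100):
        """Return up to batch_size spans for the exporter to send."""
        return [self._spans.popleft()
                for _ in range(min(batch_size, len(self._spans)))]
```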

c24t (Member) commented Apr 1, 2019

Configuring samplers to have a global rate limit may help (see #458), but I agree that memory shouldn't grow unbounded by default. Dropping traces seems like a good solution to me.

As for your application: are you sampling every trace? You might want to use the ProbabilitySampler instead.
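
For reference, wiring up the ProbabilitySampler looks roughly like this; import paths vary between opencensus-python releases, and the project id is a placeholder:

```python
from opencensus.trace.tracer import Tracer
from opencensus.trace.samplers import probability
from opencensus.trace.exporters import stackdriver_exporter

# Placeholder project id; use your own GCP project.
exporter = stackdriver_exporter.StackdriverExporter(project_id='my-gcp-project')

# Sample roughly 1 in 10 traces instead of every trace, so far fewer
# spans queue up waiting to be exported.
sampler = probability.ProbabilitySampler(rate=0.1)

tracer = Tracer(exporter=exporter, sampler=sampler)

with tracer.span(name='handle_request'):
    pass  # application work goes here
```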

#334 is a different issue; there, the library wasn't cleaning up monitoring clients.

c24t (Member) commented Apr 1, 2019

See census-instrumentation/opencensus-java#1813 for a similar issue in the Java client.

ChrisTerBeke (Author)

We were sampling every trace, but have since switched to the ProbabilitySampler, which solves the problem.

Glad to hear you also think this is something to improve :)

c24t (Member) commented May 2, 2019

@reyang's work on #642 should fix the memory issue by dropping spans once the queue is full.
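
In outline, that approach looks something like the sketch below (not the actual #642 implementation; the queue size and function name are placeholders):

```python
import queue

span_queue = queue.Queue(maxsize=2048)  # arbitrary bound for the example

def enqueue_span(span):
    try:
        span_queue.put_nowait(span)
    except queue.Full:
        # The exporter is falling behind; drop the span rather than
        # letting memory grow without bound.
        pass
```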
