Very long query time with many topics and consumer groups #41
Comments
I am seeing the same behavior. It's so bad that Prometheus is skipping the target because the scrape of the /metrics endpoint is taking too long. It seems to be related to the number of partitions. |
@panda87 Would you mind sharing a link to his fix/branch and I can look into bringing it into |
@zot42 FYI, you can increase that timeout in Prometheus, though. |
@panda87 Never mind. Please ignore my comment, I just saw his PR. ;) |
@JensRantil I used his PR, but I still get errors like this:
|
@panda87 That looks like a different issue than this. Please open a new issue (and specify which version/commit you are running). |
Ok, I will create new issue |
It seems that after pulling the latest changes from this repo I no longer get the errors above, so thanks! Now it's only the response time, which is still high. |
@panda87 Good! I know I saw that error when I was recently revamping some of the parsing logic. |
I noticed the response time being high too. I wonder whether it is actually Kafka that is taking a long time to run, rather than Prometheus. |
I'm pretty sure it is. #47 will help us tell whether that's the case. |
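One way to check where the time goes is to measure a single request to the exporter's /metrics endpoint by hand, then compare it with how long the same consumer-group query takes against Kafka directly. The following is a minimal sketch using only the Python standard library; the helper names `time_call` and `fetch_metrics` are made up for illustration and are not part of the exporter.

```python
import time
import urllib.request


def time_call(fetch):
    """Run fetch() once and return (elapsed_seconds, result)."""
    start = time.perf_counter()
    result = fetch()
    return time.perf_counter() - start, result


def fetch_metrics(url, timeout=120.0):
    """Download a /metrics page and return the raw body as bytes.

    The generous default timeout is an assumption, chosen so that a slow
    scrape is measured rather than cut off early.
    """
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read()
```

For example, `elapsed, body = time_call(lambda: fetch_metrics("http://localhost:9208/metrics"))`, where the address is a placeholder for wherever the exporter listens. If the elapsed time is above the scrape timeout configured in Prometheus, the skipped scrapes described above are expected.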
Any update on this one? |
Unfortunately not. Pull requests are to fix #47. I've been pretty busy lately and haven't had time to get back to this 😥 |
Any update on this? |
Unfortunately not. |
Might be worth mentioning that a colleague of mine claimed that lag is now exposed through JMX. A workaround might be to look at using jmx_exporter instead of this. |
Wait, what? That'd be awesome if it is! Do you know which Kafka version? To clarify: it has always been there on the consumer side, but not on the server side as far as I know. |
@k1ng87, If you're interested in consumer lag it's published via JMX by the consumer:
Replication lag is published by the broker:
See the official Kafka documentation for more details: https://kafka.apache.org/documentation/#monitoring. I checked only version 1.0, the latest one as of now. Hope this helps. |
Hi
I've been using this plugin for a while, and it worked pretty well while I had a small number of consumers.
Today, I added many other consumers and new topics, and 2 things started to appear
max-concurrent-group-queries
to 10 - it just affected my CPU cores and increased the load to 500%
Do you know why this happens?
D.