Errors when starting to perform the first collection #539
Comments
I checked out version 1.5.2 and built another container to try it, but I still can't get the data collected as before. Attached is the console log from running the new container. |
@fulwang If you use version 1.5.2 of the exporter, you also need to use the metric definition file for that version. The warning in your 1.5.2 log:
Is caused by using a metric definition file that uses the resource object in its conditions, with an exporter version that does not yet have that support. On your original error with 1.7.0.dev1: There are two main errors there:
I have never seen this before and have started a dialogue with the Z development team on that.
That is an error in the exporter code, but to find that it would be very helpful to get an exporter log file. -> Could you please run this version of the exporter again and add the following options to its command line: |
@andy-maier Thanks for the analysis. |
@andy-maier To rerun v1.7.0, do I need to rebuild the container to add the log options you mentioned, or is it enough to add them to the podman command line? |
I just scheduled a run by adding the options on the command line:
[root@lpar27 ~]# podman run -itd -v /opt/zhmcexporter:/root/myconfig -p 9291:9291 --name zhmcexporter zhmcexporter:v1.7.0 -c /root/myconfig/hmccreds.yaml -v --log-comp all=debug --log exporter.log |
podman passes everything on the command line after the image name through to the invoked container, so your podman command line looks good to me. |
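To illustrate the pass-through described above, here is a minimal sketch that splits a `podman run` command line at the image name. The helper name `split_podman_run` is made up for this illustration and is not part of podman or the exporter; it only assumes that the image name appears exactly once as its own token.

```python
import shlex


def split_podman_run(command_line, image):
    """Split a 'podman run' command line at the image name.

    Hypothetical helper for illustration only: everything before the image
    is consumed by podman itself, everything after it is handed to the
    program started inside the container.
    """
    tokens = shlex.split(command_line)
    idx = tokens.index(image)  # raises ValueError if the image is missing
    return tokens[:idx], tokens[idx + 1:]


cmd = (
    "podman run -itd -v /opt/zhmcexporter:/root/myconfig -p 9291:9291 "
    "--name zhmcexporter zhmcexporter:v1.7.0 "
    "-c /root/myconfig/hmccreds.yaml -v --log-comp all=debug --log exporter.log"
)
podman_part, exporter_part = split_podman_run(cmd, "zhmcexporter:v1.7.0")
print(exporter_part)
```

Note that the first `-v` is a podman volume option, while the second `-v` (after the image name) reaches the exporter.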
The metric definition file for a specific exporter version can be downloaded from the repo by selecting the tag for that version. For example, for version 1.5.2, this is the repo at that version: https://github.com/zhmcclient/zhmc-prometheus-exporter/tree/1.5.2, and the sample metric file for that version is https://github.com/zhmcclient/zhmc-prometheus-exporter/blob/1.5.2/examples/metrics.yaml I don't know how you build your container image, or whether you have the metric definition file in the image (as opposed to mounting its directory). If you have it in the image (which I think is the case given your podman command line), then you need to rebuild your image, and then you probably already have a |
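As a small sketch of the URL pattern mentioned above, the following builds the repo and sample metric file URLs for a given version tag. The function names are invented for this example, and the default path `examples/metrics.yaml` is only confirmed in this thread for version 1.5.2; for other versions it is an assumption.

```python
REPO_URL = "https://github.com/zhmcclient/zhmc-prometheus-exporter"


def repo_url_at(version):
    """Return the URL of the repo as of the given version tag."""
    return f"{REPO_URL}/tree/{version}"


def metric_file_url(version, path="examples/metrics.yaml"):
    """Return the URL of the sample metric definition file for a version.

    Assumption: the file lives at the same path in every version; the
    thread only confirms this for 1.5.2.
    """
    return f"{REPO_URL}/blob/{version}/{path}"


print(repo_url_at("1.5.2"))
print(metric_file_url("1.5.2"))
```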
zhmcexporter_1.log cd /root |
@andy-maier Could this be a problem on the HMC side? The physical server was shut down for several days due to a cooling system malfunction and was powered on again last week. I can see many errors, including "HTTPError: 409,272: Unable to obtain STP configuration data, rc=[0x1000] [GET /api/cpcs/348762ef-90df-36c2-ae18-8dd2abf730b4]". I have checked the HMC user, and the "Web Services API" option is still checked as before. |
@fulwang The errors "Unable to obtain STP configuration data" are not severe; they only cause the "cpc" label not to be added to metrics for some types of resources. Having said that, I suggest configuring STP on that HMC so that this error goes away. Let's walk through the errors in the zhmcexporter_1.log file you attached above:
|
@fulwang If you use the "make build" command, then it uses the Dockerfile in the repo. That Dockerfile gets the metrics.yaml file from Your commands shown above first check out version 1.5.2, and then replace the So your commands should be (after removing
|
@andy-maier I built with the latest code and ran it in the test environment a moment ago. Here is the log for your review. |
@andy-maier I realized this later and built the container image using the source code (tar.gz download from your repo) yesterday. |
@andy-maier How can we customize the metrics.yaml to exclude CPC BZ17 from data collection? We just need to ignore it. Enabling auto-update for CPC BZ17 |
I think we should release version 1.7.0 now - the remaining two issues (STP config, and too many threads) cannot be solved by the exporter. To avoid the too many threads error, I suggest disabling the following metric groups in the metric definition file (set
If that causes the error to go away, you can gradually enable the metric groups again, starting from the top of the list. |
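The step-by-step re-enabling suggested above can be sketched as a simple linear search. This is only an illustration: `find_first_failing_group`, the `error_occurs` callback, and the group names are hypothetical, and in practice "checking" means editing the metric definition file, restarting the exporter, and watching its log.

```python
def find_first_failing_group(disabled_groups, error_occurs):
    """Re-enable groups one at a time, in list order, until the error returns.

    disabled_groups: group names in the order they should be re-enabled.
    error_occurs: hypothetical callback that takes the list of currently
        enabled groups and returns True if the error shows up again.

    Returns (culprit, safe_groups); culprit is None if all groups could be
    re-enabled without the error.
    """
    enabled = []
    for group in disabled_groups:
        enabled.append(group)
        if error_occurs(enabled):
            return group, enabled[:-1]
    return None, enabled


# Usage with made-up group names and a simulated check:
groups = ["group-a", "group-b", "group-c"]
culprit, safe = find_first_failing_group(groups, lambda en: "group-b" in en)
print(culprit, safe)
```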
@fulwang The "too many threads" error happens when the HMC user has more than 25 requests open at the WS-API that are being processed (i.e. request sent, but not yet complete). I think that also applies to asynchronous operations whose jobs are not yet complete. The exporter can have a maximum of 2 concurrent HMC requests open (the main thread and a background fetch thread; each waits for its operation to complete before starting the next one). Are you using the HMC userid for other tasks that run at the same time? Could you please post a log file (with |
@andy-maier I have built an image with your latest code, and it is now running in the test environment for debugging purposes. Please let me know when to send you the logs or any other information you need. |
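If other tools share the same HMC userid, one way to stay under a limit on open requests is to cap the number of requests in flight on the client side. The following is a generic sketch using a semaphore; `RequestLimiter` and `demo` are made-up names, the request is simulated with a short sleep, and none of this is the exporter's actual code.

```python
import threading
import time


class RequestLimiter:
    """Cap the number of requests that are in flight at the same time."""

    def __init__(self, max_in_flight):
        self._slots = threading.BoundedSemaphore(max_in_flight)
        self._lock = threading.Lock()
        self._in_flight = 0
        self.peak = 0  # highest number of concurrent requests observed

    def run(self, request):
        """Run request() once a slot is free and return its result."""
        with self._slots:
            with self._lock:
                self._in_flight += 1
                self.peak = max(self.peak, self._in_flight)
            try:
                return request()
            finally:
                with self._lock:
                    self._in_flight -= 1


def demo(num_callers=10, max_in_flight=2):
    """Start more callers than slots and report the observed peak."""
    limiter = RequestLimiter(max_in_flight)

    def fake_request():
        time.sleep(0.01)  # stands in for a real HTTP request
        return "ok"

    threads = [
        threading.Thread(target=limiter.run, args=(fake_request,))
        for _ in range(num_callers)
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return limiter.peak


print(demo())
```

The observed peak never exceeds `max_in_flight`, no matter how many callers are started.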
@fulwang |
FuLong confirmed that the "too many threads" error did not show up anymore. I am closing this ticket now. Please open a new one if there are other issues. |
Describe the bug
Cannot collect any data after starting the container.
Expected behavior
Data can be collected and queried in the browser.
To Reproduce
Start the container with the command below:
podman run -itd -v /opt/zhmcexporter:/root/myconfig -p 9291:9291 --name zhmcexporter zhmcexporter:latest -c /root/myconfig/hmccreds.yaml -v
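A quick way to check from a script whether data is being collected is to fetch the exporter's output and count the samples. This is a rough sketch: it assumes the exporter is reachable on the published port 9291 at `http://localhost:9291/metrics` (the host and path are assumptions, not taken from this thread), and the function names are invented for the example.

```python
from urllib.request import urlopen


def count_samples(exposition_text):
    """Count sample lines in Prometheus text output.

    Blank lines and comment lines (# HELP, # TYPE) are skipped.
    """
    return sum(
        1
        for line in exposition_text.splitlines()
        if line.strip() and not line.startswith("#")
    )


def fetch_metrics(url="http://localhost:9291/metrics", timeout=10):
    """Fetch the exporter output; the URL is an assumption for this sketch."""
    with urlopen(url, timeout=timeout) as resp:
        return resp.read().decode("utf-8")


# Example with canned text instead of a live exporter:
sample_text = "# HELP x Example metric\n# TYPE x gauge\nx 1\n\n"
print(count_samples(sample_text))
```

With a running container, `count_samples(fetch_metrics())` returning 0 would match the symptom described in this issue.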
Environment information
zhmc_prometheus_exporter version: 1.7.0.dev1
zhmcclient version: 1.17.0
Verbosity level: 1
HMC certificate validation: False
HMC version: 2.15.0
HMC API version: 3.13
HMC features: None
Command output
Log file
zhmcexporter.log