What Should I Do If No GPU Monitoring Record Is Displayed?
If no GPU monitoring record is displayed on the OS monitoring page when you view server monitoring metrics, check whether your server supports GPUs. If it does and the driver works properly, perform the following operations to rectify the fault.
During the Agent development, the GPU metric collection capability is also constantly improved. Table 1 outlines the metric collection capability of each version.
Version Description | Agent Version |
---|---|
An independent plug-in for collecting GPU metrics is required. | 1.x.x (earlier version) |
The Cloud Eye Agent integrates the GPU metric collection capability. | 2.x.x (new version) |
If you find that no record is displayed on the GPU page, check the version of the installed Agent and select an appropriate solution.
- Confirm the version of the installed Agent.
- Linux
- Log in to an ECS as user root.
- Check the Agent version.
if [[ -f /usr/local/uniagent/extension/install/telescope/bin/telescope ]]; then
/usr/local/uniagent/extension/install/telescope/bin/telescope -v; elif [[ -f /usr/local/telescope/bin/telescope ]]; then echo "old agent"; else echo 0; fi
- If old agent is returned, the early version of the Agent is used. Manage the Agent based on the Agent version.
- If a version is returned, the new version of the Agent is used. Manage the Agent based on the Agent version.
- If 0 is returned, the Agent is not installed.
- Windows: The Agent has two versions: new version and earlier version. Determine the Agent version based on the installation path.
- The default installation path of the new version of the Agent is C:\Program Files\uniagent\extension\install\telescope.
- The default installation path of the Agent (earlier version) is C:\Program Files\telescope.
- Linux
- Solution (earlier version):
- Uninstall the current Agent by running the command in Uninstalling the Agent or Uninstalling the Agent.
- Install the Agent Enhanced Edition: Add .1 to the -t value in the installation command. For example, change -t a.b.c to -t a.b.c.1.
- Solution (new version):
- Uninstall the Agent Basic Edition.
- Linux: Log in to a server and run the bash /usr/local/uniagent/script/uninstall.sh command.
- Windows: In the C:\Program Files\uniagent\script directory where the Agent installation package is stored, double-click the uninstall.bat script.
- Install the Agent Enhanced Edition: Add .1 to the -t value in the installation command. For example, change -t a.b.c to -t a.b.c.1.
Table 2 Basic Edition and Enhanced Edition of the Agent Edition
Description
Basic Edition
Provides basic OS monitoring metrics, such as CPU, memory, disk, and NIC metrics, improving system performance.
Generally, the version number consists of three digits, for example, 2.7.5.
Enhanced Edition
Provides GPU, NPU, and BMS hardware monitoring, in addition to the capabilities of the Basic Edition.
Generally, the version number consists of four digits, for example, 2.7.5.1.
CAUTION:The Agent Enhanced Edition collects more metrics, which may occupy more server resources. Install it as needed.
- Uninstall the Agent Basic Edition.