Troubleshoot
Important
Amazon Elastic Graphics will reach end of life on January 8, 2024. Starting September 5, 2023 the service is no longer accepting new customer
accounts. If you are currently using Amazon Elastic Graphics, we recommend that you migrate your workloads to Amazon EC2 G4ad instances, G4dn instances, or G5 instances at your
earliest convenience. If you have concerns or questions regarding end of life of the service, contact Amazon Web Services Support
The following are common errors and troubleshooting steps.
Contents
Investigate application performance issues
Elastic Graphics uses the instance network to send OpenGL commands to a remotely attached graphics card. In addition, a desktop running an OpenGL application with an Elastic Graphics accelerator is usually accessed using a remote access technology. It is important to distinguish between a performance problem related to the OpenGL rendering or the desktop remote access technology.
OpenGL rendering performance issues
The OpenGL rendering performance is determined by the number of OpenGL commands and frames generated on the remote instance.
Rendering performance may vary depending on the following factors:
-
Elastic Graphics accelerator performance
-
Network performance
-
CPU performance
-
Rendering model, scenario complexity
-
OpenGL application behavior
An easy way to evaluate performance is to display the number of rendered frames on the remote instance. Elastic Graphics accelerators display a maximum of 25 FPS on the remote instance to achieve the best perceived quality while reducing network usage.
To show the number of frames produced
-
Open the following file in a text editor. If the file does not exist, create it.
C:\Program Files\Amazon\EC2ElasticGPUs\conf\eg.conf
-
Identify the
[Application]
section, or add it if it is not present, and add the following configuration parameter:[Application] show_fps=1
-
Restart the application and check the FPS again.
If the FPS reaches 15-25 FPS when updating the rendered scene, then the Elastic Graphics accelerator is performing at peak. Any other performance problems you experience are likely related to the remote access to the instance desktop. If that is the case, see the Remote Access Performance Issues section.
If the FPS number is lower than 15, you can try the following:
-
Improve Elastic Graphics accelerator performance by selecting a more powerful graphics accelerator type.
-
Improve overall network performance by using these tips:
-
Check the amount of incoming and outgoing bandwidth to and from the Elastic Graphics accelerator endpoint. The Elastic Graphics accelerator endpoint can be retrieved with the following PowerShell command:
PS C:\>
(Invoke-WebRequest http://169.254.169.254/latest/meta-data/elastic-gpus/associations/[ELASTICGPU_ID]).content -
The network traffic from the instance to the Elastic Graphics accelerator endpoint relates to the volume of commands the OpenGL application is producing.
-
The network traffic from the Elastic Graphics accelerator endpoint to the instance relates to the number of frames generated by the graphics accelerator.
-
If you see the network usage reaching the instances maximum network throughput, try using an instance with a higher network throughput allowance.
-
-
Improve CPU performance:
-
Applications may require a lot of CPU resources in addition to what the Elastic Graphics accelerator requires. If Windows Task Manager is reporting a high usage of CPU resources, try using an instance with more CPU power.
-
Remote access performance issues
An instance with an attached Elastic Graphics accelerator can be accessed using different remote access technologies. Performance and quality may vary depending on:
-
The remote access technology
-
Instance performance
-
Client performance
-
Network latency and bandwidth between the client and the instance
Possible choices for the remote access protocol include:
-
Microsoft Remote Desktop Connection
-
NICE DCV
-
VNC
For more information about optimization, see the specific protocol.
Resolve unhealthy status issues
If the Elastic Graphics accelerator is in an unhealthy state, use the following troubleshooting steps to resolve the issue.
Check the instance configuration
If the Elastic Graphics command line tool, egcli.exe
, returns output similar
to the following, ensure that your security
group is properly configured and that you launched the instance with Instance
Metadata Service enabled.
EG Version
1.0.7.4240
(Manager) / N/A (OpenGL Library) / N/A (OpenGL Redirector) EG Status: Out Of Service Something prevented the EG Infrastructure to work properly.
Stop and start the instance
If your Elastic Graphics accelerator is in an unhealthy state, stopping the instance and starting it again is the simplest option. For more information, see Manually stop and start an instances.
Warning
When you stop an instance, the data on any instance store volumes is erased. To keep data from instance store volumes, be sure to back it up to persistent storage.
Verify the installed components
Open the Windows Control Panel and confirm that the following components are installed:
-
Amazon Elastic Graphics Manager
-
Amazon Elastic Graphics OpenGL Library
-
Amazon EC2 Elastic GPUs OpenGL Redirector
If any of these items are missing, you must install them manually. For more information, see Install the required software for Elastic Graphics.
Check the Elastic Graphics logs
Open the Windows Event Viewer, expand the Application and Services Logs section, and search for errors in the following event logs:
-
EC2ElasticGPUs
-
EC2ElasticGPUs GUI
Why am I seeing multiple ENIs?
When calling StartInstances on an EC2 instance with an Elastic Graphics accelerator, a new Elastic Network Interface (ENI) is created on the instance to allow OpenGL commands to be sent to the remotely attached graphics card.
If you call StartInstances many times in a short period of time (a few seconds or less) on the same EC2 instance, a new network interface is created on each call. However:
-
Only one network interface will be used by the Elastic Graphics accelerator.
-
Extra network interfaces don't incur any charges and will be automatically released in 24 hours.