Skip to content

/AWS1/CL_NED=>STARTMLMODELTRAININGJOB()

About StartMLModelTrainingJob

Creates a new Neptune ML model training job. See Model training using the modeltraining command.

When invoking this operation in a Neptune cluster that has IAM authentication enabled, the IAM user or role making the request must have a policy attached that allows the neptune-db:StartMLModelTrainingJob IAM action in that cluster.

Method Signature

IMPORTING

Required arguments:

IV_DATAPROCESSINGJOBID TYPE /AWS1/NEDSTRING /AWS1/NEDSTRING

The job ID of the completed data-processing job that has created the data that the training will work with.

IV_TRAINMODELS3LOCATION TYPE /AWS1/NEDSTRING /AWS1/NEDSTRING

The location in Amazon S3 where the model artifacts are to be stored.

Optional arguments:

IV_ID TYPE /AWS1/NEDSTRING /AWS1/NEDSTRING

A unique identifier for the new job. The default is An autogenerated UUID.

IV_PREVIOUSMODELTRNJOBID TYPE /AWS1/NEDSTRING /AWS1/NEDSTRING

The job ID of a completed model-training job that you want to update incrementally based on updated data.

IV_SAGEMAKERIAMROLEARN TYPE /AWS1/NEDSTRING /AWS1/NEDSTRING

The ARN of an IAM role for SageMaker execution.This must be listed in your DB cluster parameter group or an error will occur.

IV_NEPTUNEIAMROLEARN TYPE /AWS1/NEDSTRING /AWS1/NEDSTRING

The ARN of an IAM role that provides Neptune access to SageMaker and Amazon S3 resources. This must be listed in your DB cluster parameter group or an error will occur.

IV_BASEPROCESSINGINSTTYPE TYPE /AWS1/NEDSTRING /AWS1/NEDSTRING

The type of ML instance used in preparing and managing training of ML models. This is a CPU instance chosen based on memory requirements for processing the training data and model.

IV_TRAININGINSTANCETYPE TYPE /AWS1/NEDSTRING /AWS1/NEDSTRING

The type of ML instance used for model training. All Neptune ML models support CPU, GPU, and multiGPU training. The default is ml.p3.2xlarge. Choosing the right instance type for training depends on the task type, graph size, and your budget.

IV_TRNINSTANCEVOLUMESIZEINGB TYPE /AWS1/NEDINTEGER /AWS1/NEDINTEGER

The disk volume size of the training instance. Both input data and the output model are stored on disk, so the volume size must be large enough to hold both data sets. The default is 0. If not specified or 0, Neptune ML selects a disk volume size based on the recommendation generated in the data processing step.

IV_TRAININGTIMEOUTINSECONDS TYPE /AWS1/NEDINTEGER /AWS1/NEDINTEGER

Timeout in seconds for the training job. The default is 86,400 (1 day).

IV_MAXHPONUMBEROFTRNJOBS TYPE /AWS1/NEDINTEGER /AWS1/NEDINTEGER

Maximum total number of training jobs to start for the hyperparameter tuning job. The default is 2. Neptune ML automatically tunes the hyperparameters of the machine learning model. To obtain a model that performs well, use at least 10 jobs (in other words, set maxHPONumberOfTrainingJobs to 10). In general, the more tuning runs, the better the results.

IV_MAXHPOPARALLELTRNJOBS TYPE /AWS1/NEDINTEGER /AWS1/NEDINTEGER

Maximum number of parallel training jobs to start for the hyperparameter tuning job. The default is 2. The number of parallel jobs you can run is limited by the available resources on your training instance.

IT_SUBNETS TYPE /AWS1/CL_NEDSTRINGLIST_W=>TT_STRINGLIST TT_STRINGLIST

The IDs of the subnets in the Neptune VPC. The default is None.

IT_SECURITYGROUPIDS TYPE /AWS1/CL_NEDSTRINGLIST_W=>TT_STRINGLIST TT_STRINGLIST

The VPC security group IDs. The default is None.

IV_VOLUMEENCRYPTIONKMSKEY TYPE /AWS1/NEDSTRING /AWS1/NEDSTRING

The Amazon Key Management Service (KMS) key that SageMaker uses to encrypt data on the storage volume attached to the ML compute instances that run the training job. The default is None.

IV_S3OUTPUTENCRYPTIONKMSKEY TYPE /AWS1/NEDSTRING /AWS1/NEDSTRING

The Amazon Key Management Service (KMS) key that SageMaker uses to encrypt the output of the processing job. The default is none.

IV_ENABLEMANAGEDSPOTTRAINING TYPE /AWS1/NEDBOOLEAN /AWS1/NEDBOOLEAN

Optimizes the cost of training machine-learning models by using Amazon Elastic Compute Cloud spot instances. The default is False.

IO_CUSTOMMODELTRAININGPARAMS TYPE REF TO /AWS1/CL_NEDCUSTMODELTRNPARAMS /AWS1/CL_NEDCUSTMODELTRNPARAMS

The configuration for custom model training. This is a JSON object.

RETURNING

OO_OUTPUT TYPE REF TO /AWS1/CL_NEDSTRTMLMDELTRNJOB01 /AWS1/CL_NEDSTRTMLMDELTRNJOB01