AWS Fundamentals Logo
AWS Fundamentals
L1 ConstructAWS::SageMaker::InferenceComponent

CfnInferenceComponent

Creates an inference component, which is a SageMaker AI hosting object that you can use to deploy a model to an endpoint. In the inference component settings, you specify the model, the endpoint, and how the model utilizes the resources that the endpoint hosts. You can optimize resource utilization by tailoring how the required CPU cores, accelerators, and memory are allocated. You can deploy multiple inference components to an endpoint, where each inference component contains one model and the resource utilization needs for that individual model. After you deploy an inference component, you can directly invoke the associated model when you use the InvokeEndpoint API action.

Import

import { CfnInferenceComponent } from 'aws-cdk-lib/aws-sagemaker';

Or use the module namespace:

import * as sagemaker from 'aws-cdk-lib/aws-sagemaker';
// sagemaker.CfnInferenceComponent

Properties

Configuration passed to the constructor as CfnInferenceComponentProps.

endpointNameRequired
string

The name of the endpoint that hosts the inference component.

specificationRequired
IResolvable | InferenceComponentSpecificationProperty

The specification for the inference component.

deploymentConfigOptional
IResolvable | InferenceComponentDeploymentConfigProperty

The deployment configuration for an endpoint, which contains the desired deployment strategy and rollback configurations.

endpointArnOptional
string

The Amazon Resource Name (ARN) of the endpoint that hosts the inference component.

inferenceComponentNameOptional
string

The name of the inference component.

runtimeConfigOptional
IResolvable | InferenceComponentRuntimeConfigProperty

The runtime config for the inference component.

tagsOptional
CfnTag[]

An array of tags to apply to the resource.

variantNameOptional
string

The name of the production variant that hosts the inference component.

CloudFormation Resource

This L1 construct maps directly to the following CloudFormation resource type.

Learn AWS the Practical Way

Our bi-weekly newsletter teaches hands-on AWS fundamentals. No certification fluff - just practical knowledge.

Subscribe to Newsletter