AWS::SageMaker::InferenceComponent

Creates an inference component, which is a SageMaker AI hosting object that you can use to deploy a model to an endpoint. In the inference component settings, you specify the model, the endpoint, and how the model utilizes the resources that the endpoint hosts. You can optimize resource utilization by tailoring how the required CPU cores, accelerators, and memory are allocated. You can deploy multiple inference components to an endpoint, where each inference component contains one model and the resource utilization needs for that individual model. After you deploy an inference component, you can directly invoke the associated model when you use the InvokeEndpoint API action.
import { CfnInferenceComponent } from 'aws-cdk-lib/aws-sagemaker';

Or use the module namespace:
import * as sagemaker from 'aws-cdk-lib/aws-sagemaker';
// sagemaker.CfnInferenceComponent

Configuration

Passed to the constructor as CfnInferenceComponentProps.
endpointName (Required, string): The name of the endpoint that hosts the inference component.
specification (Required, IResolvable | InferenceComponentSpecificationProperty): The specification for the inference component.
deploymentConfig (Optional, IResolvable | InferenceComponentDeploymentConfigProperty): The deployment configuration for an endpoint, which contains the desired deployment strategy and rollback configurations.
endpointArn (Optional, string): The Amazon Resource Name (ARN) of the endpoint that hosts the inference component.
inferenceComponentName (Optional, string): The name of the inference component.
runtimeConfig (Optional, IResolvable | InferenceComponentRuntimeConfigProperty): The runtime config for the inference component.
tags (Optional, CfnTag[]): An array of tags to apply to the resource.
variantName (Optional, string): The name of the production variant that hosts the inference component.
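
The properties above can be assembled into a props object along the following lines. This is a minimal sketch rather than a definitive implementation: the interface `InferenceComponentPropsSketch` and the helper `buildInferenceComponentProps` are hypothetical and only mirror the top-level fields listed here so the snippet runs without aws-cdk-lib installed. The nested field names (`modelName`, `computeResourceRequirements`, `numberOfCpuCoresRequired`, `minMemoryRequiredInMb`, `copyCount`) are assumptions based on the CloudFormation resource and should be checked against the aws-cdk-lib typings. All names and values such as `my-endpoint` and `AllTraffic` are placeholders.

```typescript
// Local mirror of some top-level props listed above, so this sketch compiles
// without aws-cdk-lib. In a real stack, use CfnInferenceComponentProps instead.
interface InferenceComponentPropsSketch {
  endpointName: string;                    // required
  specification: Record<string, unknown>;  // required
  inferenceComponentName?: string;
  variantName?: string;
  runtimeConfig?: Record<string, unknown>;
  tags?: { key: string; value: string }[];
}

// Hypothetical helper: builds props for one model hosted on a shared endpoint.
function buildInferenceComponentProps(
  endpointName: string,
  modelName: string,
  copies: number,
): InferenceComponentPropsSketch {
  if (!endpointName) {
    throw new Error('endpointName is required');
  }
  return {
    endpointName,
    inferenceComponentName: `${modelName}-component`,
    variantName: 'AllTraffic', // placeholder production variant name
    specification: {
      modelName, // assumed nested field name
      computeResourceRequirements: {
        // assumed nested field names; values are illustrative only
        numberOfCpuCoresRequired: 2,
        minMemoryRequiredInMb: 1024,
      },
    },
    runtimeConfig: { copyCount: copies }, // assumed nested field name
    tags: [{ key: 'project', value: 'demo' }],
  };
}

const props = buildInferenceComponentProps('my-endpoint', 'my-model', 1);
console.log(JSON.stringify(props, null, 2));

// Inside a CDK stack (requires aws-cdk-lib), the object would be passed as:
//   new sagemaker.CfnInferenceComponent(this, 'MyComponent', props);
```

Keeping the props in a plain helper like this makes it easy to create several inference components for the same endpoint, one per model, each with its own resource requirements.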
This L1 construct maps directly to the following CloudFormation resource type.
AWS::SageMaker::InferenceComponent