aws glue265 CLI commands available for AWS Glue.
| Command | API Operation | Sample |
|---|---|---|
batch-create-partitionCreates one or more partitions in a batch operation. | BatchCreatePartition | |
batch-delete-connectionDeletes a list of connection definitions from the Data Catalog. | BatchDeleteConnection | |
batch-delete-partitionDeletes one or more partitions in a batch operation. | BatchDeletePartition | |
batch-delete-tableDeletes multiple tables at once. After completing this operation, you no longer have access to the table versions and partitions that belong to the deleted table. Glue deletes these "orphaned" resources asynchronously in a timely manner, at the discretion of the service. To ensure the immediate del | BatchDeleteTable | |
batch-delete-table-versionDeletes a specified batch of versions of a table. | BatchDeleteTableVersion | |
batch-get-blueprintsRetrieves information about a list of blueprints. | BatchGetBlueprints | |
batch-get-crawlersReturns a list of resource metadata for a given list of crawler names. After calling the ListCrawlers operation, you can call this operation to access the data to which you have been granted permissions. This operation supports all IAM permissions, including permission conditions that uses tags. | BatchGetCrawlers | |
batch-get-custom-entity-typesRetrieves the details for the custom patterns specified by a list of names. | BatchGetCustomEntityTypes | |
batch-get-data-quality-resultRetrieves a list of data quality results for the specified result IDs. | BatchGetDataQualityResult | |
batch-get-dev-endpointsReturns a list of resource metadata for a given list of development endpoint names. After calling the ListDevEndpoints operation, you can call this operation to access the data to which you have been granted permissions. This operation supports all IAM permissions, including permission conditions th | BatchGetDevEndpoints | |
batch-get-jobsReturns a list of resource metadata for a given list of job names. After calling the ListJobs operation, you can call this operation to access the data to which you have been granted permissions. This operation supports all IAM permissions, including permission conditions that uses tags. | BatchGetJobs | |
batch-get-partitionRetrieves partitions in a batch request. | BatchGetPartition | |
batch-get-table-optimizerReturns the configuration for the specified table optimizers. | BatchGetTableOptimizer | |
batch-get-triggersReturns a list of resource metadata for a given list of trigger names. After calling the ListTriggers operation, you can call this operation to access the data to which you have been granted permissions. This operation supports all IAM permissions, including permission conditions that uses tags. | BatchGetTriggers | |
batch-get-workflowsReturns a list of resource metadata for a given list of workflow names. After calling the ListWorkflows operation, you can call this operation to access the data to which you have been granted permissions. This operation supports all IAM permissions, including permission conditions that uses tags. | BatchGetWorkflows | |
batch-put-data-quality-statistic-annotationAnnotate datapoints over time for a specific data quality statistic. The API requires both profileID and statisticID as part of the InclusionAnnotation input. The API only works for a single statisticId across multiple profiles. | BatchPutDataQualityStatisticAnnotation | |
batch-stop-job-runStops one or more job runs for a specified job definition. | BatchStopJobRun | |
batch-update-partitionUpdates one or more partitions in a batch operation. | BatchUpdatePartition | |
cancel-data-quality-rule-recommendation-runCancels the specified recommendation run that was being used to generate rules. | CancelDataQualityRuleRecommendationRun | |
cancel-data-quality-ruleset-evaluation-runCancels a run where a ruleset is being evaluated against a data source. | CancelDataQualityRulesetEvaluationRun | |
cancel-ml-task-runCancels (stops) a task run. Machine learning task runs are asynchronous tasks that Glue runs on your behalf as part of various machine learning workflows. You can cancel a machine learning task run at any time by calling CancelMLTaskRun with a task run's parent transform's TransformID and the task r | CancelMLTaskRun | |
cancel-statementCancels the statement. | CancelStatement | |
check-schema-version-validityValidates the supplied schema. This call has no side effects, it simply validates using the supplied schema using DataFormat as the format. Since it does not take a schema set name, no compatibility checks are performed. | CheckSchemaVersionValidity | |
create-blueprintRegisters a blueprint with Glue. | CreateBlueprint | |
create-catalogCreates a new catalog in the Glue Data Catalog. | CreateCatalog | |
create-classifierCreates a classifier in the user's account. This can be a GrokClassifier, an XMLClassifier, a JsonClassifier, or a CsvClassifier, depending on which field of the request is present. | CreateClassifier | |
create-column-statistics-task-settingsCreates settings for a column statistics task. | CreateColumnStatisticsTaskSettings | |
create-connectionCreates a connection definition in the Data Catalog. Connections used for creating federated resources require the IAM glue:PassConnection permission. | CreateConnection | |
create-crawlerCreates a new crawler with specified targets, role, configuration, and optional schedule. At least one crawl target must be specified, in the s3Targets field, the jdbcTargets field, or the DynamoDBTargets field. | CreateCrawler | |
create-custom-entity-typeCreates a custom pattern that is used to detect sensitive data across the columns and rows of your structured data. Each custom pattern you create specifies a regular expression and an optional list of context words. If no context words are passed only a regular expression is checked. | CreateCustomEntityType | |
create-data-quality-rulesetCreates a data quality ruleset with DQDL rules applied to a specified Glue table. You create the ruleset using the Data Quality Definition Language (DQDL). For more information, see the Glue developer guide. | CreateDataQualityRuleset | |
create-databaseCreates a new database in a Data Catalog. | CreateDatabase | |
create-dev-endpointCreates a new development endpoint. | CreateDevEndpoint | |
create-glue-identity-center-configurationCreates a new Glue Identity Center configuration to enable integration between Glue and Amazon Web Services IAM Identity Center for authentication and authorization. | CreateGlueIdentityCenterConfiguration | |
create-integrationCreates a Zero-ETL integration in the caller's account between two resources with Amazon Resource Names (ARNs): the SourceArn and TargetArn. | CreateIntegration | |
create-integration-resource-propertyThis API can be used for setting up the ResourceProperty of the Glue connection (for the source) or Glue database ARN (for the target). These properties can include the role to access the connection or database. To set both source and target properties the same API needs to be invoked with the Glue | CreateIntegrationResourceProperty | |
create-integration-table-propertiesThis API is used to provide optional override properties for the the tables that need to be replicated. These properties can include properties for filtering and partitioning for the source and target tables. To set both source and target properties the same API need to be invoked with the Glue conn | CreateIntegrationTableProperties | |
create-jobCreates a new job definition. | CreateJob | |
create-ml-transformCreates an Glue machine learning transform. This operation creates the transform and all the necessary parameters to train it. Call this operation as the first step in the process of using a machine learning transform (such as the FindMatches transform) for deduplicating data. You can provide an opt | CreateMLTransform | |
create-partitionCreates a new partition. | CreatePartition | |
create-partition-indexCreates a specified partition index in an existing table. | CreatePartitionIndex | |
create-registryCreates a new registry which may be used to hold a collection of schemas. | CreateRegistry | |
create-schemaCreates a new schema set and registers the schema definition. Returns an error if the schema set already exists without actually registering the version. When the schema set is created, a version checkpoint will be set to the first version. Compatibility mode "DISABLED" restricts any additional sche | CreateSchema | |
create-scriptTransforms a directed acyclic graph (DAG) into code. | CreateScript | |
create-security-configurationCreates a new security configuration. A security configuration is a set of security properties that can be used by Glue. You can use a security configuration to encrypt data at rest. For information about using security configurations in Glue, see Encrypting Data Written by Crawlers, Jobs, and Devel | CreateSecurityConfiguration | |
create-sessionCreates a new session. | CreateSession | |
create-tableCreates a new table definition in the Data Catalog. | CreateTable | |
create-table-optimizerCreates a new table optimizer for a specific function. | CreateTableOptimizer | |
create-triggerCreates a new trigger. Job arguments may be logged. Do not pass plaintext secrets as arguments. Retrieve secrets from a Glue Connection, Amazon Web Services Secrets Manager or other secret management mechanism if you intend to keep them within the Job. | CreateTrigger | |
create-usage-profileCreates an Glue usage profile. | CreateUsageProfile | |
create-user-defined-functionCreates a new function definition in the Data Catalog. | CreateUserDefinedFunction | |
create-workflowCreates a new workflow. | CreateWorkflow | |
delete-blueprintDeletes an existing blueprint. | DeleteBlueprint | |
delete-catalogRemoves the specified catalog from the Glue Data Catalog. After completing this operation, you no longer have access to the databases, tables (and all table versions and partitions that might belong to the tables) and the user-defined functions in the deleted catalog. Glue deletes these "orphaned" r | DeleteCatalog | |
delete-classifierRemoves a classifier from the Data Catalog. | DeleteClassifier | |
delete-column-statistics-for-partitionDelete the partition column statistics of a column. The Identity and Access Management (IAM) permission required for this operation is DeletePartition. | DeleteColumnStatisticsForPartition | |
delete-column-statistics-for-tableRetrieves table statistics of columns. The Identity and Access Management (IAM) permission required for this operation is DeleteTable. | DeleteColumnStatisticsForTable | |
delete-column-statistics-task-settingsDeletes settings for a column statistics task. | DeleteColumnStatisticsTaskSettings | |
delete-connectionDeletes a connection from the Data Catalog. | DeleteConnection | |
delete-connection-typeDeletes a custom connection type in Glue. The connection type must exist and be registered before it can be deleted. This operation supports cleanup of connection type resources and helps maintain proper lifecycle management of custom connection types. | DeleteConnectionType | |
delete-crawlerRemoves a specified crawler from the Glue Data Catalog, unless the crawler state is RUNNING. | DeleteCrawler | |
delete-custom-entity-typeDeletes a custom pattern by specifying its name. | DeleteCustomEntityType | |
delete-data-quality-rulesetDeletes a data quality ruleset. | DeleteDataQualityRuleset | |
delete-databaseRemoves a specified database from a Data Catalog. After completing this operation, you no longer have access to the tables (and all table versions and partitions that might belong to the tables) and the user-defined functions in the deleted database. Glue deletes these "orphaned" resources asynchro | DeleteDatabase | |
delete-dev-endpointDeletes a specified development endpoint. | DeleteDevEndpoint | |
delete-glue-identity-center-configurationDeletes the existing Glue Identity Center configuration, removing the integration between Glue and Amazon Web Services IAM Identity Center. | DeleteGlueIdentityCenterConfiguration | |
delete-integrationDeletes the specified Zero-ETL integration. | DeleteIntegration | |
delete-integration-resource-propertyThis API is used for deleting the ResourceProperty of the Glue connection (for the source) or Glue database ARN (for the target). | DeleteIntegrationResourceProperty | |
delete-integration-table-propertiesDeletes the table properties that have been created for the tables that need to be replicated. | DeleteIntegrationTableProperties | |
delete-jobDeletes a specified job definition. If the job definition is not found, no exception is thrown. | DeleteJob | |
delete-ml-transformDeletes an Glue machine learning transform. Machine learning transforms are a special type of transform that use machine learning to learn the details of the transformation to be performed by learning from examples provided by humans. These transformations are then saved by Glue. If you no longer ne | DeleteMLTransform | |
delete-partitionDeletes a specified partition. | DeletePartition | |
delete-partition-indexDeletes a specified partition index from an existing table. | DeletePartitionIndex | |
delete-registryDelete the entire registry including schema and all of its versions. To get the status of the delete operation, you can call the GetRegistry API after the asynchronous call. Deleting a registry will deactivate all online operations for the registry such as the UpdateRegistry, CreateSchema, UpdateSch | DeleteRegistry | |
delete-resource-policyDeletes a specified policy. | DeleteResourcePolicy | |
delete-schemaDeletes the entire schema set, including the schema set and all of its versions. To get the status of the delete operation, you can call GetSchema API after the asynchronous call. Deleting a registry will deactivate all online operations for the schema, such as the GetSchemaByDefinition, and Registe | DeleteSchema | |
delete-schema-versionsRemove versions from the specified schema. A version number or range may be supplied. If the compatibility mode forbids deleting of a version that is necessary, such as BACKWARDS_FULL, an error is returned. Calling the GetSchemaVersions API after this call will list the status of the deleted version | DeleteSchemaVersions | |
delete-security-configurationDeletes a specified security configuration. | DeleteSecurityConfiguration | |
delete-sessionDeletes the session. | DeleteSession | |
delete-tableRemoves a table definition from the Data Catalog. After completing this operation, you no longer have access to the table versions and partitions that belong to the deleted table. Glue deletes these "orphaned" resources asynchronously in a timely manner, at the discretion of the service. To ensure | DeleteTable | |
delete-table-optimizerDeletes an optimizer and all associated metadata for a table. The optimization will no longer be performed on the table. | DeleteTableOptimizer | |
delete-table-versionDeletes a specified version of a table. | DeleteTableVersion | |
delete-triggerDeletes a specified trigger. If the trigger is not found, no exception is thrown. | DeleteTrigger | |
delete-usage-profileDeletes the Glue specified usage profile. | DeleteUsageProfile | |
delete-user-defined-functionDeletes an existing function definition from the Data Catalog. | DeleteUserDefinedFunction | |
delete-workflowDeletes a workflow. | DeleteWorkflow | |
describe-connection-typeThe DescribeConnectionType API provides full details of the supported options for a given connection type in Glue. The response includes authentication configuration details that show supported authentication types and properties, and RestConfiguration for custom REST-based connection types register | DescribeConnectionType | |
describe-entityProvides details regarding the entity used with the connection type, with a description of the data model for each field in the selected entity. The response includes all the fields which make up the entity. | DescribeEntity | |
describe-inbound-integrationsReturns a list of inbound integrations for the specified integration. | DescribeInboundIntegrations | |
describe-integrationsThe API is used to retrieve a list of integrations. | DescribeIntegrations | |
get-blueprintRetrieves the details of a blueprint. | GetBlueprint | |
get-blueprint-runRetrieves the details of a blueprint run. | GetBlueprintRun | |
get-blueprint-runsRetrieves the details of blueprint runs for a specified blueprint. | GetBlueprintRuns | |
get-catalogThe name of the Catalog to retrieve. This should be all lowercase. | GetCatalog | |
get-catalog-import-statusRetrieves the status of a migration operation. | GetCatalogImportStatus | |
get-catalogsRetrieves all catalogs defined in a catalog in the Glue Data Catalog. For a Redshift-federated catalog use case, this operation returns the list of catalogs mapped to Redshift databases in the Redshift namespace catalog. | GetCatalogs | |
get-classifierRetrieve a classifier by name. | GetClassifier | |
get-classifiersLists all classifier objects in the Data Catalog. | GetClassifiers | |
get-column-statistics-for-partitionRetrieves partition statistics of columns. The Identity and Access Management (IAM) permission required for this operation is GetPartition. | GetColumnStatisticsForPartition | |
get-column-statistics-for-tableRetrieves table statistics of columns. The Identity and Access Management (IAM) permission required for this operation is GetTable. | GetColumnStatisticsForTable | |
get-column-statistics-task-runGet the associated metadata/information for a task run, given a task run ID. | GetColumnStatisticsTaskRun | |
get-column-statistics-task-runsRetrieves information about all runs associated with the specified table. | GetColumnStatisticsTaskRuns | |
get-column-statistics-task-settingsGets settings for a column statistics task. | GetColumnStatisticsTaskSettings | |
get-connectionRetrieves a connection definition from the Data Catalog. | GetConnection | |
get-connectionsRetrieves a list of connection definitions from the Data Catalog. | GetConnections | |
get-crawlerRetrieves metadata for a specified crawler. | GetCrawler | |
get-crawler-metricsRetrieves metrics about specified crawlers. | GetCrawlerMetrics | |
get-crawlersRetrieves metadata for all crawlers defined in the customer account. | GetCrawlers | |
get-custom-entity-typeRetrieves the details of a custom pattern by specifying its name. | GetCustomEntityType | |
get-data-catalog-encryption-settingsRetrieves the security configuration for a specified catalog. | GetDataCatalogEncryptionSettings | |
get-data-quality-modelRetrieve the training status of the model along with more information (CompletedOn, StartedOn, FailureReason). | GetDataQualityModel | |
get-data-quality-model-resultRetrieve a statistic's predictions for a given Profile ID. | GetDataQualityModelResult | |
get-data-quality-resultRetrieves the result of a data quality rule evaluation. | GetDataQualityResult | |
get-data-quality-rule-recommendation-runGets the specified recommendation run that was used to generate rules. | GetDataQualityRuleRecommendationRun | |
get-data-quality-rulesetReturns an existing ruleset by identifier or name. | GetDataQualityRuleset | |
get-data-quality-ruleset-evaluation-runRetrieves a specific run where a ruleset is evaluated against a data source. | GetDataQualityRulesetEvaluationRun | |
get-databaseRetrieves the definition of a specified database. | GetDatabase | |
get-databasesRetrieves all databases defined in a given Data Catalog. | GetDatabases | |
get-dataflow-graphTransforms a Python script into a directed acyclic graph (DAG). | GetDataflowGraph | |
get-dev-endpointRetrieves information about a specified development endpoint. When you create a development endpoint in a virtual private cloud (VPC), Glue returns only a private IP address, and the public IP address field is not populated. When you create a non-VPC development endpoint, Glue returns only a public | GetDevEndpoint | |
get-dev-endpointsRetrieves all the development endpoints in this Amazon Web Services account. When you create a development endpoint in a virtual private cloud (VPC), Glue returns only a private IP address and the public IP address field is not populated. When you create a non-VPC development endpoint, Glue returns | GetDevEndpoints | |
get-entity-recordsThis API is used to query preview data from a given connection type or from a native Amazon S3 based Glue Data Catalog. Returns records as an array of JSON blobs. Each record is formatted using Jackson JsonNode based on the field type defined by the DescribeEntity API. Spark connectors generate sche | GetEntityRecords | |
get-glue-identity-center-configurationRetrieves the current Glue Identity Center configuration details, including the associated Identity Center instance and application information. | GetGlueIdentityCenterConfiguration | |
get-integration-resource-propertyThis API is used for fetching the ResourceProperty of the Glue connection (for the source) or Glue database ARN (for the target) | GetIntegrationResourceProperty | |
get-integration-table-propertiesThis API is used to retrieve optional override properties for the tables that need to be replicated. These properties can include properties for filtering and partition for source and target tables. | GetIntegrationTableProperties | |
get-jobRetrieves an existing job definition. | GetJob | |
get-job-bookmarkReturns information on a job bookmark entry. For more information about enabling and using job bookmarks, see: Tracking processed data using job bookmarks Job parameters used by Glue Job structure | GetJobBookmark | |
get-job-runRetrieves the metadata for a given job run. Job run history is accessible for 365 days for your workflow and job run. | GetJobRun | |
get-job-runsRetrieves metadata for all runs of a given job definition. GetJobRuns returns the job runs in chronological order, with the newest jobs returned first. | GetJobRuns | |
get-jobsRetrieves all current job definitions. | GetJobs | |
get-mappingCreates mappings. | GetMapping | |
get-materialized-view-refresh-task-runGet the associated metadata/information for a task run, given a task run ID. | GetMaterializedViewRefreshTaskRun | |
get-ml-task-runGets details for a specific task run on a machine learning transform. Machine learning task runs are asynchronous tasks that Glue runs on your behalf as part of various machine learning workflows. You can check the stats of any task run by calling GetMLTaskRun with the TaskRunID and its parent trans | GetMLTaskRun | |
get-ml-task-runsGets a list of runs for a machine learning transform. Machine learning task runs are asynchronous tasks that Glue runs on your behalf as part of various machine learning workflows. You can get a sortable, filterable list of machine learning task runs by calling GetMLTaskRuns with their parent transf | GetMLTaskRuns | |
get-ml-transformGets an Glue machine learning transform artifact and all its corresponding metadata. Machine learning transforms are a special type of transform that use machine learning to learn the details of the transformation to be performed by learning from examples provided by humans. These transformations ar | GetMLTransform | |
get-ml-transformsGets a sortable, filterable list of existing Glue machine learning transforms. Machine learning transforms are a special type of transform that use machine learning to learn the details of the transformation to be performed by learning from examples provided by humans. These transformations are then | GetMLTransforms | |
get-partitionRetrieves information about a specified partition. | GetPartition | |
get-partition-indexesRetrieves the partition indexes associated with a table. | GetPartitionIndexes | |
get-partitionsRetrieves information about the partitions in a table. | GetPartitions | |
get-planGets code to perform a specified mapping. | GetPlan | |
get-registryDescribes the specified registry in detail. | GetRegistry | |
get-resource-policiesRetrieves the resource policies set on individual resources by Resource Access Manager during cross-account permission grants. Also retrieves the Data Catalog resource policy. If you enabled metadata encryption in Data Catalog settings, and you do not have permission on the KMS key, the operation ca | GetResourcePolicies | |
get-resource-policyRetrieves a specified resource policy. | GetResourcePolicy | |
get-schemaDescribes the specified schema in detail. | GetSchema | |
get-schema-by-definitionRetrieves a schema by the SchemaDefinition. The schema definition is sent to the Schema Registry, canonicalized, and hashed. If the hash is matched within the scope of the SchemaName or ARN (or the default registry, if none is supplied), that schema’s metadata is returned. Otherwise, a 404 or NotFou | GetSchemaByDefinition | |
get-schema-versionGet the specified schema by its unique ID assigned when a version of the schema is created or registered. Schema versions in Deleted status will not be included in the results. | GetSchemaVersion | |
get-schema-versions-diffFetches the schema version difference in the specified difference type between two stored schema versions in the Schema Registry. This API allows you to compare two schema versions between two schema definitions under the same schema. | GetSchemaVersionsDiff | |
get-security-configurationRetrieves a specified security configuration. | GetSecurityConfiguration | |
get-security-configurationsRetrieves a list of all security configurations. | GetSecurityConfigurations | |
get-sessionRetrieves the session. | GetSession | |
get-statementRetrieves the statement. | GetStatement | |
get-tableRetrieves the Table definition in a Data Catalog for a specified table. | GetTable | |
get-table-optimizerReturns the configuration of all optimizers associated with a specified table. | GetTableOptimizer | |
get-table-versionRetrieves a specified version of a table. | GetTableVersion | |
get-table-versionsRetrieves a list of strings that identify available versions of a specified table. | GetTableVersions | |
get-tablesRetrieves the definitions of some or all of the tables in a given Database. | GetTables | |
get-tagsRetrieves a list of tags associated with a resource. | GetTags | |
get-triggerRetrieves the definition of a trigger. | GetTrigger | |
get-triggersGets all the triggers associated with a job. | GetTriggers | |
get-unfiltered-partition-metadataRetrieves partition metadata from the Data Catalog that contains unfiltered metadata. For IAM authorization, the public IAM action associated with this API is glue:GetPartition. | GetUnfilteredPartitionMetadata | |
get-unfiltered-partitions-metadataRetrieves partition metadata from the Data Catalog that contains unfiltered metadata. For IAM authorization, the public IAM action associated with this API is glue:GetPartitions. | GetUnfilteredPartitionsMetadata | |
get-unfiltered-table-metadataAllows a third-party analytical engine to retrieve unfiltered table metadata from the Data Catalog. For IAM authorization, the public IAM action associated with this API is glue:GetTable. | GetUnfilteredTableMetadata | |
get-usage-profileRetrieves information about the specified Glue usage profile. | GetUsageProfile | |
get-user-defined-functionRetrieves a specified function definition from the Data Catalog. | GetUserDefinedFunction | |
get-user-defined-functionsRetrieves multiple function definitions from the Data Catalog. | GetUserDefinedFunctions | |
get-workflowRetrieves resource metadata for a workflow. | GetWorkflow | |
get-workflow-runRetrieves the metadata for a given workflow run. Job run history is accessible for 90 days for your workflow and job run. | GetWorkflowRun | |
get-workflow-run-propertiesRetrieves the workflow run properties which were set during the run. | GetWorkflowRunProperties | |
get-workflow-runsRetrieves metadata for all runs of a given workflow. | GetWorkflowRuns | |
import-catalog-to-glueImports an existing Amazon Athena Data Catalog to Glue. | ImportCatalogToGlue | |
list-blueprintsLists all the blueprint names in an account. | ListBlueprints | |
list-column-statistics-task-runsList all task runs for a particular account. | ListColumnStatisticsTaskRuns | |
list-connection-typesThe ListConnectionTypes API provides a discovery mechanism to learn available connection types in Glue. The response contains a list of connection types with high-level details of what is supported for each connection type, including both built-in connection types and custom connection types registe | ListConnectionTypes | |
list-crawlersRetrieves the names of all crawler resources in this Amazon Web Services account, or the resources with the specified tag. This operation allows you to see which resources are available in your account, and their names. This operation takes the optional Tags field, which you can use as a filter on t | ListCrawlers | |
list-crawlsReturns all the crawls of a specified crawler. Returns only the crawls that have occurred since the launch date of the crawler history feature, and only retains up to 12 months of crawls. Older crawls will not be returned. You may use this API to: Retrive all the crawls of a specified crawler. R | ListCrawls | |
list-custom-entity-typesLists all the custom patterns that have been created. | ListCustomEntityTypes | |
list-data-quality-resultsReturns all data quality execution results for your account. | ListDataQualityResults | |
list-data-quality-rule-recommendation-runsLists the recommendation runs meeting the filter criteria. | ListDataQualityRuleRecommendationRuns | |
list-data-quality-ruleset-evaluation-runsLists all the runs meeting the filter criteria, where a ruleset is evaluated against a data source. | ListDataQualityRulesetEvaluationRuns | |
list-data-quality-rulesetsReturns a paginated list of rulesets for the specified list of Glue tables. | ListDataQualityRulesets | |
list-data-quality-statistic-annotationsRetrieve annotations for a data quality statistic. | ListDataQualityStatisticAnnotations | |
list-data-quality-statisticsRetrieves a list of data quality statistics. | ListDataQualityStatistics | |
list-dev-endpointsRetrieves the names of all DevEndpoint resources in this Amazon Web Services account, or the resources with the specified tag. This operation allows you to see which resources are available in your account, and their names. This operation takes the optional Tags field, which you can use as a filter | ListDevEndpoints | |
list-entitiesReturns the available entities supported by the connection type. | ListEntities | |
list-integration-resource-propertiesList integration resource properties for a single customer. It supports the filters, maxRecords and markers. | ListIntegrationResourceProperties | |
list-jobsRetrieves the names of all job resources in this Amazon Web Services account, or the resources with the specified tag. This operation allows you to see which resources are available in your account, and their names. This operation takes the optional Tags field, which you can use as a filter on the r | ListJobs | |
list-materialized-view-refresh-task-runsList all task runs for a particular account. | ListMaterializedViewRefreshTaskRuns | |
list-ml-transformsRetrieves a sortable, filterable list of existing Glue machine learning transforms in this Amazon Web Services account, or the resources with the specified tag. This operation takes the optional Tags field, which you can use as a filter of the responses so that tagged resources can be retrieved as | ListMLTransforms | |
list-registriesReturns a list of registries that you have created, with minimal registry information. Registries in the Deleting status will not be included in the results. Empty results will be returned if there are no registries available. | ListRegistries | |
list-schema-versionsReturns a list of schema versions that you have created, with minimal information. Schema versions in Deleted status will not be included in the results. Empty results will be returned if there are no schema versions available. | ListSchemaVersions | |
list-schemasReturns a list of schemas with minimal details. Schemas in Deleting status will not be included in the results. Empty results will be returned if there are no schemas available. When the RegistryId is not provided, all the schemas across registries will be part of the API response. | ListSchemas | |
list-sessionsRetrieve a list of sessions. | ListSessions | |
list-statementsLists statements for the session. | ListStatements | |
list-table-optimizer-runsLists the history of previous optimizer runs for a specific table. | ListTableOptimizerRuns | |
list-triggersRetrieves the names of all trigger resources in this Amazon Web Services account, or the resources with the specified tag. This operation allows you to see which resources are available in your account, and their names. This operation takes the optional Tags field, which you can use as a filter on t | ListTriggers | |
list-usage-profilesList all the Glue usage profiles. | ListUsageProfiles | |
list-workflowsLists names of workflows created in the account. | ListWorkflows | |
modify-integrationModifies a Zero-ETL integration in the caller's account. | ModifyIntegration | |
put-data-catalog-encryption-settingsSets the security configuration for a specified catalog. After the configuration has been set, the specified encryption is applied to every catalog write thereafter. | PutDataCatalogEncryptionSettings | |
put-data-quality-profile-annotationAnnotate all datapoints for a Profile. | PutDataQualityProfileAnnotation | |
put-resource-policySets the Data Catalog resource policy for access control. | PutResourcePolicy | |
put-schema-version-metadataPuts the metadata key value pair for a specified schema version ID. A maximum of 10 key value pairs will be allowed per schema version. They can be added over one or more calls. | PutSchemaVersionMetadata | |
put-workflow-run-propertiesPuts the specified workflow run properties for the given workflow run. If a property already exists for the specified run, then it overrides the value otherwise adds the property to existing properties. | PutWorkflowRunProperties | |
query-schema-version-metadataQueries for the schema version metadata information. | QuerySchemaVersionMetadata | |
register-connection-typeRegisters a custom connection type in Glue based on the configuration provided. This operation enables customers to configure custom connectors for any data source with REST-based APIs, eliminating the need for building custom Lambda connectors. The registered connection type stores details about ho | RegisterConnectionType | |
register-schema-versionAdds a new version to the existing schema. Returns an error if new version of schema does not meet the compatibility requirements of the schema set. This API will not create a new schema set and will return a 404 error if the schema set is not already present in the Schema Registry. If this is the f | RegisterSchemaVersion | |
remove-schema-version-metadataRemoves a key value pair from the schema version metadata for the specified schema version ID. | RemoveSchemaVersionMetadata | |
reset-job-bookmarkResets a bookmark entry. For more information about enabling and using job bookmarks, see: Tracking processed data using job bookmarks Job parameters used by Glue Job structure | ResetJobBookmark | |
resume-workflow-runRestarts selected nodes of a previous partially completed workflow run and resumes the workflow run. The selected nodes and all nodes that are downstream from the selected nodes are run. | ResumeWorkflowRun | |
run-statementExecutes the statement. | RunStatement | |
search-tablesSearches a set of tables based on properties in the table metadata as well as on the parent database. You can search against text or filter conditions. You can only get tables that you have access to based on the security policies defined in Lake Formation. You need at least a read-only access to t | SearchTables | |
start-blueprint-runStarts a new run of the specified blueprint. | StartBlueprintRun | |
start-column-statistics-task-runStarts a column statistics task run, for a specified table and columns. | StartColumnStatisticsTaskRun | |
start-column-statistics-task-run-scheduleStarts a column statistics task run schedule. | StartColumnStatisticsTaskRunSchedule | |
start-crawlerStarts a crawl using the specified crawler, regardless of what is scheduled. If the crawler is already running, returns a CrawlerRunningException. | StartCrawler | |
start-crawler-scheduleChanges the schedule state of the specified crawler to SCHEDULED, unless the crawler is already running or the schedule state is already SCHEDULED. | StartCrawlerSchedule | |
start-data-quality-rule-recommendation-runStarts a recommendation run that is used to generate rules when you don't know what rules to write. Glue Data Quality analyzes the data and comes up with recommendations for a potential ruleset. You can then triage the ruleset and modify the generated ruleset to your liking. Recommendation runs are | StartDataQualityRuleRecommendationRun | |
start-data-quality-ruleset-evaluation-runOnce you have a ruleset definition (either recommended or your own), you call this operation to evaluate the ruleset against a data source (Glue table). The evaluation computes results which you can retrieve with the GetDataQualityResult API. | StartDataQualityRulesetEvaluationRun | |
start-export-labels-task-runBegins an asynchronous task to export all labeled data for a particular transform. This task is the only label-related API call that is not part of the typical active learning workflow. You typically use StartExportLabelsTaskRun when you want to work with all of your existing labels at the same time | StartExportLabelsTaskRun | |
start-import-labels-task-runEnables you to provide additional labels (examples of truth) to be used to teach the machine learning transform and improve its quality. This API operation is generally used as part of the active learning workflow that starts with the StartMLLabelingSetGenerationTaskRun call and that ultimately resu | StartImportLabelsTaskRun | |
start-job-runStarts a job run using a job definition. | StartJobRun | |
start-materialized-view-refresh-task-runStarts a materialized view refresh task run, for a specified table and columns. | StartMaterializedViewRefreshTaskRun | |
start-ml-evaluation-task-runStarts a task to estimate the quality of the transform. When you provide label sets as examples of truth, Glue machine learning uses some of those examples to learn from them. The rest of the labels are used as a test to estimate quality. Returns a unique identifier for the run. You can call GetMLT | StartMLEvaluationTaskRun | |
start-ml-labeling-set-generation-task-runStarts the active learning workflow for your machine learning transform to improve the transform's quality by generating label sets and adding labels. When the StartMLLabelingSetGenerationTaskRun finishes, Glue will have generated a "labeling set" or a set of questions for humans to answer. In the c | StartMLLabelingSetGenerationTaskRun | |
start-triggerStarts an existing trigger. See Triggering Jobs for information about how different types of trigger are started. | StartTrigger | |
start-workflow-runStarts a new run of the specified workflow. | StartWorkflowRun | |
stop-column-statistics-task-runStops a task run for the specified table. | StopColumnStatisticsTaskRun | |
stop-column-statistics-task-run-scheduleStops a column statistics task run schedule. | StopColumnStatisticsTaskRunSchedule | |
stop-crawlerIf the specified crawler is running, stops the crawl. | StopCrawler | |
stop-crawler-scheduleSets the schedule state of the specified crawler to NOT_SCHEDULED, but does not stop the crawler if it is already running. | StopCrawlerSchedule | |
stop-materialized-view-refresh-task-runStops a materialized view refresh task run, for a specified table and columns. | StopMaterializedViewRefreshTaskRun | |
stop-sessionStops the session. | StopSession | |
stop-triggerStops a specified trigger. | StopTrigger | |
stop-workflow-runStops the execution of the specified workflow run. | StopWorkflowRun | |
tag-resourceAdds tags to a resource. A tag is a label you can assign to an Amazon Web Services resource. In Glue, you can tag only certain resources. For information about what resources you can tag, see Amazon Web Services Tags in Glue. | TagResource | |
test-connectionTests a connection to a service to validate the service credentials that you provide. You can either provide an existing connection name or a TestConnectionInput for testing a non-existing connection input. Providing both at the same time will cause an error. If the action is successful, the service | TestConnection | |
untag-resourceRemoves tags from a resource. | UntagResource | |
update-blueprintUpdates a registered blueprint. | UpdateBlueprint | |
update-catalogUpdates an existing catalog's properties in the Glue Data Catalog. | UpdateCatalog | |
update-classifierModifies an existing classifier (a GrokClassifier, an XMLClassifier, a JsonClassifier, or a CsvClassifier, depending on which field is present). | UpdateClassifier | |
update-column-statistics-for-partitionCreates or updates partition statistics of columns. The Identity and Access Management (IAM) permission required for this operation is UpdatePartition. | UpdateColumnStatisticsForPartition | |
update-column-statistics-for-tableCreates or updates table statistics of columns. The Identity and Access Management (IAM) permission required for this operation is UpdateTable. | UpdateColumnStatisticsForTable | |
update-column-statistics-task-settingsUpdates settings for a column statistics task. | UpdateColumnStatisticsTaskSettings | |
update-connectionUpdates a connection definition in the Data Catalog. | UpdateConnection | |
update-crawlerUpdates a crawler. If a crawler is running, you must stop it using StopCrawler before updating it. | UpdateCrawler | |
update-crawler-scheduleUpdates the schedule of a crawler using a cron expression. | UpdateCrawlerSchedule | |
update-data-quality-rulesetUpdates the specified data quality ruleset. | UpdateDataQualityRuleset | |
update-databaseUpdates an existing database definition in a Data Catalog. | UpdateDatabase | |
update-dev-endpointUpdates a specified development endpoint. | UpdateDevEndpoint | |
update-glue-identity-center-configurationUpdates the existing Glue Identity Center configuration, allowing modification of scopes and permissions for the integration. | UpdateGlueIdentityCenterConfiguration | |
update-integration-resource-propertyThis API can be used for updating the ResourceProperty of the Glue connection (for the source) or Glue database ARN (for the target). These properties can include the role to access the connection or database. Since the same resource can be used across multiple integrations, updating resource proper | UpdateIntegrationResourceProperty | |
update-integration-table-propertiesThis API is used to provide optional override properties for the tables that need to be replicated. These properties can include properties for filtering and partitioning for the source and target tables. To set both source and target properties the same API need to be invoked with the Glue connecti | UpdateIntegrationTableProperties | |
update-jobUpdates an existing job definition. The previous job definition is completely overwritten by this information. | UpdateJob | |
update-job-from-source-controlSynchronizes a job from the source control repository. This operation takes the job artifacts that are located in the remote repository and updates the Glue internal stores with these artifacts. This API supports optional parameters which take in the repository information. | UpdateJobFromSourceControl | |
update-ml-transformUpdates an existing machine learning transform. Call this operation to tune the algorithm parameters to achieve better results. After calling this operation, you can call the StartMLEvaluationTaskRun operation to assess how well your new parameters achieved your goals (such as improving the quality | UpdateMLTransform | |
update-partitionUpdates a partition. | UpdatePartition | |
update-registryUpdates an existing registry which is used to hold a collection of schemas. The updated properties relate to the registry, and do not modify any of the schemas within the registry. | UpdateRegistry | |
update-schemaUpdates the description, compatibility setting, or version checkpoint for a schema set. For updating the compatibility setting, the call will not validate compatibility for the entire set of schema versions with the new compatibility setting. If the value for Compatibility is provided, the VersionNu | UpdateSchema | |
update-source-control-from-jobSynchronizes a job to the source control repository. This operation takes the job artifacts from the Glue internal stores and makes a commit to the remote repository that is configured on the job. This API supports optional parameters which take in the repository information. | UpdateSourceControlFromJob | |
update-tableUpdates a metadata table in the Data Catalog. | UpdateTable | |
update-table-optimizerUpdates the configuration for an existing table optimizer. | UpdateTableOptimizer | |
update-triggerUpdates a trigger definition. Job arguments may be logged. Do not pass plaintext secrets as arguments. Retrieve secrets from a Glue Connection, Amazon Web Services Secrets Manager or other secret management mechanism if you intend to keep them within the Job. | UpdateTrigger | |
update-usage-profileUpdate an Glue usage profile. | UpdateUsageProfile | |
update-user-defined-functionUpdates an existing function definition in the Data Catalog. | UpdateUserDefinedFunction | |
update-workflowUpdates an existing workflow. | UpdateWorkflow |
Our bi-weekly newsletter teaches hands-on AWS fundamentals. No certification fluff - just practical knowledge.
Subscribe to NewsletterOur bi-weekly newsletter teaches hands-on AWS fundamentals. No certification fluff - just practical knowledge.
Subscribe to NewsletterGlue