camel-aws2-s3-kafka-connector source configuration
Connector Description: Store and retrieve objects from AWS S3 Storage Service using AWS SDK version 2.x.
When using camel-aws2-s3-kafka-connector as source make sure to use the following Maven dependency to have support for the connector:
<dependency>
<groupId>org.apache.camel.kafkaconnector</groupId>
<artifactId>camel-aws2-s3-kafka-connector</artifactId>
<version>x.x.x</version>
<!-- use the same version as your Camel Kafka connector version -->
</dependency>
To use this source connector in Kafka connect you’ll need to set the following connector.class
connector.class=org.apache.camel.kafkaconnector.aws2s3.CamelAws2s3SourceConnector
The camel-aws2-s3 source connector supports 87 options, which are listed below.
Name | Description | Default | Priority |
---|---|---|---|
Required Bucket name or ARN. |
HIGH |
||
Reference to a com.amazonaws.services.s3.AmazonS3 in the registry. |
MEDIUM |
||
An S3 Presigner for Request, used mainly in createDownloadLink operation. |
MEDIUM |
||
Setting the autocreation of the S3 bucket bucketName. This will apply also in case of moveAfterRead option enabled and it will create the destinationBucket if it doesn’t exist already. |
false |
MEDIUM |
|
Set the need for overidding the endpoint. This option needs to be used in combination with uriEndpointOverride option. |
false |
MEDIUM |
|
If we want to use a POJO request as body or not. |
false |
MEDIUM |
|
The policy for this queue to set in the com.amazonaws.services.s3.AmazonS3#setBucketPolicy() method. |
MEDIUM |
||
To define a proxy host when instantiating the SQS client. |
MEDIUM |
||
Specify a proxy port to be used inside the client definition. |
MEDIUM |
||
To define a proxy protocol when instantiating the S3 client One of: [HTTP] [HTTPS]. Enum values:
|
"HTTPS" |
MEDIUM |
|
The region in which S3 client needs to work. When using this parameter, the configuration will expect the lowercase name of the region (for example ap-east-1) You’ll need to use the name Region.EU_WEST_1.id(). |
MEDIUM |
||
If we want to trust all certificates in case of overriding the endpoint. |
false |
MEDIUM |
|
Set the overriding uri endpoint. This option needs to be used in combination with overrideEndpoint option. |
MEDIUM |
||
Set whether the S3 client should expect to load credentials through a default credentials provider or to expect static credentials to be passed in. |
false |
MEDIUM |
|
Define the customer algorithm to use in case CustomerKey is enabled. |
MEDIUM |
||
Define the id of Customer key to use in case CustomerKey is enabled. |
MEDIUM |
||
Define the MD5 of Customer key to use in case CustomerKey is enabled. |
MEDIUM |
||
Allows for bridging the consumer to the Camel routing Error Handler, which mean any exceptions occurred while the consumer is trying to pickup incoming messages, or the likes, will now be processed as a message and handled by the routing Error Handler. By default the consumer will use the org.apache.camel.spi.ExceptionHandler to deal with exceptions, that will be logged at WARN or ERROR level and ignored. |
false |
MEDIUM |
|
Delete objects from S3 after they have been retrieved. The delete is only performed if the Exchange is committed. If a rollback occurs, the object is not deleted. If this option is false, then the same objects will be retrieve over and over again on the polls. Therefore you need to use the Idempotent Consumer EIP in the route to filter out duplicates. You can filter using the AWS2S3Constants#BUCKET_NAME and AWS2S3Constants#KEY headers, or only the AWS2S3Constants#KEY header. |
true |
MEDIUM |
|
The delimiter which is used in the com.amazonaws.services.s3.model.ListObjectsRequest to only consume objects we are interested in. |
MEDIUM |
||
Define the destination bucket where an object must be moved when moveAfterRead is set to true. |
MEDIUM |
||
Define the destination bucket prefix to use when an object must be moved and moveAfterRead is set to true. |
MEDIUM |
||
Define the destination bucket suffix to use when an object must be moved and moveAfterRead is set to true. |
MEDIUM |
||
If provided, Camel will only consume files if a done file exists. |
MEDIUM |
||
To get the object from the bucket with the given file name. |
MEDIUM |
||
If it is true, the S3 Object Body will be ignored completely, if it is set to false the S3 Object will be put in the body. Setting this to true, will override any behavior defined by includeBody option. |
false |
MEDIUM |
|
If it is true, the S3Object exchange will be consumed and put into the body and closed. If false the S3Object stream will be put raw into the body and the headers will be set with the S3 object metadata. This option is strongly related to autocloseBody option. In case of setting includeBody to true because the S3Object stream will be consumed then it will also be closed, while in case of includeBody false then it will be up to the caller to close the S3Object stream. However setting autocloseBody to true when includeBody is false it will schedule to close the S3Object stream automatically on exchange completion. |
true |
MEDIUM |
|
If it is true, the folders/directories will be consumed. If it is false, they will be ignored, and Exchanges will not be created for those. |
true |
MEDIUM |
|
Set the maxConnections parameter in the S3 client configuration. |
60 |
MEDIUM |
|
Gets the maximum number of messages as a limit to poll at each polling. Gets the maximum number of messages as a limit to poll at each polling. The default value is 10. Use 0 or a negative number to set it as unlimited. |
10 |
MEDIUM |
|
Move objects from S3 bucket to a different bucket after they have been retrieved. To accomplish the operation the destinationBucket option must be set. The copy bucket operation is only performed if the Exchange is committed. If a rollback occurs, the object is not moved. |
false |
MEDIUM |
|
The prefix which is used in the com.amazonaws.services.s3.model.ListObjectsRequest to only consume objects we are interested in. |
MEDIUM |
||
If the polling consumer did not poll any files, you can enable this option to send an empty message (no body) instead. |
false |
MEDIUM |
|
If this option is true and includeBody is false, then the S3Object.close() method will be called on exchange completion. This option is strongly related to includeBody option. In case of setting includeBody to false and autocloseBody to false, it will be up to the caller to close the S3Object stream. Setting autocloseBody to true, will close the S3Object stream automatically. |
true |
MEDIUM |
|
To let the consumer use a custom ExceptionHandler. Notice if the option bridgeErrorHandler is enabled then this option is not in use. By default the consumer will deal with exceptions, that will be logged at WARN or ERROR level and ignored. |
MEDIUM |
||
Sets the exchange pattern when the consumer creates an exchange. One of: [InOnly] [InOut] [InOptionalOut]. Enum values:
|
MEDIUM |
||
A pluggable org.apache.camel.PollingConsumerPollingStrategy allowing you to provide your custom implementation to control error handling usually occurred during the poll operation before an Exchange have been created and being routed in Camel. |
MEDIUM |
||
The number of subsequent error polls (failed due some error) that should happen before the backoffMultipler should kick-in. |
MEDIUM |
||
The number of subsequent idle polls that should happen before the backoffMultipler should kick-in. |
MEDIUM |
||
To let the scheduled polling consumer backoff if there has been a number of subsequent idles/errors in a row. The multiplier is then the number of polls that will be skipped before the next actual attempt is happening again. When this option is in use then backoffIdleThreshold and/or backoffErrorThreshold must also be configured. |
MEDIUM |
||
Milliseconds before the next poll. |
500L |
MEDIUM |
|
If greedy is enabled, then the ScheduledPollConsumer will run immediately again, if the previous run polled 1 or more messages. |
false |
MEDIUM |
|
Milliseconds before the first poll starts. |
1000L |
MEDIUM |
|
Specifies a maximum limit of number of fires. So if you set it to 1, the scheduler will only fire once. If you set it to 5, it will only fire five times. A value of zero or negative means fire forever. |
0L |
MEDIUM |
|
The consumer logs a start/complete log line when it polls. This option allows you to configure the logging level for that. One of: [TRACE] [DEBUG] [INFO] [WARN] [ERROR] [OFF]. Enum values:
|
"TRACE" |
MEDIUM |
|
Allows for configuring a custom/shared thread pool to use for the consumer. By default each consumer has its own single threaded thread pool. |
MEDIUM |
||
To use a cron scheduler from either camel-spring or camel-quartz component. Use value spring or quartz for built in scheduler. |
"none" |
MEDIUM |
|
To configure additional properties when using a custom scheduler or any of the Quartz, Spring based scheduler. |
MEDIUM |
||
Whether the scheduler should be auto started. |
true |
MEDIUM |
|
Time unit for initialDelay and delay options. One of: [NANOSECONDS] [MICROSECONDS] [MILLISECONDS] [SECONDS] [MINUTES] [HOURS] [DAYS]. Enum values:
|
"MILLISECONDS" |
MEDIUM |
|
Controls if fixed delay or fixed rate is used. See ScheduledExecutorService in JDK for details. |
true |
MEDIUM |
|
Amazon AWS Access Key. |
MEDIUM |
||
Amazon AWS Secret Key. |
MEDIUM |
||
Reference to a com.amazonaws.services.s3.AmazonS3 in the registry. |
MEDIUM |
||
An S3 Presigner for Request, used mainly in createDownloadLink operation. |
MEDIUM |
||
Setting the autocreation of the S3 bucket bucketName. This will apply also in case of moveAfterRead option enabled and it will create the destinationBucket if it doesn’t exist already. |
false |
MEDIUM |
|
The component configuration. |
MEDIUM |
||
Set the need for overidding the endpoint. This option needs to be used in combination with uriEndpointOverride option. |
false |
MEDIUM |
|
If we want to use a POJO request as body or not. |
false |
MEDIUM |
|
The policy for this queue to set in the com.amazonaws.services.s3.AmazonS3#setBucketPolicy() method. |
MEDIUM |
||
To define a proxy host when instantiating the SQS client. |
MEDIUM |
||
Specify a proxy port to be used inside the client definition. |
MEDIUM |
||
To define a proxy protocol when instantiating the S3 client One of: [HTTP] [HTTPS]. Enum values:
|
"HTTPS" |
MEDIUM |
|
The region in which S3 client needs to work. When using this parameter, the configuration will expect the lowercase name of the region (for example ap-east-1) You’ll need to use the name Region.EU_WEST_1.id(). |
MEDIUM |
||
If we want to trust all certificates in case of overriding the endpoint. |
false |
MEDIUM |
|
Set the overriding uri endpoint. This option needs to be used in combination with overrideEndpoint option. |
MEDIUM |
||
Set whether the S3 client should expect to load credentials through a default credentials provider or to expect static credentials to be passed in. |
false |
MEDIUM |
|
Define the customer algorithm to use in case CustomerKey is enabled. |
MEDIUM |
||
Define the id of Customer key to use in case CustomerKey is enabled. |
MEDIUM |
||
Define the MD5 of Customer key to use in case CustomerKey is enabled. |
MEDIUM |
||
Allows for bridging the consumer to the Camel routing Error Handler, which mean any exceptions occurred while the consumer is trying to pickup incoming messages, or the likes, will now be processed as a message and handled by the routing Error Handler. By default the consumer will use the org.apache.camel.spi.ExceptionHandler to deal with exceptions, that will be logged at WARN or ERROR level and ignored. |
false |
MEDIUM |
|
Delete objects from S3 after they have been retrieved. The delete is only performed if the Exchange is committed. If a rollback occurs, the object is not deleted. If this option is false, then the same objects will be retrieve over and over again on the polls. Therefore you need to use the Idempotent Consumer EIP in the route to filter out duplicates. You can filter using the AWS2S3Constants#BUCKET_NAME and AWS2S3Constants#KEY headers, or only the AWS2S3Constants#KEY header. |
true |
MEDIUM |
|
The delimiter which is used in the com.amazonaws.services.s3.model.ListObjectsRequest to only consume objects we are interested in. |
MEDIUM |
||
Define the destination bucket where an object must be moved when moveAfterRead is set to true. |
MEDIUM |
||
Define the destination bucket prefix to use when an object must be moved and moveAfterRead is set to true. |
MEDIUM |
||
Define the destination bucket suffix to use when an object must be moved and moveAfterRead is set to true. |
MEDIUM |
||
If provided, Camel will only consume files if a done file exists. |
MEDIUM |
||
To get the object from the bucket with the given file name. |
MEDIUM |
||
If it is true, the S3 Object Body will be ignored completely, if it is set to false the S3 Object will be put in the body. Setting this to true, will override any behavior defined by includeBody option. |
false |
MEDIUM |
|
If it is true, the S3Object exchange will be consumed and put into the body and closed. If false the S3Object stream will be put raw into the body and the headers will be set with the S3 object metadata. This option is strongly related to autocloseBody option. In case of setting includeBody to true because the S3Object stream will be consumed then it will also be closed, while in case of includeBody false then it will be up to the caller to close the S3Object stream. However setting autocloseBody to true when includeBody is false it will schedule to close the S3Object stream automatically on exchange completion. |
true |
MEDIUM |
|
If it is true, the folders/directories will be consumed. If it is false, they will be ignored, and Exchanges will not be created for those. |
true |
MEDIUM |
|
Move objects from S3 bucket to a different bucket after they have been retrieved. To accomplish the operation the destinationBucket option must be set. The copy bucket operation is only performed if the Exchange is committed. If a rollback occurs, the object is not moved. |
false |
MEDIUM |
|
The prefix which is used in the com.amazonaws.services.s3.model.ListObjectsRequest to only consume objects we are interested in. |
MEDIUM |
||
If this option is true and includeBody is false, then the S3Object.close() method will be called on exchange completion. This option is strongly related to includeBody option. In case of setting includeBody to false and autocloseBody to false, it will be up to the caller to close the S3Object stream. Setting autocloseBody to true, will close the S3Object stream automatically. |
true |
MEDIUM |
|
Whether autowiring is enabled. This is used for automatic autowiring options (the option must be marked as autowired) by looking up in the registry to find if there is a single instance of matching type, which then gets configured on the component. This can be used for automatic configuring JDBC data sources, JMS connection factories, AWS Clients, etc. |
true |
MEDIUM |
|
Amazon AWS Access Key. |
MEDIUM |
||
Amazon AWS Secret Key. |
MEDIUM |
The camel-aws2-s3 source connector supports 1 converters out of the box, which are listed below.
-
org.apache.camel.kafkaconnector.aws2s3.converters.S3ObjectConverter
The camel-aws2-s3 source connector supports 3 transforms out of the box, which are listed below.
-
org.apache.camel.kafkaconnector.aws2s3.transformers.JSONToRecordTransforms
-
org.apache.camel.kafkaconnector.aws2s3.transformers.RecordToJSONTransforms
-
org.apache.camel.kafkaconnector.aws2s3.transformers.S3ObjectTransforms
The camel-aws2-s3 source connector supports 1 aggregation strategies out of the box, which are listed below.
-
org.apache.camel.kafkaconnector.aws2s3.aggregation.NewlineAggregationStrategy