You can't use this until you register with us!




Description

Convert content in PDF files stored on S3 to string array data stored in a MySQL database.


Purchasing Options



Available Actions

Convert Data

Read text content from PDF files in an S3 bucket, convert it to string array data and store the data in a table in a MySQL database.


Variables

Source Parameters
source_db required

Select the MySQL database where the query will be executed to find PDF's to process.

sql_query required

Specify the SQL query to obtain the list of PDF ID's and S3 bucket paths for the PDF's to process.

range_start_point

Specify a primary key value. Conversions will begin with the row containing a primary key value that is after the value specified in incremental sort order. Leave this field blank to process all rows in the query result. This field will be automatically set to the primary key value of each row processed during Task execution.

aws_config

AWS Access Keys to use to access the S3 bucket where the PDF's to process are stored.

iam_role

Optional. ARN of an IAM role to use for S3 bucket access.

bucket required

Name of the S3 Bucket where the PDF's to process are stored.

strategy required

Method used to identify the boundaries of the strings to store.


Destination Parameters
target_db required

Select the MySQL database where the string array data will be stored.

target_table_name required

Specify the name of the table, in schema.table format, where the string array data will be stored. If the table does not exist, it will be created.

target_pdf_id_column required

Specify the name of the column in the destination table where the source PDF ID for each string will be stored.