Convert content in PDF files stored on S3 to string array data stored in a MySQL database.
Read text content from PDF files in an S3 bucket, convert it to string array data and store the data in a table in a MySQL database.
Variables
Select the MySQL database where the query will be executed to find PDF's to process.
Specify the name of the table, in schema.table format, where the file data is stored.
Specify the name of primary key column.
Specify the SQL query to obtain the list of primary keys and S3 bucket paths for the PDF's to process.
Specify a primary key value. Conversions will begin with the row containing a primary key value that is after the value specified in incremental sort order. Leave this field blank to process all rows in the query result. This field will be automatically set to the primary key value of each row processed during Task execution.
Select the AWS Access Keys to use to access the S3 bucket where the PDF's are stored.
Specify the name of the S3 bucket where the PDF's are stored.
Select the MySQL database where the string array data will be stored.
Specify the name of the table, in schema.table format, where the string array data will be stored. If the table does not exist, it will be created.
Specify the name of the column in the destination table where the primary key for each converted PDF will be stored.
If enabled, existing entries in the destination table with the same primary keys as the PDF's being processed will be overwritten.