You can't use this until you register with us!




Description

Converts the content in PDF's stored on S3 to JSON data stored in a MySQL database.


Purchasing Options



Available Actions

Convert Data

Executes the query specified against a source MySQL database to obtain a list of PDF's to process. S3 location and other metadata returned from the query is used to locate and read the PDF's, convert their content to JSON data and store that converted data in a destination MySQL database.


Variables

Source Parameters
source_db required

Select the MySQL database where the query will be executed to find PDF's to process.

last_run_end_point

Specify a primary key value. Conversions will begin with the row containing a primary key value that is after the value specified in incremental sort order. Leave this field blank to process all rows in the query result. This field will be automatically set to the primary key value of each row processed during Task execution.

sql_query required

Specify the SQL query to obtain the list of primary keys and S3 bucket paths for the PDF's to process.

aws_token required

Select the AWS Access Keys to use to access the S3 bucket where the PDF's are stored.

bucket required

Specify the name of the S3 bucket where the PDF's are stored.


Destination Parameters
destination_db required

Select the MySQL database where the JSON output will be stored.

destination_table_name required

Specify the name of the table, in schema.table format, where the JSON output will be stored. If the table does not exist, it will be created.

destination_column_pdfid required

Specify the name of the column in the destination table where the primary key for each converted PDF will be stored.

destination_column_jsondata required

Specify the name of the column in the destination table where the JSON data from each converted PDF will be stored

destination_column_status required

Specify the name of the column in the destination table where a success message or error information about the conversion will be stored.

allow_overwrite required

If enabled, existing entries in the destination table with the same primary keys as the PDF's being processed will be overwritten.