What meta-data does Docparser include?
Docparser has a number of additional columns or values included in both the downloadable formats and webhooks. All of these values are optional and can be excluded from the final data by modifying the download or webhook settings. Below is a list of values we currently provide.
Document ID | document_id | The document ID is an alpha-numeric value generated when a document is first uploaded. |
Remote ID | remote_id | A remote_id can be set via the API or through the Email Reception system to give a document a value that can be passed to the endpoint (255 max characters). |
Filename | file_name | The file_name is the name of the uploaded document, this cannot be changed or parsed. |
Media Link | media_link | media_link is a URL that contains a direct download to the uploaded file. If preprocessing has been applied this will be included. |
Media Link Original | media_link_original | media_link_original contains a download to the original file as it was uploaded (without preprocessing). |
Media Link Data | media_link_data | The media_link_data parameter contains a download for the parsed data in the default file format (this can be specified when setting up a download). |
Page Count | page_count | page_count will output the total number of pages in an uploaded document. |
Received At | uploaded_at | The time that a document was uploaded. |
Processed At | processed_at | processed_at denotes the time a document completed the parsing queue. |
Received At(UTC) | uploaded_at_utc | The time a document was uploaded in coordinated universal time. |
Received At(User Timezone) | uploaded_at_user | The time a document was uploaded in your accounts specified timezone. |
Processed At(UTC) | processed_at_utc | The time a document was processed in coordinated universal time. |
Processed At(User Timezone) | processed_at_user | The time a document was processed in your accounts specified timezone. |