Helping you find the right solutions for your Information Management needs.
We offer innovative solutions to suit the needs of our customers.
Our services are available 24/7 nationwide.

Metadata Extraction Demonstration

 

Because most of today's documents are created electronically, you may have created a very large number of documents, possibly as many as a million, in the past few years. You may need to archive all these documents and store them in a secure location, and you may also need to save valuable storage space. Lexdata can streamline your archiving process by creating a document repository that consists of PDF/A archive files converted from your existing file formats, together with a database that is coded from the metadata present in each of your documents. The following is an example of archiving and extracting metadata from a mail message:

 

 

 

As shown above, the Outlook mail message is first converted to the required format. In most cases, this is the PDF/A file format. TIFF is necessary for some third-party document management repositories.

 

 

 

Once the documents are converted and OCRed, each document is objectively coded into a database. The structure of the database can be your own design or a third-party compliant design. For the purpose of this sample, we have a simple MS Excel database with the basic fields necessary to identify the document. Note that there are also hyperlinks in the database which can be used to navigate to the relevant document. The sample files are available for download below:
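The coding step described above can be sketched in Python roughly as follows. This is only a minimal illustration and not Lexdata's actual process: it assumes the mail message has already been exported as RFC 822 text (Python's standard library does not read Outlook .msg files), and the field names, the sample message, and the `code_message` helper are all hypothetical.

```python
from email import message_from_string
from email.utils import parsedate_to_datetime

# Hypothetical field names; a real database design may differ.
FIELDS = ["DocID", "From", "To", "Subject", "Date", "Link"]


def code_message(doc_id, raw_message, pdf_path):
    """Build one database row from the headers of an RFC 822 message."""
    msg = message_from_string(raw_message)
    sent = parsedate_to_datetime(msg["Date"])
    return {
        "DocID": doc_id,
        "From": msg["From"],
        "To": msg["To"],
        "Subject": msg["Subject"],
        "Date": sent.date().isoformat(),
        # Path stored as the hyperlink to the converted document.
        "Link": pdf_path,
    }


# Made-up sample message used only for this illustration.
SAMPLE = (
    "From: alice@example.com\n"
    "To: bob@example.com\n"
    "Subject: Quarterly report\n"
    "Date: Mon, 05 Mar 2012 10:15:00 +0000\n"
    "\n"
    "Please find the report attached.\n"
)

row = code_message("DOC0001", SAMPLE, "pdf/DOC0001.pdf")
```

Each row pairs the descriptive fields with a link to the converted file, which is the role the hyperlinks play in the sample database.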

 

Mail Message - Sample Outlook mail message in .msg format

 

Converted PDF Document - Mail message converted to PDF format

 

Converted TIFF Image - Mail message converted to TIFF format. Required for certain document management repositories.

 

MS Excel Metadata Database - Coded database file in MS Excel format. The database can also be delivered in XML, MS Access, CSV, or SQL query format.
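As a rough illustration of how the same coded record might be delivered in two of the alternative formats, the Python sketch below writes one record as CSV and as XML. The record, its field names, the element names, and the `to_csv` and `to_xml` helpers are assumptions made for this example, not the layout of the sample database.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical record; field names are illustrative only.
record = {
    "DocID": "DOC0001",
    "Subject": "Quarterly report",
    "Link": "pdf/DOC0001.pdf",
}


def to_csv(rows):
    """Serialize rows as CSV text with a header line."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=list(rows[0]), lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


def to_xml(rows):
    """Serialize rows as XML, one <document> element per row."""
    root = ET.Element("documents")
    for r in rows:
        doc = ET.SubElement(root, "document")
        for name, value in r.items():
            ET.SubElement(doc, name).text = value
    return ET.tostring(root, encoding="unicode")


csv_text = to_csv([record])
xml_text = to_xml([record])
```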

 

Notes on Test Conditions
You will need the latest version of Adobe Reader to view the above PDF documents.