 Overview
Introduction
Service Features
Service Process
Output Quality
|
|
 |
|
 |
|
INTRODUCTION TO AUTOMATED DOCUMENT PROCESSING
Beginner
What is automated document processing?
ADP, sometimes called machine-assisted document processing, is the
systematic use of technology to assist human beings in performing
document processing tasks. Typical tasks in litigation support include:
- scanning paper pages to image (scanning, imaging, TIFFing)
- converting electronic files from one format to another (EDD, e-conversion, conversion to TIFF)
- dividing pages into logical document boundaries (LDD, unitization, packaging)
- indexing the documents (coding, data capture, data entry)
- culling documents based on certain attributes (culling, de-duping)
- storing the documents and/or digital files (hosting)
- producing or printing select documents or files
Automated document processing accomplishes the tasks above with the use
of technology to assist the human beings completing the task. The
result of the automated approach is typically to lower costs, reduce
time and increase quality and consistency.
What is unitization?
Unitization, also called logical document determination, is a process
that groups pages into their proper documents. The technique
ensures that no pages are inadvertently mixed together or accidentally
split apart at scan time. Unitization also results in the strongest
document encoding because there is no omission or mixing of pertinent
document information. Both electronic files and paper documents can be
unitized.
What is document encoding?
Document encoding, also called indexing, is the process of creating a
database of bibliographic information about a set of documents.
Bibliographic information covers information about the document, such as
its title, its creation date and its author. Historically, trained
paralegals and legal temp workers performed this task by hand.
Automated encoding uses sophisticated computer software to accomplish
the same process, but much more quickly and at a lower cost.
What is e-conversion?
E-conversion, also called EDD, is the process of converting many
different types of "native" electronic files into a single, common type
that can be used for a wide variety or purposes. The most common
technique converts electronic files (email, desktop publishing,
spreadsheets, PDF images, etc.) to TIFF images for viewing and
associated text files for searching. Many e-conversion processes also
yield Metadata, which is file-level information about each file, such as
its size, file extension type and last modified date. (See more on
Metadata, below.)
Intermediate
How does Valora perform automated encoding?
Valora's patent-pending software system makes several passes through a
document set, identifying tell-tale document characteristics and
extracting useful information. A trained Quality Control team reviews
and enhances this information using custom-designed analysis tools that
allow them to process documents at many times the speed of manual
coding. The system then exports the data in the format requested by
the client.
How is Valora different from other auto-coding vendors?
Most other vendors who provide "auto-coding" are providing a
software-only service. That is, they run the images and text through
their software and see what results. This "get-what-you-get" approach
does not offer strong accuracy or complete coverage of all documents.
This technique is sometimes called "dirty coding" since the raw software
output is unenhanced by human review.
All Valora's services pass through both computer processes and human
processes to ensure the best quality results. In fact, we guarantee our
population coverage at 100% and our accuracy at 98%. We are the only
automated service provider to do so.
How is Valora different from off-shore coding?
Offshore coding is simply traditional, manual data entry that is
happening somewhere outside the United States, typically India or the
Philippines. The work is sent offshore to capitalize on cheaper labor
rates than are available here in the United States, where we
have minimum wage laws. The only difference between offshore coding and
onshore coding is where the work is being performed, the familiarity of
the staff with normal English business documents, and the resultant
price of that work. There is no technological advantage in offshore
coding.
All of Valora's services are provisioned directly in our Burlington, MA
(USA) headquarters. Nothing is ever outsourced and Valora does not use
offshore labor of any kind. Additionally, both Valora's Burlington
facility and most of our personnel hold "Public Trust" security
clearance from the United States government. All of Valora's staff are
American citizens.
Advanced
What is "strategic coding"?
Strategic coding begins with a detailed analysis of your document
population. This is usually done with a report that details what kind
of documents you have and in what relative amounts. With this data, a
strategic coding plan is produced that only captures the important
fields per document type, rather than all fields all the time. This
by-the-document approach ensures the best possible value for your
limited coding budget. Valora's diagnostic tool is called the Document
Distribution Report (DDR), which is created within 48 hours of receiving
your data. With the DDR you can understand what's inside your
population, how much of each type of document you have and ultimately,
how best to process it.
You said automated processing reduces costs. How can I use automation to the best cost advantage?
By utilizing a DDR for paper populations and DDR + culling techniques
(de-duping, near-duping, key word searching, etc.) on electronic
populations, you can often get the processing costs under control.
There is no need to fully code every document in a population. Instead,
you can strategically code each document type to its best value. For
example, you can code file folders for title only, but emails for title,
date, author, recipient and copyees.
Why would you code electronic files? What's wrong with metadata?
Converted electronic files consist of three main pools of data: an
image, text and metadata. On some files, metadata is accurate and
useful information. On others, it is non-existent or wrong. Coding of
electronic files ensures complete, accurate bibliographic information
for each document, regardless of its source of origin.
Below is a classic example of metadata versus coded data. Imagine you
have a Word file. This could be a "native" hard drive file or an email
attachment. It contains a letter from Susie Smith to Jimmy Jones. It
is dated inside the letter as 3/3/2002, but the file was opened several
times and resaved. It has the Re line text: "Pursuant to our meeting
in Tampa." Compare the results below:
| |
Metadata |
True Data |
| Title |
letter.doc |
Pursuant to our meeting in Tampa |
| Date |
5/4/2002 |
3/3/2002 |
| Author |
Registered User |
Smith, Susan |
| Recipient |
|
Jones, James |
While the metadata is accurate for the Word file, it is not accurate for
the document content and thus of very limited use. In contrast,
TrueCodingTM, Valora's term for true, accurate coding of electronic
files, captures the true author, title, etc. from the document content.
With TrueCoding, you have 98% accuracy or higher in your coded output.
Next >>
|
|
 |
|
 |
Copyright © 2000-2008 Valora Technologies, Inc. All Rights Reserved.
|