Best Practices
Enterprise Content Management
Sherry Roark
4/29/2010
INTRODUCTION
We like most Universities are drowning in documents — from applications for enrollment, student
records, and human resource forms. A clear strategy is needed for our faculty, staff, and departments to
manage their documents throughout their lifecycle.
ECM refers to the technologies, strategies, methods and tools used to capture, manage, store, preserve,
and deliver content and documents related to an organization and its processes.
The four primary categories of ECM include:
o Imaging capture software and scanners needed for paper documents making text
searchable and retrievable
o Workflow automatically routing files to the appropriate places and providing a mechanism
for approvals online
o Collaboration providing communities for team collaboration and making it easy for users to
work together
o Document management using content management software needed for storing, sharing,
security, and accessing
ECM tools allow the management of an enterprise level organization's information. In Information
Technology at the University of Louisville there are four primary tools for ECM to assist in the gathering
and finding of content information. They are SharePoint, KnowledgeLake, Plone, and OnBase.
Each ECM tool has various strengths and weaknesses with respect t o the types of content. Each ECM
tool has been evaluated to create the most successful plan for use by the customer. In this document we
will define each ECM solution and define its intended University use.
The goal of this document of ECM Best Practice is to leverage the strengths of a particular ECM tool and
to create a best practice that will be easy for University departments to follow.
1 Available Solutions
Plone (Public Facing – University Branding) Java or .NET (C#)
Static Content Dynamic Content
Public facing Public facing
Student authentication (LDAP) Student authentication (LDAP)
Simple forms Database supported forms
Automation for the webmaster function Complicated Workflow
Exposed to search engines(Google, Yahoo)
SharePoint (Current / Live Documents) OnBase (Historical Documents)
Electronic Documents Paper to Electronic image documents.
Paper Documents (supported by KnowledgeLake) PeopleSoft integration for lookup and retrieval
Photo Libraries Keyword indexing of documents for search and
retrieval (emplid, grant id).
Extensive collaboration abilities (Blogs, Wiki, Large Volume capture (50+ documents per day)
Discussion Boards)
Versioning , check in – check out process Automated capture and indexing of documents.
Content searchable (within documents) Access to very specific work groups which are
manually defined and maintained by IT staff.
No global student authentication.
Intelligent Workflow Intelligent Workflow for content management.
Security (ability to set your own security) Data retention / archiving
KnowledgeLake (SharePoint Integrated Solution) Desktop Scanners & Copiers
Paper to Electronic image documents for use in Can be integrated to capture paper documents for
SharePoint use with SharePoint / KnowledgeLake or Onbase
Large Volume capture (50+ documents per day)
Mass electronic document loading
2 Characteristics of Content:
2.1 Public Facing (Plone)
Public facing content is available to users on the World Wide Web. The content needs to be accessed by
anyone by the World Wide Web. This content could be used for promotion, education, research, and/or
student recruiting.
There are several things that need to be taken into consideration by making content available to users
via the World Wide Web.
o UofL Branding meets the necessary standards required to promote the University
o All electronic documents need to be made ADA compliant
o Video needs to have a transcript or captioning for ADA compliance
o Documents that are still being edited or revised should be linked through Plone to the
SharePoint environment
2.2 Current or Live Documents (SharePoint)
SharePoint was selected to be used for current or live documents because of its extensive collaboration
tools. Users can interact with one another to complete documents and share document ideas.
SharePoint was chosen to be our live document solution based on the following items.
o Electronic Documents providing communities for team
collaboration and making it easy for users to work together on documents, tasks, contacts,
events, and other information.
o SharePoint Secure internal documents, required login, collaboration, and user controlled access.
o SharePoint Public external documents, updated frequently, and linked from Plone to allow
global outside University viewing using supported Universities branding standards available
through Plone.
2.2.1 KnowledgeLake – Supporting Paper and Electronic Documents
KnowledgeLake was added to the mix to provide a solution for supporting paper and mass electronic use
within SharePoint.
KnowledgeLake brings the bellow features to support SharePoint.
o KnowledgeLake was added to our ECM tool set to allow users to scan documents into
SharePoint.
o Supporting SharePoint by providing a method to support document scanning and indexing
o Electronic Documents providing mass loading of inbound electronic documents for SharePoint
access
o Paper Documents scanning solution to support access in SharePoint
o Indexing content to make document easy to locate
o Searchable content and easy retrieval on demand
2.3 Historical Documents (OnBase)
OnBase was chosen to support historical documents because its ability to compress large amounts of
information.
B elow are the features OnBase performs to support historical documents.
o Paper Documents needed for record keeping and kept in
filing cabinets
o Electronic Documents stored in SharePoint, your hard drive, or I:drive that is no longer accessed
but needed for record retention.
o Searchable content and easy retrieval on demand.
o Workflow solutions for types of documents need to be filed together for easy retrieval.
o Paper Documents needed for record keeping and kept in
filing cabinets
3 Security
3.1 Confidential
To a certain extent all the content management tools can protect the content and present it to the
correct audience. If you would like to maintain your own security SharePoint is the best option for you.
Onbase requires an IT professional to setup security and also make any future security changes.
3.2 Permanent
All the content management tools participate in backup processes and disaster recovery preparation. If
content is deemed a permanent record the best tool may be determined by how the records are
searched.
4 Capture
4.1 Paper -> electronic file -> ECM tool
SharePoint with the use of KnowledgeLake or OnBase can be used for converting paper documents into
an electronic image when the volume is high or low.
4.2 Directory -> ECM tool
All the content management tools are capable of moving an electronic document from a local drive into
the content management tool. SharePoint with the use of KnowledgeLake or OnBase has the additional
capability to scrape a set of documents from a directory and place them as a group within either
document management system.
4.3 Forms based content capture
Plone or Java is capable of presenting a web form and gathering information. The main distinction is the
use of a database to store or retrieve information. Plone forms are easy to create and store information
in a standard file format. Plone can also email form submissions. Java is required if the data is to be
stored in a database or the form is to be pre-populated with data from a database. SharePoint and
OnBase also have form capability. Presently SharePoint not OnBase can be setup for public facing
access via the World Wide Web.
4.4 Automated / Manual Capture
SharePoint with the use of KnowledgeLake or OnBase is oriented towards large volumes of documents
gathered through repeatable processes and stored in a manner to facilitate search and retrieval.
Plone / Java have a similar dichotomy between manual and automated capture. Plone presents content
based on a manual derivation of paths and uploading of content. Java represents the custom
development capability of U of L which can through design automate processes.
4.5 Indexing
4.5.1 Automatic
SharePoint with the use of KnowledgeLake or OnBase are both capable of scraping a directory of files
and automatically indexing the group based on an index file in the same directory. Both provide a
keyword indexing capability that can read from a predefined section of a document and update an
index.
JAVA or .NET (C#) can be developed to accomplish similar levels of automation. This may be needed to
accommodate public facing content.
4.5.1.1 Whole text indexing
SharePoint with the use of KnowledgeLake or OnBase are both capable of whole text indexing. This is
where an index is created for every word of the content. Functionally this is similar to a search engine
on the World Wide Web.
Plone or Java uses this functionality for World Wide Web search engines such as Yahoo and Google.
4.5.1.2 Content off paper
Once paper is converted to an image document there are various technologies to extract content from
the image. Examples of these techniques include bar codes. Both SharePoint with the use of
KnowledgeLake and OnBase has the capability of indexing scanned documents based extracting an index
from the scanned image.
4.5.2 Manual
Manual indexing is another phrase for simply creating a directory path to a document or content. In a
web context the path is part of the URL. SharePoint and Plone the path is incorporated into menu items
and / or displayed as breadcrumbs.
5 Storage
All the content management tools store content in an environment designed to recover from backups
and to be restored in the event of a disaster. The relative capacities of Plone, Java development,
SharePoint and OnBase are similar. This means the size of the content is not a determining factor with
regards to the best content management tool for a particular kind of content.
5.1 Archiving
Archiving is a method of compressing content that should be retained but is not frequently accessed.
SharePoint or OnBase has allowed compression of content to be stored. We are currently evaluating
solutions for archiving SharePoint.
5.2 Versions
Version control is a way of measuring the change that occurs within content. Retention and archiving
have an implication of time versioning is focused on relative change. SharePoint has tools which assist
in the versioning of posted documents.
6 Workflow
When content passes through a series of decision or branching steps this is referred to as workflow.
SharePoint and OnBase offer features to support workflows which track and capture the events that
move the content to the next step. SharePoint uses the .NET (C#) programming language to provide
complex workflow solutions.
7 Integration
Integration is accomplished through COTS (Commercial Off the Shelf) tools that are designed to
integrate two different applications or custom development. The goal of integration is to leverage the
best of differing applications and more fundamentally keep content in one primary place. The
presentation of the content may be best done from a different application than the storage of the same
content.
This section was added to highlight the flow of content through capture, storage, and presentation. All
the Enterprise Content Management tools are capable of integration.
7.1 Data passed out from ECM system
7.1.1 PeopleSoft
SharePoint .NET (C#) workflow, OnBase, and Java custom applications have the capability to present
content based on index values derived within PeopleSoft.
7.2 Data passed into ECM system
7.2.1 Electronic Data Interchange style feed
Both SharePoint and OnBase have EDI capabilities to receive content from other applications in a
predefined format.
7.2.2 Bulk import of records
Both SharePoint with KnowledgeLake and OnBase has functionality to process sets of content with a file
defining the appropriate index.
7.3 Email
At this point the placing of Email into the content management tools is a manual process.