<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0">
    <channel>
        <title>Autonomy ZANTAZ</title>
        <link>http://autonomy.dciginc.com/</link>
        <description>Autonomy ZANTAZ is the leader in archiving, eDiscovery and Proactive Information Risk Management markets.  It is the only vendor that offers an entire spectrum of Proactive Information Risk Management solutions ranging from real-time policy management, records management and consolidated archiving to early case assessment, enterprise legal hold and EDD, review and production.  ZANTAZ solutions run on common platform, IDOL, which supports more than 100 languages and 1,000 file types.  ZANTAZ solutions are available as hosted services, on-site software or ac ombination of both.  ZANTAZ customers include 9 of the 10 top global law firms, 11 of the fortune 25 and 14 of the top 20 financial securities firms.  Customers include Abbot Laboratories, Capital One, JMP Securities, Johnson and Johnson, Liberty Mutual, Linklaters, Philip Morris International and the US Department of Interior.</description>
        <language>en</language>
        <copyright>Copyright 2008</copyright>
        <lastBuildDate>Wed, 16 Jul 2008 14:56:12 -0600</lastBuildDate>
        <generator>http://www.sixapart.com/movabletype/</generator>
        <docs>http://www.rssboard.org/rss-specification</docs>
        
        <item>
    	    <author>
	        <name>Greg Buckles</name>
        	<uri>http://www.Reason-eD.com</uri>
	    </author>
            <title>The Challenge of Historical Corporate Collections, Leveraging Legal Demands - Part 1 of 2</title>
            <description><![CDATA[<p class="MsoNormal" style="MARGIN: 0in 0in 10pt; TEXT-INDENT: 0.5in"><font face="Calibri" color="#000000" size="3">As corporations slowly face the consequences of unmanaged information assets, they have started to form </font><a href="http://en.wikipedia.org/wiki/Electronically_Stored_Information"><font face="Calibri" color="#800080" size="3">ESI</font></a><font face="Calibri" color="#000000" size="3"> retention policies, acquire email archives and other enterprise technologies needed to track and dispose of newly created communications. It is much simpler to enable policy, process and technology to handle the go forward content than to deal with years or decades of accumulated unstructured content. Most public corporations have existing preservation requirements to deal with on top of possible long term retention regulations. </font></p>
<p class="MsoNormal" style="MARGIN: 0in 0in 0pt; LINE-HEIGHT: normal; mso-layout-grid-align: none"><font color="#000000"><font face="Calibri"><font size="3">The combination of the 2000-2002 public corporate scandals and heavy sanctions like the </font><i><span style="FONT-SIZE: 10pt; mso-bidi-font-family: Arial">Zubulake v. UBS Warburg LLC, et al.</span></i><span style="FONT-SIZE: 10pt; mso-bidi-font-family: Arial">, S.D.N.Y 02 CV 1234 (SAS) 7/20/04; 2004 U.S. Dist. LEXIS (S.D.N.Y, July 20, 2004)</span><font size="3"> case resulted in panicked phone calls from Legal to IT asking, "Are we keeping everything?"We know that IT was not keeping EVERYTHING before, but suddenly tape backup systems were repurposed from one month disaster recovery systems to impromptu preservation systems. Even if the company has faced those growing stockpiles of tape and conquered that mountain, the revised </font></font></font><a href="http://en.wikipedia.org/wiki/Federal_Rules_of_Civil_Procedure"><span style="COLOR: windowtext; TEXT-DECORATION: none; text-underline: none"><font face="Calibri" size="3">FRCP</font></span></a><font size="3"><font color="#000000"><font face="Calibri"> are so new that most General Counsels have been reluctant to approve of the resumption of destruction policies without a full assessment and revision of existing paper-based policies.<span style="FONT-SIZE: 10pt; COLOR: #333333; FONT-FAMILY: Times-Roman"><o:p></o:p></span></font></font></font></p>
<p class="MsoNormal" style="MARGIN: 0in 0in 10pt; TEXT-INDENT: 0.5in"><font size="3"><font color="#000000"><font face="Calibri"><span style="mso-spacerun: yes">&nbsp;</span>This leaves many companies, municipalities and other entities struggling with whether to tackle their historical collections before or after putting new systems and policies in place. If they are not careful, it can paralyze their new retention initiative and leave them at even greater risk for increased cost and even sanctions. It is important to remember that the judiciary is always more critical of your current efforts to satisfy your retention and production obligations as compared to your historical 'gaps'. In this light, we can see that the first priority is to put the process, technology and people in place to prevent ongoing data loss and control the costs as the company moves forward. You cannot change the past, but you can change your capabilities to meet current and future challenges.</font></font></font></p>
<p class="MsoNormal" style="MARGIN: 0in 0in 10pt; TEXT-INDENT: 0.5in"><font face="Calibri" color="#000000" size="3">Does that mean that you just ignore all the old ESI? Do you just ignore the terabytes of 'Litigation Landmines' (PST files) living on local and network drives until a litigation request demands them? That is actually a business cost versus risk decision that every entity above a certain size must make in light of their regulatory or litigation profile. It is definitely not 'best practice' to leave all that unstructured content out there, but the current business environment does not always allow us the luxury of true 'best practice' solutions without a clear short term </font><a href="http://en.wikipedia.org/wiki/Return_on_investment"><font face="Calibri" color="#800080" size="3">ROI</font></a><font face="Calibri" color="#000000" size="3"> justification. </font></p><span style="FONT-SIZE: 11pt; FONT-FAMILY: 'Calibri','sans-serif'; mso-bidi-font-family: 'Times New Roman'; mso-ascii-theme-font: minor-latin; mso-fareast-font-family: Calibri; mso-fareast-theme-font: minor-latin; mso-hansi-theme-font: minor-latin; mso-bidi-theme-font: minor-bidi; mso-ansi-language: EN-US; mso-fareast-language: EN-US; mso-bidi-language: AR-SA"><font color="#000000">One method of attack is to leverage high priority litigation and regulatory events to fund and implement enterprise systems such as search, </font><a href="http://www.zantaz.com/products/idol/index.htm"><font color="#800080">categorization</font></a><font color="#000000">, </font><a href="http://www.zantaz.com/products/information-governance/aungate-real-time-policy/index.htm"><font color="#800080">real time policy enforcement</font></a><font color="#000000">, archiving, </font><a href="http://inmage.dciginc.com/2008/07/cdp-and-backup-software-the-fu.html"><font color="#800080">continuous data protection</font></a><font color="#000000"> and desktop backup. All of these technologies are a part of an overall discovery solution, but have immediate business applications that can justify their cost. The key to getting budget is the open communication and coordination between legal and IT. Instead of spending millions on service providers to respond to a single case, legal should bring IT, the business director and the insurance adjustor to the table to talk over in-house systems. As an example, a shareholder lawsuit might require such broad preservation and collection to justify conversion and ingestion of old tapes into an </font><a href="http://www.zantaz.com/products/archiving/index.htm"><font color="#800080">archive</font></a><font color="#000000"> where they can be searched, categorized and even disposed of properly. Legal is a corporate cost center, but it is time to acknowledge that litigation is a fact of life in corporate America and quit spending like it is a one time event.</font></span>]]></description>
            <link>http://autonomy.dciginc.com/2008/07/the-challenge-of-historical-co.html</link>
            <guid>http://autonomy.dciginc.com/2008/07/the-challenge-of-historical-co.html</guid>
            
            
                <category domain="http://www.sixapart.com/ns/types#tag">Archiving</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">Electronic Discovery</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">eMail Archive</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">Governance Risk and Compliance</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">Legal Hold</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">Litigation Readiness</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">Microsoft Exchange</category>
            
            <pubDate>Wed, 16 Jul 2008 14:56:12 -0600</pubDate>
        </item>
        
        <item>
    	    <author>
	        <name>Greg Buckles</name>
        	<uri>http://www.Reason-eD.com</uri>
	    </author>
            <title>Exploring the Secondary ROI of Discovery Purchases - DLP, Archiving, Endpoint Security</title>
            <description><![CDATA[<p class="MsoNormal" style="MARGIN: 0in 0in 10pt"><font color="#000000"><font size="3"><font face="Calibri"><span style="mso-tab-count: 1">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </span>In looking back at the earliest generations of </font></font></font><a href="http://en.wikipedia.org/wiki/Information_lifecycle_management"><font face="Calibri" color="#800080" size="3">Information Lifecycle Management (ILM)</font></a><font face="Calibri" color="#000000" size="3">, </font><a href="http://en.wikipedia.org/wiki/Business_analytics"><font face="Calibri" color="#800080" size="3">Business Analytics</font></a><font face="Calibri" color="#000000" size="3"> and </font><a href="http://en.wikipedia.org/wiki/Data_loss"><font face="Calibri" color="#800080" size="3">Data Loss Prevention (DLP)</font></a><font face="Calibri" color="#000000" size="3"> products, we can see a wasteland of interesting technology that was too early for the market. We are now seeing the hints of resurgence in products adjacent to enterprise discovery based on the 'secondary benefits' of corporate archiving, preservation and collection. Basically, corporations seem to be recognizing that the infrastructure required to establish an efficient, defensible discovery process can and should be leveraged to provide other business functionality. </font></p>
<p class="MsoNormal" style="MARGIN: 0in 0in 10pt; TEXT-INDENT: 0.5in"><font face="Calibri" color="#000000" size="3">In recent </font><a href="http://autonomy.dciginc.com/2008/06/the-roots-of-search-engines-wh.html"><font face="Calibri" color="#800080" size="3">blogs</font></a><font face="Calibri" color="#000000" size="3">, I have talked about how these early enterprise analytic products have tried to transform themselves into e-Discovery 'solutions' with mixed results. It is sometimes the secondary analytical features of these products that can provide an unexpected ROI far beyond the discovery compliance driver that got the budget out of legal. Although opposing counsel and many courts are not yet ready to rely exclusively on conceptual search and other 'black box' (i.e. not easily understood) technologies to winnow the wheat from the ESI chaff, these systems can dramatically reduce the cost of review and provide essential QA for Boolean criteria.</font></p>
<p class="MsoNormal" style="MARGIN: 0in 0in 10pt; TEXT-INDENT: 0.5in"><font face="Calibri" color="#000000" size="3">More relevant to IT, HR and Compliance needs, enterprise systems can be leveraged to shape data management policies, enforce computer/messaging usage policies and monitor a growing concern, Data Loss Prevention (DLP). These peripheral usage scenarios have not been a big enough pain point to drive widespread adoptions and budget allocation outside of tightly regulated verticals, but they are happy to make use of the new enterprise applications and appliances funded by legal. </font></p>
<p class="MsoNormal" style="MARGIN: 0in 0in 10pt; TEXT-INDENT: 0.5in"><font face="Calibri" color="#000000" size="3">In fact, the fastest ROI on search or archiving platforms frequently comes from resolving potential litigation through Early Case Assessment searches. I recall answering a simple request from a new director regarding the communications over a multi-million dollar contract by the former director. The company had just finished loading all the old email from tapes into an archive to comply with a discovery request, so we had everything possible online. The savings from finding that one email clearly expressing the mutual understanding of a term saved the company twice the price of the archive system. Instant ROI that had nothing to do with the original purpose of the system.</font></p>
<p class="MsoNormal" style="MARGIN: 0in 0in 10pt; TEXT-INDENT: 0.5in"><font face="Calibri" color="#000000" size="3">The biggest potential benefit of conceptual filters and categorization engines is to shift the corporation's capability to a proactive stance, rather than a reactive response. Data Loss Prevention (DLP) is all about preventing critical trade secrets and confidential information from reaching the wrong eyes. The same technology that helps review teams catch potentially privileged email before they are produced can be used to stop your pricing lists and source code from leaving the domain. With the widest ESI type capability, it is not surprising to </font><a href="http://www.autonomy.com/content/News/Releases/2008/0701.en.html"><font face="Calibri" color="#800080" size="3">hear</font></a><font face="Calibri" color="#000000" size="3"> that Autonomy's IDOL platform is used by a majority of the </font><a href="http://www.cbronline.com/article_news.asp?guid=940EFEF8-DC43-4CA7-8840-4282079B34BD"><font face="Calibri" color="#800080" size="3">DLP market leaders</font></a><font face="Calibri" color="#000000" size="3">. The focus is on the messaging systems, but endpoint control and enterprise encryption are also starting to make sales, again driven by the complexity of collecting and preserving ESI from user desktops. So when legal is fighting for budget to address a discovery gap, it is critical to involve the other departments who have a vested interest in gaining these new capabilities and slowly moving the corporation towards the goal of true information management.</font></p>]]></description>
            <link>http://autonomy.dciginc.com/2008/07/exploring-the-secondary-roi-of.html</link>
            <guid>http://autonomy.dciginc.com/2008/07/exploring-the-secondary-roi-of.html</guid>
            
            
                <category domain="http://www.sixapart.com/ns/types#tag">Archiving</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">Continuous Data Protection</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">Data Retention</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">Early Case Assessment</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">Electronic Discovery</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">eMail Archive</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">Information Governance</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">Litigation Readiness</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">Search</category>
            
            <pubDate>Tue, 08 Jul 2008 10:03:48 -0600</pubDate>
        </item>
        
        <item>
    	    <author>
	        <name>Greg Buckles</name>
        	<uri>http://www.Reason-eD.com</uri>
	    </author>
            <title>The Roots of Search Engines - Why you should care what is behind your enterprise Search</title>
            <description><![CDATA[<p class="MsoNormal" style="MARGIN: 0in 0in 10pt"><font color="#000000"><font size="3"><font face="Calibri"><span style="mso-tab-count: 1">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </span>Enterprise level discovery requires enterprise level technology or by definition it becomes 'unduly burdensome' for any but the smallest cases.<span style="mso-spacerun: yes">&nbsp; </span>Recent opinions from federal judges and appellate courts on both coasts have made it clear that the discovery process, technology and personnel are under increasing scrutiny. Plaintiff's counsel are reading the same opinions from Judge John Facciola (<em><span style="FONT-FAMILY: 'Calibri','sans-serif'; mso-ascii-theme-font: minor-latin; mso-hansi-theme-font: minor-latin; mso-bidi-font-family: 'Times New Roman'; mso-bidi-theme-font: minor-bidi">United States v. O'Keefe</span></em>, 537 F. Supp. 2d 14 (D.D.C. 2008)) and the 9<sup>th</sup> Circuit (</font></font></font><a href="http://www.ca9.uscourts.gov/ca9/newopinions.nsf/D2CDDB4098D7AFB28825746C0048ED24/$file/0755282.pdf?openelement"><font face="Calibri" color="#800080" size="3">Quon v. Arch Wireless</font></a><font face="Calibri" color="#000000" size="3">). Many will interpret them as a signal to initiate </font><a href="http://en.wikipedia.org/wiki/Daubert_v._Merrell_Dow_Pharmaceuticals"><i style="mso-bidi-font-style: normal"><font face="Calibri" color="#800080" size="3">Daubert</font></i></a><font size="3"><font color="#000000"><font face="Calibri"><i style="mso-bidi-font-style: normal"> </i>style hearings to force corporations to defend their search engines, communication systems and the discovery effort in response to discovery demands.</font></font></font></p>
<p class="MsoNormal" style="MARGIN: 0in 0in 10pt"><font size="3"><font color="#000000"><font face="Calibri"><span style="mso-tab-count: 1">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </span>Corporate counsel may then turn to the vendor that sold them an 'enterprise solution' and demand an expert to defend the search engine. Some of the vendors have made the serious investment to validate their technology through 3<sup>rd</sup> party testing or by recruiting resident experts. The reality of our industry is that there have been few if any public challenges to the accuracy and completeness of software sold to the e-discovery market. Until recently, most of this software was purchased and run by service providers who had an interest in defending the product that they proposed and operated behind the magic curtains. </font></font></font></p>
<p class="MsoNormal" style="MARGIN: 0in 0in 10pt"><font color="#000000"><font size="3"><font face="Calibri"><span style="mso-tab-count: 1">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </span>The majority of applications now focused on the corporate e-Discovery spend were originally created for information management, knowledge management, process analytics, storage management, disaster recovery and other non-litigation business needs. </font></font></font><a href="http://en.wikipedia.org/wiki/Search_engine_%28computing%29"><font face="Calibri" color="#800080" size="3">Enterprise search engines</font></a><font face="Calibri" color="#000000" size="3"> were mostly born from the research into scaling internet search engines that started back in the </font><a href="http://en.wikipedia.org/wiki/Web_search_engine#History"><font face="Calibri" size="3">early 1990's</font></a><font face="Calibri" color="#000000" size="3">. The problem with web based search is that the systems are optimized for speed and relevance, rather than completeness.<span style="mso-spacerun: yes">&nbsp; </span>A recent </font><a href="http://www.autonomy.com/content/News/Releases/2008/0521a.en.html"><font face="Calibri" color="#800080" size="3">Autonomy release</font></a><font face="Calibri" color="#000000" size="3"> drew attention to the engine 'jump out' issue, where some systems actually stop or skip indexes when they think it unlikely to find a match. </font></p>
<p class="MsoNormal" style="MARGIN: 0in 0in 10pt"><font color="#000000"><font size="3"><font face="Calibri"><span style="mso-tab-count: 1">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </span>Not all applications that use search engines created for the internet are unsuitable for discovery search. The</font></font></font><a href="http://en.wikipedia.org/wiki/Search_engine_indexing"><font face="Calibri" color="#800080" size="3"> index engine</font></a><font face="Calibri" color="#000000" size="3"> 'schema' or configuration controls how it crawls through files to extract out the information needed to get search results as well as how it behaves while running a search. Many enterprise applications like email archives and document management systems utilize open source or OEM'd index engines like </font><a href="http://en.wikipedia.org/wiki/Fast_Search_%26_Transfer"><font face="Calibri" size="3">FAST</font></a><font face="Calibri" color="#000000" size="3">,</font><a href="http://www.autonomy.com/idolserver"><font face="Calibri" size="3"> IDOL</font></a><font face="Calibri" color="#000000" size="3"> and </font><a href="http://en.wikipedia.org/wiki/Lucene"><font face="Calibri" color="#800080" size="3">Lucene</font></a><font face="Calibri" color="#000000" size="3">. Most of these index engines are dependent upon outside software to recognize divergent types of ESI and extract the text in discrete words (a process called tokenization) with a few exceptions like Autonomy's IDOL platform. The configuration and options in a system can result in dropping out critical ESI from the index/search or it can bring the system to its knees by forcing it to try to index file types which contain no text or character sets that it cannot handle. </font></p>
<p class="MsoNormal" style="MARGIN: 0in 0in 10pt"><font size="3"><font color="#000000"><font face="Calibri"><span style="mso-tab-count: 1">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </span>The critical take away from these new cases is that you need to know the capabilities and limitations of your chosen search engine(s). Not just what the software provider tells you it can do, but actually how it performs on your ESI. Think about standing before a court and answering the question, "How do you know that your search got everything that matched your criteria?" Judges are not seeking some impossible level of absolute certainty, but they expect to hear about your reasonable effort to validate your tools. </font></font></font></p>]]></description>
            <link>http://autonomy.dciginc.com/2008/06/the-roots-of-search-engines-wh.html</link>
            <guid>http://autonomy.dciginc.com/2008/06/the-roots-of-search-engines-wh.html</guid>
            
            
                <category domain="http://www.sixapart.com/ns/types#tag">Electronic Discovery</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">eMail Archive</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">Search</category>
            
            <pubDate>Thu, 26 Jun 2008 08:50:01 -0600</pubDate>
        </item>
        
        <item>
    	    <author>
	        <name>Greg Buckles</name>
        	<uri>http://www.Reason-eD.com</uri>
	    </author>
            <title>&quot;All keyword searches are not created equal&quot;</title>
            <description><![CDATA[<p class="MsoNormal" style="MARGIN: 0in 0in 10pt"><font size="3"><font color="#000000"><font face="Calibri"><span style="mso-tab-count: 1">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </span>A recent federal civil case, Stanley v. Creative Pipe (F.Supp.2d ----, 2008 WL 2221841 (D. Md.)) has raised interesting questions about the standard of reasonableness when using keyword searches to select or exclude ESI from a large collection. The fact pattern is a long tangled story about using searches to segregate privileged ESI and boils down to the waiver of privilege of 165 contested documents that were produced because they did not get hits from the search engine used by a computer forensic expert. </font></font></font></p>
<p class="MsoNormal" style="MARGIN: 0in 0in 10pt"><font size="3"><font color="#000000"><font face="Calibri"><span style="mso-tab-count: 1">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </span>The legal ramifications of the case are better left to attorney interpretation, but U.S. Magistrate Judge Paul Grimm made several interesting statements that give insight into how the increasingly sophisticated judiciary is raising the bar on search. The first leads in with, "there is a growing body of literature that highlights the risks associated with conducting an unreliable or inadequate keyword search or relying exclusively on such searches for privilege review."</font></font></font></p>
<p class="MsoNormal" style="MARGIN: 0in 0in 10pt"><font size="3"><font color="#000000"><font face="Calibri"><span style="mso-tab-count: 1">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </span>So the bedrock of retrieval, the venerated keyword search, does not stand all on its own. In this case, the seventy-odd search terms were neither disclosed to, nor agreed upon by the opposing party. The terms were created by counsel and client, then executed by their hired gun. All the items that could not be indexed were manually reviewed for privilege, but it appears that no one reviewed the 'searchable' items that did not contain the list of search terms before being produced. </font></font></font></p>
<p class="MsoNormal" style="MARGIN: 0in 0in 10pt"><font size="3"><font color="#000000"><font face="Calibri"><span style="mso-tab-count: 1">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </span>Judge Grimm continues, "Common sense suggests that even a properly designed and executed keyword search may prove to be over-inclusive or under-inclusive, resulting in the identification of documents as privileged which are not, and non-privileged which, in fact, are. The only prudent way to test the reliability of the keyword search is to perform some appropriate sampling of the documents determined to be privileged and those determined not to be in order to arrive at a comfort level that the categories are neither over-inclusive nor under-inclusive."</font></font></font></p>
<p class="MsoNormal" style="MARGIN: 0in 0in 10pt"><font size="3"><font color="#000000"><font face="Calibri"><span style="mso-tab-count: 1">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </span>Upon receiving the production, the opposing party used a "readily-available desktop search tool." to find some of the formerly privileged items. It is unfortunate that the order does not specify the technology used by each side, but it is clear that quite a bit of effort was expended to manually wade through PDF and TIFF files that could have been searched with the right tools. Even hiring a 'computer forensic expert' did not save the Defendants' from their own decisions. </font></font></font></p>
<p class="MsoNormal" style="MARGIN: 0in 0in 10pt"><span style="mso-tab-count: 1"><font face="Calibri" color="#000000" size="3">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </font></span><a href="http://www.zantaz.com/challenges/index.htm"><font face="Calibri" color="#800080" size="3">Keywords</font></a><font face="Calibri" color="#000000" size="3"> are easy to understand and even easier to misuse. Beyond the science of 'precision and recall' there needs to be process, communication and metrics to reach the standard of reasonable due diligence, this 'comfort level' that Judge Grimm cites. The process needs to identify the known exceptions to your chosen technology. Have you asked or tested for the ability to search across different formats of email, files, databases, images, voice, video and languages? Do you even know the composition of your ESI collection and how that will affect any searches?</font></p>]]></description>
            <link>http://autonomy.dciginc.com/2008/06/all-keyword-searches-are-not-c.html</link>
            <guid>http://autonomy.dciginc.com/2008/06/all-keyword-searches-are-not-c.html</guid>
            
            
                <category domain="http://www.sixapart.com/ns/types#tag">Electronic Discovery</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">eMail Archive</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">Search</category>
            
            <pubDate>Mon, 16 Jun 2008 21:41:36 -0600</pubDate>
        </item>
        
        <item>
    	    <author>
	        <name>Greg Buckles</name>
        	<uri>http://www.Reason-eD.com</uri>
	    </author>
            <title>Are all search results equal?</title>
            <description><![CDATA[<p class="MsoNormal" style="MARGIN: 0in 0in 10pt"><font size="3"><font color="#000000"><font face="Calibri"><span style="mso-tab-count: 1">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </span>Ever since George Socha and other analysts called out the raw size of the eDiscovery market and the rapid growth of technology sales driven by corporations efforts to become compliant with the new Federal Rules of Civil Procedure (FRCP), software developers with dollar signs in their eyes have adapted existing products for the eDiscovery market. The problem is that many or most of them have no real understanding of the unique requirements and challenges of legal search. The Information Lifecycle Management (ILM), Knowledge Management (KM) and Enterprise Content Management (ECM) movements have left the market littered with interesting applications that the corporate market was not yet ready to pay for. Many of these products are now showing up dressed in an eDiscovery GUI and being sold as eDiscovery solutions without even rudimentary testing on the myriad types of corporate ESI. </font></font></font></p>
<p class="MsoNormal" style="MARGIN: 0in 0in 10pt"><font size="3"><font color="#000000"><font face="Calibri"><span style="mso-tab-count: 1">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </span>Just because a system is marketed for use in eDiscovery does not relieve the service provider or corporate legal department from the responsibility to conduct fundamental validation testing of the systems basic functions. Too many customers believe that the software company somehow warrants or guarantees the accuracy of search results, just because it was sold specifically for legal search. The entire industry has suffered from a bit of willful blindness for several years now. Blaming inaccurate results on the software or appliance will not get you much pity from the judiciary when you fail to produce large portions of custodians communications because of the myriad ways that Microsoft Exchange can retain addresses. </font></font></font></p>
<p class="MsoNormal" style="MARGIN: 0in 0in 10pt"><font size="3"><font color="#000000"><font face="Calibri"><span style="mso-tab-count: 1">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </span>At the recent Electronic Discovery Reference Model (EDRM) 2008-2009 project kickoff conference, I was pleased to participate in the formation of a new group focused on legal search. The primary goal of this group is to explore how to document, audit and defend the search process used. The recent Sedona Conference commentaries for Working Group 1 have also called out the need to validate search, culling, review and all other methods that comprise your discovery response process. This is exciting stuff and not just for us techno-junkies. </font></font></font></p>
<p class="MsoNormal" style="MARGIN: 0in 0in 10pt"><font size="3"><font color="#000000"><font face="Calibri"><span style="mso-tab-count: 1">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </span>Even when search engines perform exactly as designed, counsel and their technical support need to understand how the systems handle their unique blend of corporate ESI. The configurations of enterprise messaging and file systems change over time and that actually changes the characteristics and metadata available to search engines. Consider the major changes in how message envelopes are formatted between Exchange 2003 and Exchange 2007. Now consider the fact the most corporate major upgrades are spread over quarters and you realize that almost every large enterprise environment has multiple systems that are running in a mixed mode (i.e. more than one version of the system running simultaneously). </font></font></font></p>
<p class="MsoNormal" style="MARGIN: 0in 0in 10pt"><font size="3"><font color="#000000"><font face="Calibri"><span style="mso-tab-count: 1">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; </span>All of this does not mean that it is impossible to conduct a reasonable, defensible search, preservation or collection. Instead it points out the need to understand your environment and to document the efforts made to test and analyze the capabilities and exceptions of potential eDiscovery technology before putting them to use. With more innovative CIO's looking to implement enterprise search and analytics, it is critical that the legal department collaborates on the system requirements and testing process.</font></font></font></p>
<p class="MsoNormal" style="MARGIN: 0in 0in 10pt; TEXT-INDENT: 0.5in"><font face="Calibri" color="#000000" size="3">This concept of test before you trust applies to service providers as well as applications. The Autonomy Book on Compliance, Risk, Legal and Regulatory Technologies gives excellent basic guidelines for key testing considerations in a Proof of Concept scenario. They expand on these basic concepts in a full whitepaper titled, "A Guide to a Successful PoC: The Dos and Don'ts for Selecting Unstructured Data Management Software." We need more awareness of the pitfalls around implementation and adoption of untested technology and process. Do not let your faith in your existing system keep you from giving it a pop quiz. </font></p>]]></description>
            <link>http://autonomy.dciginc.com/2008/06/are-all-search-results-equal.html</link>
            <guid>http://autonomy.dciginc.com/2008/06/are-all-search-results-equal.html</guid>
            
            
                <category domain="http://www.sixapart.com/ns/types#tag">Electronic Discovery</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">eMail Archive</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">Search</category>
            
            <pubDate>Wed, 04 Jun 2008 13:47:14 -0600</pubDate>
        </item>
        
        <item>
    	    <author>
	        <name>Greg Buckles</name>
        	<uri>http://www.Reason-eD.com</uri>
	    </author>
            <title>Concept Search, where does it fit in discovery?</title>
            <description><![CDATA[<p class="MsoNormal" style="margin: 0in 0in 10pt;"><font size="3"><font color="#000000"><font face="Calibri">This is a common question from large corporations when they are evaluating enterprise systems for search, preservation and collection. The answer is not simple or definitive. First let us look at what conceptual clustering is and what it can and cannot do for us. Conceptual engines use a variety of methods to group common sets of words, phrases and even properties to create associated lists of concepts, topics, facets and even more names than can be relayed here. They all have a different secret sauce for ranking and grouping these related words and properties to extract analytical relationships between the vast numbers of items in the typical collection. <o:p></o:p></font></font></font></p>
<p class="MsoNormal" style="margin: 0in 0in 10pt;"><font size="3"><font color="#000000"><font face="Calibri">A conceptual engine essentially lets the ESI talk for itself. Rather than a person creating a search from their preconceptions of what criteria will retrieve all items related to a given request, the systems analyze and present the items back as folders, dot clusters and other visual diagrams to help the user make sense of the complex relationships. All of this sounds like just what the attorney asked for. "Give me everything relating to this deal." <o:p></o:p></font></font></font></p>
<p class="MsoNormal" style="margin: 0in 0in 10pt;"><font size="3"><font color="#000000"><font face="Calibri">But what one should consider is that concepts are just groups of words or phrases that express similar ideas. They do not directly answer the typical request for production demanding 'any and all documents or communications that constitute, contain, embody, comprise, reflect, identify, state, refer to, deal with, comment on, respond to, describe, involve, mention, discus, record, support or are in any way pertinent to' the <span style="">&nbsp;</span>topic at hand. That means that you cannot just plug in the text of the demand into a conceptual search and expect that you will retrieve everything required. <o:p></o:p></font></font></font></p>
<p class="MsoNormal" style="margin: 0in 0in 10pt;"><font size="3"><font color="#000000"><font face="Calibri">To make matters even more complicated, the secret sauce of conceptual engines is a lot like black magic to those who do not spend all of their time figuring out how to build a better search engine, meaning most of us. Trying to explain to the court how such an engine determined that all these documents were related and why the cluster was named that way can challenge the bravest of testifying experts and drown the veracity of your efforts in a sea of technical fog. <o:p></o:p></font></font></font></p>
<p class="MsoNormal" style="margin: 0in 0in 10pt;"><font color="#000000" face="Calibri" size="3">"So what good are concepts if I cannot search on them?", asks counsel. No one doubts the worth of conceptual analysis to find the key items produced by the opposing party, but how does that help the corporation find all their relevant email? The first step of incident response in the </font><a href="http://www.edrm.net/"><font color="#800080" face="Calibri" size="3">EDRM</font></a><font size="3"><font color="#000000"><font face="Calibri"> model is the Identification phase. The traditional approach is to interview the primary custodians associated with the matter and try to generate the preservation instructions. When dealing with enterprise archives like Autonomy's EAS, that means coming up with search criteria to place your legal holds. How do you know what criteria are related to the matter? Up to this point, counsel inside and out have been doing their best to define the criteria, but few have leverage a defensible, explainable process to support their criteria. <o:p></o:p></font></font></font></p>
<p class="MsoNormal" style="margin: 0in 0in 10pt;"><font size="3"><font color="#000000"><font face="Calibri">Maybe you have already collected the initial, key ESI that the custodians thought relevant to the matter. Or you have run a search using the truly unique terms, names and dates to perform your risk assessment. If you have a collection that you know is relevant, conceptual search engines like Autonomy's IDOL can show you the concepts within your collection to define or expand your actual hard Boolean search criteria. Now you can demonstrate the tools used to verify your due diligence and catch the variations and jargon used to talk about your topic. <o:p></o:p></font></font></font></p><span style="font-size: 11pt; line-height: 115%; font-family: 'Calibri','sans-serif';"><font color="#000000">So I do not feel that counsel and the courts are ready to rely on conceptual search as the sole means to retrieve ESI, but they can enable, defend and control the cost of corporate preservation and collection efforts when used in a structured process. In the end, it is not concept vs. keyword. Instead the products and solutions that integrate them will provide the most bang for the buck.<span style="">&nbsp; </span></font></span>]]></description>
            <link>http://autonomy.dciginc.com/2008/05/concept-search-where-does-it-f.html</link>
            <guid>http://autonomy.dciginc.com/2008/05/concept-search-where-does-it-f.html</guid>
            
            
                <category domain="http://www.sixapart.com/ns/types#tag">Electronic Discovery</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">eMail Archive</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">Legal Hold</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">Litigation Readiness</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">Search</category>
            
            <pubDate>Fri, 23 May 2008 17:45:21 -0600</pubDate>
        </item>
        
        <item>
    	    <author>
	        <name>Greg Buckles</name>
        	<uri>http://www.Reason-eD.com</uri>
	    </author>
            <title>So what about the voicemail?</title>
            <description><![CDATA[<p class="MsoNormal" style="margin: 0in 0in 10pt;"><font size="3"><font color="#000000"><font face="Calibri">The dreaded question plagues discovery vendors, IT and even industry experts shy away from tackling the costs and complexities created by emerging unified communications systems. Office Communications Server 2007 and other communication systems feed divergent media streams into enterprise archives, corporate legal hold repositories and litigation collections. This 'simplification' for the users actually poses serious challenges for search technologies that have traditionally focused exclusively on text. <o:p></o:p></font></font></font></p>
<p class="MsoNormal" style="margin: 0in 0in 10pt;"><font size="3"><font color="#000000"><font face="Calibri">Finding key terms and phrases buried inside of mountains of recorded phone conversations, voice mails and IM chats can devour discovery budgets and send counsel crying 'undue burden' to the court. There seem to be two dominant speech analytics methods: phonetic indexing (first brought to eDiscovery by Nexidia) and transcription or speech-to-text (long dominated by Autonomy's engine which supports both methods). Phonetic search renders sound wave forms into simplified strings of phonemes that can be indexed and searched. This makes the technology effectively content agnostic, but makes it challenging to integrate with text based search. Speech-to-text has been the foundation for automated conceptual search and improvements in speaker recognition technologies have also increased the value in what was effectively raw dialogue. <o:p></o:p></font></font></font></p>
<p class="MsoNormal" style="margin: 0in 0in 10pt;"><font size="3"><font color="#000000"><font face="Calibri">Knowing that one can search digitized conversations, the next question is can users effectively search everything within the enterprise system from unified federated search? There is little doubt that the archiving systems are aggressively pursuing acquisitions, partnerships and development to enable ingestion and indexing of every conceivable data stream. All of them started with email back in the late 1990's.<span style="">&nbsp;&nbsp; </span>For example, Symantec doesn't have audio, but jumped ahead with early products to handle IM, file shares, Sharepoint through merger and acquisition.<o:p></o:p></font></font></font></p>
<p class="MsoNormal" style="margin: 0in 0in 10pt;"><font size="3"><font color="#000000"><font face="Calibri">Mergers inject complexity by requiring integration of technology, services and cultures. Having experienced the Symantec-Veritas integration first hand, it always surprises me to see pundits and bloggers jumping all over the expected personnel departures.<span style="">&nbsp; </span>Moreover, other restructurings that occur in the wake of big M&amp;A moves. Instead of looking at who departed, which products are End-of-Lifed and which partners jump ship, I think that we should look at who stays, fundamental IP integrations and new solution offerings to get a better idea of where the new joint entity is headed. <o:p></o:p></font></font></font></p>
<p class="MsoNormal" style="margin: 0in 0in 10pt;"><font size="3"><font color="#000000"><font face="Calibri">As a former customer of Zantaz's Introspect, I watched press releases closely after the Autonomy acquisition last July. I was surprised to see how quickly all of Zantaz's products were integrated to the IDOL search architecture. A two to three month integration tells me that the business units got the resources and investment needed to make necessary changes under the hood. The back channel continues to be positive.<span style="">&nbsp; </span>For example, after speaking with two big Introspect customers at a conference last week, it was clear that Autonomy had greatly improved search speed and performance. The back end administration and database complexity still seems to be an issue with some customers, but systems that offer enterprise scale will require enterprise level investments in architecture, support and consistent project management to succeed. Perhaps that is at the root of the shift within the Zantaz channel to go up market and target larger sales? <o:p></o:p></font></font></font></p>
<p class="MsoNormal" style="margin: 0in 0in 10pt;"><font size="3"><font color="#000000"><font face="Calibri">If we are going to demand that large public corporations make all of their communication data streams accessible and manageable to respond to the new FRCP requirements, then we have to expect further consolidation in the maze of different applications used today. Discovery, ILM, Retention Management, Enterprise Content Management and all the other flavors of alphabet soup are just ways of saying that companies are responsible for administrating their information assets, including formerly transient forms such as audio, IM, PIN-2-PIN, etc.<span style="">&nbsp;&nbsp; </span></font></font></font><o:p></o:p></p>]]></description>
            <link>http://autonomy.dciginc.com/2008/05/so-what-about-the-voicemail.html</link>
            <guid>http://autonomy.dciginc.com/2008/05/so-what-about-the-voicemail.html</guid>
            
            
                <category domain="http://www.sixapart.com/ns/types#tag">Electronic Discovery</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">eMail Archive</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">Legal Hold</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">Litigation Readiness</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">Search</category>
            
            <pubDate>Wed, 14 May 2008 19:42:34 -0600</pubDate>
        </item>
        
        <item>
    	    <author>
	        <name>Joshua L. Konkle</name>
        	<uri>http://www.dciginc.com/about/joshualkonkle</uri>
	    </author>
            <title>There&apos;s a new sheriff in Information Governance and she goes by Autonomy</title>
            <description><![CDATA[Since 2000 the requirements for broad based archiving of unstructured content have been on the rise.&nbsp; The creation of the majority of unstructured content was widely done in electronic mail; or so we have assumed.&nbsp; However, file attachments have become prolific in the last ten years increasing in both size and quantity within electronic mail.&nbsp; These files must be created by someone and stored somewhere.<br /><br />Anecdotal evidence suggests that these file attachments are stored locally and on file servers.&nbsp; The files are outside the reach of traditional enterprise content management (ECM) and record management business processes.&nbsp; The files are a product of non-deliberate business processes.&nbsp; Non-deliberate processes, as I described in a <a href="http://www.dciginc.com/2008/04/emc-content-archiving-possibility-or-reality-pt2.html">previous blog with DCIGInc.com</a>, are difficult to manage because they involve certain levels of creativity that fluctuate from day to day.<br /><br />Processes that fluctuate according to the needs of the business, department and staff result in unique and unstructured content.&nbsp; The content and processes require a special type of handling, in the form of Intelligent Archiving.&nbsp; The archiving and ECM markets need to radically change their perception of solutions to these unstructured data problems.&nbsp; In an attempt to address this <a href="http://www.dciginc.com/redirect.php?site=http://www.autonomy.com">Autonomy</a> is announcing Autonomy Information Governance, the first intelligent information governance platform.&nbsp; Their plan, according to their <a href="http://www.dciginc.com/redirect.php?site=http://www.autonomy.com/content/News/Releases/2008/0414.en.html">press release</a>, is to address three cost and risk oriented business processes.<br /><br /><span class="mt-enclosure mt-enclosure-image" style="display: inline;"><a href="http://zantaz.dciginc.com/Autonomy%20Information%20Governance%20Platform1.html" onclick="window.open('http://zantaz.dciginc.com/Autonomy%20Information%20Governance%20Platform1.html','popup','width=540,height=405,scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0'); return false"><img src="http://zantaz.dciginc.com/Autonomy%20Information%20Governance%20Platform-thumb-540x405.jpg" alt="Autonomy Information Governance Platform.jpg" class="mt-image-left" style="margin: 0pt 20px 20px 0pt; float: left;" height="405" width="540" /></a></span><br /><br />Compliance, Enterprise Legal Hold and Disposition Management are the three business processes driving up costs and creating risks.&nbsp; Compliance is aimed at automating information control to address and enforce internal risks, such as those felt by <a href="http://www.dciginc.com/redirect.php?site=http://blogs.wsj.com/law/2008/03/05/siemens-bribery-scandal-getting-rooted-out-from-the-inside/">Siemens in the bribery scandals</a>.&nbsp; Enterprise Legal Hold is aimed at automating and managing the requirement to maintain information for specific legal cases.&nbsp; Finally, disposition management is aimed at managing data retention according to the three-way relationship between the burden of user access, risks to the company and need for legal production.<br /><br />On Friday April 11th, 2008, Autonomy held a press briefing in San Francisco to inform news agencies of these plans in the wake of sub-prime credit investment issues, federal rules for civil procedure and potential activism around executive pay at failed companies. <br /><br />At the press briefing, Autonomy had several panelists sharing their thoughts on the subject of unstructured content, risks and business processes.&nbsp; One of the panelists, Browning Marean, partner, DLA Piper US LLP commented on the complexity of the problem, "Millions in legal or internal costs can be incurred just trying to comply with legal hold requests, in addition to fines resulting from not delivering fast enough.&nbsp; It was technology (electronic information) that got us into this mess, and technology will have to get us out."&nbsp; Marean doesn't decry IDOL as the answer, but it is obvious without an index, like Autonomy IDOL, all you have are globs of unstructured data with little or no relationship.<br /><br />To build and establish a relationship, Autonomy is setting in motion their grand plans for an Intelligent Information Governance Platform. At the center of the platform will be IDOL and the same philosophy that founded the company. Dr. Mike Lynch founded Autonomy on the basis of indexing "human friendly" information so that computers could understand its "meaning" and managing sound.&nbsp; Using that philosophy and associated algorithms, Autonomy will leverage IDOL in this new offering.&nbsp; As we break through the silence on Intelligent Information Governance Platform, you can expect more analysis on it <a href="http://zantaz.dciginc.com/">here</a>. <div><br /></div><div><br /></div>]]></description>
            <link>http://autonomy.dciginc.com/2008/04/theres-a-new-sheriff-in-inform.html</link>
            <guid>http://autonomy.dciginc.com/2008/04/theres-a-new-sheriff-in-inform.html</guid>
            
            
                <category domain="http://www.sixapart.com/ns/types#tag">Governance Risk and Compliance</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">Information Governance</category>
            
                <category domain="http://www.sixapart.com/ns/types#tag">Legal Hold</category>
            
            <pubDate>Mon, 14 Apr 2008 10:30:00 -0600</pubDate>
        </item>
        
    </channel>
</rss>