Wed. Sep 28th, 2022
    2010-2018 ARCHIVED CONTENT
    You are viewing ARCHIVED CONTENT released online from 1 April, 2010 to August 24, 2018. Content in this archive site is NOT UPDATED, and links may not function. For current information, go to ComplexDiscovery.com.

    120 Seconds of Predictive Coding – Survey Results

    Provided below for your consideration and use are the in-progress results of the Predictive Coding and Providers: A 120-Second Survey launched by ComplexDiscovery on 2/10/13.  The goal of the survey is to provide a quick, non-comprehensive overview of the use of the technology-assisted review feature of predictive coding among leading eDiscovery providers as represented by those providers. The areas covered in the short survey include:

    • Technology Development (Developed Internally, Licensed Externally or Hybrid Development)
    • Technology Partners (Content Analyst, Equivio, OrcaTec, Recommind or Other)
    • Offering Integration (Architectural Integration, Process Integration or Other)
    • Machine Learning Approach (Supervised Learning, Active Learning or Other)
    • Sampling Approach (Random, Judgmental or Other)

    The in-progress results consist of survey answers harvested directly from the online survey form as completed by provider representatives. Additional survey responses from the eDiscovery provider community will be added to this listing as they are completed.

    Additional responders are welcome and encouraged. Click here to go to survey.

    Updated 7/23/2013

    Current Responders

    • @Legal
    • AccessData
    • AlphaLit
    • Altep
    • BIA
    • Catalyst Repository Systems
    • Content Analyst
    • D4
    • Daegis
    • Deloitte
    • Driven
    • DTI
    • Equivio
    • Exterro
    • Hudson Legal
    • Huron Legal
    • kCura
    • Kroll Ontrack
    • NightOwl Discovery
    • Nuix
    • Orange Legal Technologies
    • OrcaTec
    • Prolorem
    • Rational Retention
    • Recommind
    • RenewData
    • Servient
    • Symantec/Clearwell
    • TCDI
    • UBIC
    • UHY Advisors
    • Valora Technologies
    • Xerox Litigation Services
    • ZyLAB

    Current Responses (By Provider)

    @Legal (CasePoint TAR) @AtLegal

    • Developed Internally: Proprietary development of predictive coding technology.
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.
    • Supervised Learning: Human chooses a set of documents (training set) and feeds the documents into the system. The system learns the difference between responsive and non-responsive documents and classifies remaining documents accordingly.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.
    • Judgmental Sampling: A sampling approach that draws in part from subjective factors when determining inclusion within a sample.

    AccessData (Summation – Predictive Coding Feature) @AccessData Group 

    • Developed Internally: Proprietary development of predictive coding technology.
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.
    • Supervised Learning: Human chooses a set of documents (training set) and feeds the documents into the system. The system learns the difference between responsive and non-responsive documents and classifies remaining documents accordingly.
    • Judgmental Sampling: A sampling approach that draws in part from subjective factors when determining inclusion within a sample.

    AlphaLit (E-Predict | Backstop – Content Analyst)

    • Hybrid Development: Combination of proprietary and licensed predictive coding technology.
    • Technology Partners:  Backstop, Content Analyst
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.
    • Supervised Learning: Human chooses a set of documents (training set) and feeds the documents into the system. The system learns the difference between responsive and non-responsive documents and classifies remaining documents accordingly.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.

    Altep (Tag | Content Analyst) @Altep_Inc

    • Developed Internally: Proprietary development of predictive coding technology.
    • Technology Partners:  Content Analyst
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform. (Available as a standalone application and integrated into the Inspicio offering.)
    • Supervised Learning: Human chooses a set of documents (training set) and feeds the documents into the system. The system learns the difference between responsive and non-responsive documents and classifies remaining documents accordingly.
    • Active Learning: System chooses a set of documents and feeds the documents to humans. Humans make a decision about the documents and then the system applies the learning provided by the humans against the rest of the documents in the collection.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.

    BIA (BIA Predictive Coding Engine | Coda™) @BIA_eDiscovery

    • Developed Internally: Proprietary development of predictive coding technology.
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.
    • Process Integrated: Predictive coding technology is available as an eDiscovery offering, but not integrated into the software architecture of core eDiscovery platform (process integration).
    • Supervised Learning: Human chooses a set of documents (training set) and feeds the documents into the system. The system learns the difference between responsive and non-responsive documents and classifies remaining documents accordingly.
    • Active Learning: System chooses a set of documents and feeds the documents to humans. Humans make a decision about the documents and then the system applies the learning provided by the humans against the rest of the documents in the collection.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.
    • Judgmental Sampling: A sampling approach that draws in part from subjective factors when determining inclusion within a sample.

    Catalyst Repository Systems (Insight Predict) @CatalystSecure

    • Developed Internally: Proprietary development of predictive coding technology.
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.
    • Globally Adaptive K-Nearest Neighbor machine learning approach.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.
    • Judgmental Sampling: A sampling approach that draws in part from subjective factors when determining inclusion within a sample.

    Content Analyst (Conceptual Categorization) @Content_Analyst

    • Developed Internally: Proprietary development of predictive coding technology. (Content Analyst provides analytics technology, not end user solutions).
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.
    • Supervised Learning: Human chooses a set of documents (training set) and feeds the documents into the system. The system learns the difference between responsive and non-responsive documents and classifies remaining documents accordingly (based on licensee approach).
    • Active Learning: System chooses a set of documents and feeds the documents to humans. Humans make a decision about the documents and then the system applies the learning provided by the humans against the rest of the documents in the collection (based on licensee approach).
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample (based on licensee approach).
    • Judgmental Sampling: A sampling approach that draws in part from subjective factors when determining inclusion within a sample (based on licensee approach).

    D4 Discovery (Equivio Relevance – Relativity Assisted Review) @D4Discovery

    • Licensed Externally: Predictive coding technology completely licensed from software developer.
    • Technology Partners:  Equivio, Content Analyst
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.
    • Active Learning: System chooses a set of documents and feeds the documents to humans. Humans make a decision about the documents and then the system applies the learning provided by the humans against the rest of the documents in the collection.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.
    • Judgmental Sampling: A sampling approach that draws in part from subjective factors when determining inclusion within a sample.

    Daegis (Acumen) @Daegis

    • Developed Internally: Proprietary development of predictive coding technology.
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.
    • Active Learning: System chooses a set of documents and feeds the documents to humans. Humans make a decision about the documents and then the system applies the learning provided by the humans against the rest of the documents in the collection.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.

    Deloitte (Dynamic Review) @DeloitteFinSvcs

    • Developed Internally: Proprietary development of predictive coding technology.
    • Process Integrated: Predictive coding technology is available as an eDiscovery offering, but not integrated into the software architecture of core eDiscovery platform (process integration).
    • Supervised Learning: Human chooses a set of documents (training set) and feeds the documents into the system. The system learns the difference between responsive and non-responsive documents and classifies remaining documents accordingly.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.
    • Judgmental Sampling: A sampling approach that draws in part from subjective factors when determining inclusion within a sample.

    Driven (START (Simplified Technology Assisted Review Tool)) (Content Analyst)

    • Developed Internally: Proprietary development of predictive coding technology.
    • Technology Partners:  Content Analyst
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.
    • Supervised Learning: Human chooses a set of documents (training set) and feeds the documents into the system. The system learns the difference between responsive and non-responsive documents and classifies remaining documents accordingly.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.

    DTI (Technology-Assisted Review | Equivio)

    • Licensed Externally: Predictive coding technology completely licensed from software developer.
    • Technology Partner:  Equivio
    • Process Integrated: Predictive coding technology is available as an eDiscovery offering, but not integrated into the software architecture of core eDiscovery platform (process integration).
    • Active Learning: System chooses a set of documents and feeds the documents to humans. Humans make a decision about the documents and then the system applies the learning provided by the humans against the rest of the documents in the collection.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.

    Equivio (Relevance) @Equivio

    • Developed Internally: Proprietary development of predictive coding technology.
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.
    • Active Learning: System chooses a set of documents and feeds the documents to humans. Humans make a decision about the documents and then the system applies the learning provided by the humans against the rest of the documents in the collection.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.

    Exterro (Fusion Predictive Intelligence™) @Exterro

    • Developed Internally: Proprietary development of predictive coding technology.
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.
    • Supervised Learning: Human chooses a set of documents (training set) and feeds the documents into the system. The system learns the difference between responsive and non-responsive documents and classifies remaining documents accordingly.
    • Active Learning: System chooses a set of documents and feeds the documents to humans. Humans make a decision about the documents and then the system applies the learning provided by the humans against the rest of the documents in the collection.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.
    • Judgmental Sampling: A sampling approach that draws in part from subjective factors when determining inclusion within a sample.

    Hudson Legal (Predictive Coding | Recommind – Relativity – Kroll Ontrack) @HudsonLegal

    • Licensed Externally: Predictive coding technology completely licensed from software developer.
    • Technology Partner:  Recommind (Offer Relativity and Kroll Ontrack capabilities also.)
    • Process Integrated: Predictive coding technology is available as an eDiscovery offering, but not integrated into the software architecture of core eDiscovery platform (process integration).
    • Active Learning: System chooses a set of documents and feeds the documents to humans. Humans make a decision about the documents and then the system applies the learning provided by the humans against the rest of the documents in the collection.
    • Random Sampling: A sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.

    Huron Legal (Integrated Analytics | PureDiscovery) @HuronLegal

    • Licensed Externally: Predictive coding technology completely licensed from software developer.
    • Technology Partner:  PureDiscovery
    • Process Integrated: Predictive coding technology is available as an eDiscovery offering, but not integrated into the software architecture of core eDiscovery platform (process integration).
    • Supervised Learning: Human chooses a set of documents (training set) and feeds the documents into the system. The system learns the difference between responsive and non-responsive documents and classifies remaining documents accordingly.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.
    • Judgmental Sampling: A sampling approach that draws in part from subjective factors when determining inclusion within a sample.

    kCura (Relativity Assisted Review | Content Analyst) @kCura

    • Hybrid Development: Combination of proprietary and licensed predictive coding technology.
    • Technology Partner:  Content Analyst
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.
    • Supervised Learning: Human chooses a set of documents (training set) and feeds the documents into the system. The system learns the difference between responsive and non-responsive documents and classifies remaining documents accordingly.
    • Active Learning: System chooses a set of documents and feeds the documents to humans. Humans make a decision about the documents and then the system applies the learning provided by the humans against the rest of the documents in the collection.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.
    • Judgmental Sampling: A sampling approach that draws in part from subjective factors when determining inclusion within a sample.

    Kroll Ontrack (Intelligent Review Technology (IRT) | Ontrack® Inview™) @KrollOntrack

    • Developed Internally: Proprietary development of predictive coding technology.
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.
    • Supervised Learning: Human chooses a set of documents (training set) and feeds the documents into the system. The system learns the difference between responsive and non-responsive documents and classifies remaining documents accordingly.
    • Active Learning: System chooses a set of documents and feeds the documents to humans. Humans make a decision about the documents and then the system applies the learning provided by the humans against the rest of the documents in the collection.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.
    • Judgmental Sampling: A sampling approach that draws in part from subjective factors when determining inclusion within a sample.

    NightOwl Discovery (Predictive Coding | Equivio – Relativity) @NightOwlDMS 

    • Licensed Externally: Predictive coding technology completely licensed from software developer.
    • Technology Partners: Equivio, Relativity
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.
    • Supervised Learning: Human chooses a set of documents (training set) and feeds the documents into the system. The system learns the difference between responsive and non-responsive documents and classifies remaining documents accordingly.
    • Active Learning: System chooses a set of documents and feeds the documents to humans. Humans make a decision about the documents and then the system applies the learning provided by the humans against the rest of the documents in the collection.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.
    • Judgmental Sampling: A sampling approach that draws in part from subjective factors when determining inclusion within a sample.

    Nuix (Automatic Classification) @Nuix

    • Developed Internally: Proprietary development of predictive coding technology.
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.
    • Supervised Learning: Human chooses a set of documents (training set) and feeds the documents into the system. The system learns the difference between responsive and non-responsive documents and classifies remaining documents accordingly.
    • Judgmental Sampling: A sampling approach that draws in part from subjective factors when determining inclusion within a sample.

    Orange Legal Technologies (Predictive Review | OrcaTec) @OrangeLT

    • Hybrid Development: Combination of proprietary and licensed predictive coding technology.
    • Technology Partner:  OrcaTec
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.
    • Active Learning: System chooses a set of documents and feeds the documents to humans. Humans make a decision about the documents and then the system applies the learning provided by the humans against the rest of the documents in the collection.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.

     OrcaTec (OrcaPredict) @OrcaTec

    • Developed Internally: Proprietary development of predictive coding technology.
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.
    • Active Learning: System chooses a set of documents and feeds the documents to humans. Humans make a decision about the documents and then the system applies the learning provided by the humans against the rest of the documents in the collection.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.

    Prolorem (Prolorem eDi)

    • Hybrid Development: Combination of proprietary and licensed predictive coding technology.
    • Technology Partners:  Open Source Licensed APIs and Tools
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.
    • Supervised Learning: Human chooses a set of documents (training set) and feeds the documents into the system. The system learns the difference between responsive and non-responsive documents and classifies remaining documents accordingly.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.

    Rational Retention (Rational Intelligence)

    • Developed Internally: Proprietary development of predictive coding technology.
    • Process Integrated: Predictive coding technology is available as an eDiscovery offering, but not integrated into the software architecture of core eDiscovery platform (process integration).
    • Supervised Learning: Human chooses a set of documents (training set) and feeds the documents into the system. The system learns the difference between responsive and non-responsive documents and classifies remaining documents accordingly.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.
    • Judgmental Sampling: A sampling approach that draws in part from subjective factors when determining inclusion within a sample.

    Recommind (Predictive Coding | Axelerate) @Recommind

    • Developed Internally: Proprietary development of predictive coding technology.
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.
    • Supervised Learning: Human chooses a set of documents (training set) and feeds the documents into the system. The system learns the difference between responsive and non-responsive documents and classifies remaining documents accordingly.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.

    RenewData (Computer-Assisted Coding | Equivio – Content Analyst) @RenewData

    • Licensed Externally: Predictive coding technology completely licensed from software developer.
    • Technology Partners:  Equivio, Content Analyst
    • Process Integrated: Predictive coding technology is available as an eDiscovery offering, but not integrated into the software architecture of core eDiscovery platform (process integration).
    • Supervised Learning: Human chooses a set of documents (training set) and feeds the documents into the system. The system learns the difference between responsive and non-responsive documents and classifies remaining documents accordingly.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.

    Servient (Servient Predictive Review) @Servient

    • Developed Internally: Proprietary development of predictive coding technology.
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.
    • Supervised Learning: Human chooses a set of documents (training set) and feeds the documents into the system. The system learns the difference between responsive and non-responsive documents and classifies remaining documents accordingly.
    • Active Learning: System chooses a set of documents and feeds the documents to humans. Humans make a decision about the documents and then the system applies the learning provided by the humans against the rest of the documents in the collection.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.
    • Judgmental Sampling: A sampling approach that draws in part from subjective factors when determining inclusion within a sample.

    Symantec (Clearwell Systems) (Transparent Predictive Coding)  @SYMCeDiscovery

    • Developed Internally: Proprietary development of predictive coding technology.
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.
    • Supervised Learning: Human chooses a set of documents (training set) and feeds the documents into the system. The system learns the difference between responsive and non-responsive documents and classifies remaining documents accordingly.
    • Active Learning: System chooses a set of documents and feeds the documents to humans. Humans make a decision about the documents and then the system applies the learning provided by the humans against the rest of the documents in the collection.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.
    • Judgmental Sampling: A sampling approach that draws in part from subjective factors when determining inclusion within a sample.

    TCDI (Suggestive Coding | Content Analyst)

    • Hybrid Development: Combination of proprietary and licensed predictive coding technology.
    • Technology Partner:  Content Analyst
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.
    • Supervised Learning: Human chooses a set of documents (training set) and feeds the documents into the system. The system learns the difference between responsive and non-responsive documents and classifies remaining documents accordingly.
    • Active Learning: System chooses a set of documents and feeds the documents to humans. Humans make a decision about the documents and then the system applies the learning provided by the humans against the rest of the documents in the collection.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.
    • Judgmental Sampling: A sampling approach that draws in part from subjective factors when determining inclusion within a sample.

    UBIC (CJK TAR)

    • Developed Internally: Proprietary development of predictive coding technology.
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.
    • Supervised Learning: Human chooses a set of documents (training set) and feeds the documents into the system. The system learns the difference between responsive and non-responsive documents and classifies remaining documents accordingly.
    • Active Learning: System chooses a set of documents and feeds the documents to humans. Humans make a decision about the documents and then the system applies the learning provided by the humans against the rest of the documents in the collection.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.
    • Judgmental Sampling: A sampling approach that draws in part from subjective factors when determining inclusion within a sample.

    UHY Advisors (Assisted Review | Relativity)

    • Licensed Externally: Predictive coding technology completely licensed from software developer.
    • Technology Partners: Relativity
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.
    • Supervised Learning: Human chooses a set of documents (training set) and feeds the documents into the system. The system learns the difference between responsive and non-responsive documents and classifies remaining documents accordingly.
    • Active Learning: System chooses a set of documents and feeds the documents to humans. Humans make a decision about the documents and then the system applies the learning provided by the humans against the rest of the documents in the collection.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.
    • Judgmental Sampling: A sampling approach that draws in part from subjective factors when determining inclusion within a sample.

    Valora Technologies (Valora Auto Review Services) @ValoraTech

    • Developed Internally: Proprietary development of predictive coding technology.
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.
    • Probabilistic Hierarchical Context-Free Grammars approach to machine learning.
    • Stratified Sampling: A statistical sampling approach in which populations are stratified by attributes (which can be automatically obtained or obtained from human input), then selected for sampling so that the sample strata match the population strata.

    Xerox Litigation Services (CategoriX) @Xerox_XLS

    • Developed Internally: Proprietary development of predictive coding technology.
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.
    • Supervised Learning: Human chooses a set of documents (training set) and feeds the documents into the system. The system learns the difference between responsive and non-responsive documents and classifies remaining documents accordingly.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.
    • Judgmental Sampling: A sampling approach that draws in part from subjective factors when determining inclusion within a sample.

    Xerox Litigation Services – Lateral Data (View Point Assisted Review) @Xerox_XLS

    • Developed Internally: Proprietary development of predictive coding technology.
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.
    • Active Learning: System chooses a set of documents and feeds the documents to humans. Humans make a decision about the documents and then the system applies the learning provided by the humans against the rest of the documents in the collection.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.
    • Judgmental Sampling: A sampling approach that draws in part from subjective factors when determining inclusion within a sample.

    ZyLAB (ZyLAB Technology-Assisted Review) @ZyLAB

    • Developed Internally: Proprietary development of predictive coding technology.
    • Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.
    • Supervised Learning: Human chooses a set of documents (training set) and feeds the documents into the system. The system learns the difference between responsive and non-responsive documents and classifies remaining documents accordingly.
    • Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.

    Current Responses (By Answer)

    Technology Development

    Developed Internally: Proprietary development of predictive coding technology.

    • @Legal
    • AccessData
    • Altep
    • BIA
    • Catalyst Repository Systems
    • Content Analyst (Content Analyst provides analytics technology, not end user solutions).
    • Daegis
    • Deloitte
    • Driven
    • Equivio
    • Exterro
    • Nuix
    • OrcaTec
    • Kroll Ontrack
    • Rational Retention
    • Recommind
    • Servient
    • Symantec/Clearwell
    • UBIC
    • Valora Technologies
    • Xerox Litigation Services (Viewpoint Assisted Review and CategoriX)
    • ZyLAB

    Licensed Externally: Predictive coding technology completely licensed from software developer.

    • D4 (Equivio, Content Analyst)
    • DTI (Equivio)
    • Hudson Legal
    • Huron Legal (PureDiscovery)
    • NightOwl Discovery
    • RenewData (Equivio, Content Analyst)
    • UHY Advisors

    Hybrid Development: Combination of proprietary and licensed predictive coding technology.

    • AlphaLit (Backstop, Content Analyst)
    • kCura (Content Analyst)
    • Orange Legal Technologies (OrcaTec)
    • Prolorem (Open Source Licensed APIs and Tools)
    • TCDI (Content Analyst)

    Offering Integration

    Architecturally Integrated: Predictive coding technology is integrated into the software architecture (application integration) of core eDiscovery platform.

    • @Legal
    • AccessData
    • AlphaLit
    • Altep (Available as a standalone offering or as part of the Inspicio offering.)
    • Catalyst Repository Systems
    • Content Analyst (Content Analyst provides analytics technology, not end user solutions.)
    • D4
    • Daegis
    • Driven
    • Equivio
    • Exterro
    • kCura
    • Kroll Ontrack
    • NightOwl Discovery
    • Nuix
    • OrcaTec
    • Prolorem
    • Recommind
    • Servient
    • Symantec/Clearwell
    • TCDI
    • UBIC
    • UHY Advisors
    • Valora Technologies
    • Xerox Litigation Services (Viewpoint Assisted Review and CategoriX)
    • ZyLAB

    Process Integrated: Predictive coding technology is available as an eDiscovery offering, but not integrated into the software architecture of core eDiscovery platform (process integration).

    • BIA
    • DTI
    • Deloitte
    • Hudson Legal
    • Huron Legal
    • Orange Legal Technologies
    • Rational Retention
    • RenewData

    Machine Learning Approach

    Supervised Learning: Human chooses a set of documents (training set) and feeds the documents into the system. The system learns the difference between responsive and non-responsive documents and classifies remaining documents accordingly.

    • @Legal
    • AccessData
    • AlphaLit
    • Altep
    • BIA
    • Content Analyst (Based on licensee machine learning approach as Content Analyst provides analytics technology, not end user solutions).
    • Deloitte
    • Driven
    • Exterro
    • Huron Legal
    • kCura
    • Kroll Ontrack
    • NightOwl Discovery
    • Nuix
    • Orange Legal Technologies
    • OrcaTec
    • Prolorem
    • Rational Retention
    • Recommind
    • RenewData
    • Servient
    • Symantec/Clearwell
    • TCDI
    • UBIC
    • UHY Advisors
    • Valora Technologies
    • Xerox Litigation Services (CategoriX)
    • ZyLAB

    Active Learning: System chooses a set of documents and feeds the documents to humans. Humans make a decision about the documents and then the system applies the learning provided by the humans against the rest of the documents in the collection.

    • Altep
    • BIA
    • Content Analyst (Based on licensee machine learning approach as Content Analyst provides analytics technology, not end user solutions).
    • Exterro
    • D4
    • Daegis
    • DTI
    • Equivio
    • Exterro
    • Hudson Legal
    • kCura
    • Kroll Ontrack
    • NightOwl Discovery
    • Orange Legal Technologies
    • OrcaTec
    • Servient
    • Symantec/Clearwell
    • TCDI
    • UBIC
    • UHY Advisors
    • Xerox Litigation Services (Viewpoint Assisted Review)

    Other Machine Learning Approaches:

    • Catalyst Repository Systems:  Globally Adaptive K-Nearest Neighbor machine learning approach.
    • Valora Technologies: Probabilistic Hierarchical Context-Free Grammars approach to machine learning.

    Sampling Approach

    Random Sampling: A statistical sampling approach that gives each document an equal chance of being chosen for inclusion within a sample.

    • @Legal
    • AlphaLit
    • Altep
    • BIA
    • Catalyst Repository Systems
    • Content Analyst (Based on licensee sampling approach as Content Analyst provides analytics technology, not end user solutions).
    • Exterro
    • D4
    • Daegis
    • Deloitte
    • Driven
    • DTI
    • Equivio
    • Exterro
    • Hudson Legal
    • Huron Legal
    • kCura
    • Kroll Ontrack
    • NightOwl Discovery
    • Orange Legal Technologies
    • OrcaTec
    • Prolorem
    • Rational Retention (Preferred Method)
    • Recommind
    • RenewData
    • Servient
    • Symantec/Clearwell
    • TCDI
    • UBIC
    • UHY Advisors
    • Xerox Litigation Services (Viewpoint Assisted Review and CategoriX)
    • ZyLAB

    Judgmental Sampling: A sampling approach that draws in part from subjective factors when determining inclusion within a sample.

    • @Legal
    • AccessData
    • BIA
    • Catalyst Repository Systems
    • Content Analyst (Based on licensee machine sampling approach as Content Analyst provides analytics technology, not end user solutions).
    • Exterro
    • D4
    • Deloitte
    • Exterro
    • Huron Legal
    • kCura
    • Kroll Ontrack
    • NightOwl Discovery
    • Nuix
    • Rational Retention
    • Servient
    • Symantec/Clearwell
    • TCDI
    • UBIC
    • UHY Advisors
    • Xerox Litigation Services (Viewpoint Assisted Review and CategoriX)

    Other Sampling Approach

    • Valora Technologies: Stratified Sampling: A statistical sampling approach in which populations are stratified by attributes (which can be automatically obtained or obtained from human input), then selected for sampling so that the sample strata match the population strata.

    End of Survey Results

    Updated to remove “Statistical” from Sampling Approach “header” and from Judgmental Sampling definition.

    MountainsofData

     

     

    Have a Request?

    If you have information or offering requests that you would like to ask us about, please let us know and we will make our response to you a priority.

    ComplexDiscovery is an online publication that highlights cyber, data, and legal discovery insight and intelligence ranging from original research to aggregated news for use by cybersecurity, information governance, and eDiscovery professionals. The highly targeted publication seeks to increase the collective understanding of readers regarding cyber, data, and legal discovery information and issues and to provide an objective resource for considering trends, technologies, and services related to electronically stored information.

    ComplexDiscovery OÜ is a technology marketing firm providing strategic planning and tactical execution expertise in support of cyber, data, and legal discovery organizations. Focused primarily on supporting the ComplexDiscovery publication, the company is registered as a private limited company in the European Union country of Estonia, one of the most digitally advanced countries in the world. The company operates virtually worldwide to deliver marketing consulting and services.

    Data Corpus Minimization? IDC Recognizes DISCO as Early Case Assessment Software Leader

    “As innovation continues to impact and disrupt the legal world, we’ve...

    Building a Cybersecurity Workforce? The European Cybersecurity Skills Framework

    According to ENISA's Executive Director, Juhan Lepassaar, "The future security of...

    Leaning Forward? The CISA 2023-2025 Strategic Plan

    The purpose of the CISA Strategic Plan is to communicate the...

    Continuous Risk Improvement? Q3 Cyber Round-Up From Cowbell Cyber

    According to Manu Singh, director of risk engineering at Cowbell, "Every...

    Revealing Response? Nuix Responds to ASX Request for Information

    The following investor news update from Nuix shares a written response...

    Revealing Reports? Nuix Notes Press Speculation

    According to a September 9, 2022 market release from Nuix, the...

    Regards to Broadway? HaystackID® Acquires Business Intelligence Associates

    According to HaystackID CEO Hal Brooks, “BIA is a leader in...

    One Large Software and Cloud Business? OpenText to Acquire Micro Focus

    According to OpenText CEO & CTO Mark J. Barrenechea, “We are...

    On the Move? 2022 eDiscovery Market Kinetics: Five Areas of Interest

    Recently ComplexDiscovery was provided an opportunity to share with the eDiscovery...

    Trusting the Process? 2021 eDiscovery Processing Task, Spend, and Cost Data Points

    Based on the complexity of cybersecurity, information governance, and legal discovery,...

    The Year in Review? 2021 eDiscovery Review Task, Spend, and Cost Data Points

    Based on the complexity of cybersecurity, information governance, and legal discovery,...

    A 2021 Look at eDiscovery Collection: Task, Spend, and Cost Data Points

    Based on the complexity of cybersecurity, information governance, and legal discovery,...

    Five Great Reads on Cyber, Data, and Legal Discovery for September 2022

    From privacy legislation and special masters to acquisitions and investigations, the...

    Five Great Reads on Cyber, Data, and Legal Discovery for August 2022

    From AI and Big Data challenges to intriguing financial and investment...

    Five Great Reads on Cyber, Data, and Legal Discovery for July 2022

    From lurking business undercurrents to captivating deepfake developments, the July 2022...

    Five Great Reads on Cyber, Data, and Legal Discovery for June 2022

    From eDiscovery ecosystem players and pricing to data breach investigations and...

    Bubble Trouble? eDiscovery Operational Metrics in the Fall of 2022

    In the fall of 2022, 89 eDiscovery Business Confidence Survey participants...

    Cooler Temperatures? Fall 2022 eDiscovery Business Confidence Survey Results

    Since January 2016, 2,874 individual responses to twenty-eight quarterly eDiscovery Business...

    Inflection or Deflection? An Aggregate Overview of Eight Semi-Annual eDiscovery Pricing Surveys

    Initiated in the winter of 2019 and conducted eight times with...

    Changing Currents? Eighteen Observations on eDiscovery Business Confidence in the Summer of 2022

    In the summer of 2022, 54.8% of survey respondents felt that...

    Perception and Reality? Ukraine Conflict Assessments in Maps (September 22 – 26, 2022)

    According to a recent update from the Institute for the Study...

    Nuclear Options? Ukraine Conflict Assessments in Maps (September 17 – 21, 2022)

    According to a recent update from the Institute for the Study...

    Mass Graves and Torture Chambers? Ukraine Conflict Assessments in Maps (September 12 – 16, 2022)

    According to a recent update from the Institute for the Study...

    On The Run? Ukraine Conflict Assessments in Maps (September 7 – 11, 2022)

    According to a recent update from the Institute for the Study...