Skip to content
LegalSifter merges with Contract Logix, a leader in data-driven contract management

The Sifter Factory™

A First-of-Its-Kind Factory

We develop our Sifters® with machine learning and natural language processing to identify a specific concept like a hidden indemnity provision. Our lawyers find hundreds or thousands of examples of a specific concept. Our Data Science team finds the intangible common thread that binds the writing across all styles and grammatical flairs. Our Sifter Factory produces new and updated Sifters weekly. 

Our Sifters use the latest AI technologies

Our Sifters use machine learning (ML), natural language processing (NLP), and generative AI (GenAI).

What is machine learning?
Machine learning is a subfield of artificial intelligence, which is broadly defined as the capability of a machine to imitate intelligent human behavior. Artificial intelligence systems use ML to perform complex tasks in a way that is similar to how humans solve problems.
What is natural language processing?
Natural language processing (NLP) combines computational linguistics, machine learning, and deep learning models to process human language. NLP teaches computers to interpret human language as written, regardless of style. Each Sifter uses NLP to convert text into meaning. 
What is GenAI?

Generative artificial intelligence (GenAI) is artificial intelligence capable of generating text, images, or other data using generative models and often in response to prompts. Generative AI models learn the patterns and structure of their input training data and then generate new data that has similar characteristics.

Fun Facts about the Sifter Factory:

The only one of its kind in business
  • Each Sifter is built in our Sifter Factory by a team of data scientists and attorneys equipped with the most advanced AI in the world to train algorithms that can read a contract.

  • Each Sifter goes through a 9-stage development process involving 1-2 data scientists and 2-3 lawyers. We release new and improved Sifters every week to all of our clients at no additional cost.

  • For the past several years, our Sifters come out of the Sifter Factory with 95-97% F1 scores. These scores continue to improve over time and with use.

How it works

Our clients constantly contribute to the ongoing improvement of Sifters.

  • Our Sifters continuously improve with crowdsourced feedback from our clients, partners, and team members.

  • The Sifter Trainer is a feature of our software for users and our Sifting Services team to report Sifter errors.

  • Improved Sifters are released back into production weekly.

  • When we started in 2013, our Sifters were performing at F1 levels of around 90%. Now they exceed 97% thanks to technology advances and ongoing feedback from our users.

Benefits to you

  • Our clients are up and running immediately since they don't have to train any models before using our software.

  • New clients get the benefit of all the learnings gained over our company's history and from our other clients, built into the Sifters themselves.

How we measure Sifter quality

  • We measure Sifter quality with an F1 score, a measure of predictive performance commonly used in statistical analysis of binary classification and information retrieval systems.

  • F1 combines both precision and recall into a single metric, providing a balanced assessment of a model’s effectiveness.

  • Here’s how it’s calculated:
    ·       Precision: The number of true positive results divided by the total number of samples predicted to be positive (including those not identified correctly).
    ·       Recall: The number of true positive results divided by the total number of samples that should have been identified as positive.

    The F1 score is the harmonic mean of precision and recall. It symmetrically represents both precision and recall in one metric. The value of the F1 score lies between 0 and 1, with 1 indicating perfect precision and recall, and 0 if either precision or recall is zero. 

    The F1 score balances the trade-off between precision (positive predictive value) and recall (sensitivity) and is commonly used to assess a model’s performance.
Lawyer image_4
Negotiate contracts confidently with self-service AI software that includes comprehensive guidance.​

Learn More