Deepchecks Logo Deepchecks Suite of Checks

Welcome to Deepchecks!

Deepchecks is the leading tool for validating your machine learning models and data, and it enables doing so with minimal effort. Deepchecks accompanies you through various validation needs such as verifying your data’s integrity, inspecting its distributions, validating data splits, evaluating your model and comparing between different models.

Deepchecks currently supports Tabular Data (deepchecks.tabular) and is in alpha version for Computer Vision (deepchecks.vision).

Get Started

Head over to our quickstart tutorial notebook:

and click on binder badge or on colab badge to launch it and see it in action, or see our Installation to install it locally and continue from there.

Note

The package’s output is suited for running in any Python environment. The output can be conveniently viewed inline in Jupyter Notebooks (or JupyterLab), and it can also be exported as an HTML report, to run it from any IDE.

When Should You Use Deepchecks?

While you’re in the research phase, and want to validate your data, find potential methodological problems, and/or validate your model and evaluate it.

When To Validate - ML Pipeline Schema

See the When Should You Use Section in the User Guide for an elaborate explanation of the typical scenarios.

How Does it Work?

Suites are composed of checks. Each check contains outputs to display in a notebook and/or conditions with a pass/fail/warning output. For more information head over to our Deepchecks Structure in the User Guide.

What Do You Need in Order to Start?

Depending on your phase and what you wish to validate, you’ll need a subset of the following:

  • Raw data (before pre-processing such as OHE, string processing, etc.), with optional labels

  • The model’s training data with labels

  • Test data (which the model isn’t exposed to) with labels

  • A supported model that you wish to validate.
    For tabular data, see supported models.
    For computer vision, we currently support the pytorch framework. See formatters to understand how to integrate with a custom prediciton and label format.

Supported Data Types

Currently the package supports Tabular Data, and is in alpha version for Computer Vision support.

See More

Note

In addition to perusing the documentation, please feel free to to ask questions on our Slack Community, or to post a issue or start a discussion on Github.

For additional usage examples and for understanding the best practices of how to use the package, stay tuned, as this package is in active development!