
New Label

[1]:
from deepchecks.checks.integrity.new_label import NewLabelTrainTest
from deepchecks.base import Dataset
import pandas as pd
[2]:
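# Two datasets with completely disjoint label values.
# The first dataset passed to run() is the reference; labels that appear
# only in the second dataset are reported as new.
test_data = {"col1": [0, 1, 2, 3] * 10}
val_data = {"col1": [4, 5, 6, 7, 8, 9] * 10}
test = Dataset(pd.DataFrame(data=test_data), label="col1", label_type="classification_label")
val = Dataset(pd.DataFrame(data=val_data), label="col1", label_type="classification_label")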
[3]:
NewLabelTrainTest().run(test, val)

New Label Train Test

Find new labels in test.

Additional Outputs
Label column    Percent new labels in sample    New labels
col1            100%                            [4, 5, 6, 7, 8, 9]
[4]:
# Here the label is col2, so only col2 values are compared; the new value
# "d" in col1 is ignored because col1 is a regular feature column.
test_data = {"col1": ["a", "b", "a", "c"] * 10, "col2": [1, 2, 2, 3] * 10}
val_data = {"col1": ["a", "b", "d"] * 10, "col2": [1, 4, 5] * 10}
test = Dataset(pd.DataFrame(data=test_data), label="col2", label_type="classification_label")
val = Dataset(pd.DataFrame(data=val_data), label="col2", label_type="classification_label")
[5]:
NewLabelTrainTest().run(test, val)

New Label Train Test

Find new labels in test.

Additional Outputs
Label column    Percent new labels in sample    New labels
col2            66.67%                          [4, 5]
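
The two figures reported above can be reproduced by hand, which helps when reading the output. The sketch below is not the deepchecks implementation; it is a plain-Python approximation that assumes "new labels" means label values present in the second dataset but absent from the first, and that the percentage is the share of samples in the second dataset carrying such a label. The helper name `new_label_summary` is hypothetical.

```python
def new_label_summary(reference_labels, checked_labels):
    """Return (sorted new labels, fraction of checked samples that carry one).

    Hypothetical helper for illustration only; not part of deepchecks.
    """
    seen = set(reference_labels)
    # Label values that occur in the checked data but never in the reference.
    new_labels = sorted(set(checked_labels) - seen)
    if not checked_labels:
        return new_labels, 0.0
    # Count samples (not distinct values) whose label is new.
    n_new = sum(1 for label in checked_labels if label not in seen)
    return new_labels, n_new / len(checked_labels)


# Label columns from the second example (col2).
reference = [1, 2, 2, 3] * 10
checked = [1, 4, 5] * 10

new_labels, fraction = new_label_summary(reference, checked)
print(new_labels, f"{fraction:.2%}")  # [4, 5] 66.67%
```

In the second example, 20 of the 30 samples in the second dataset carry the label 4 or 5, which gives 66.67%. In the first example every sample carries an unseen label, so the share is 100%.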