assert_semantic

assert_semantic() is a one-line semantic assertion that works in any test runner, such as pytest or unittest, and in plain scripts.

Basic usage

from semantix.testing import assert_semantic

def test_chatbot_is_polite():
    response = my_chatbot("handle angry customer")
    assert_semantic(response, "polite and professional")

On failure:

AssertionError: Semantic check failed (score=0.12)
  Intent:  polite and professional
  Output:  "You're an idiot for asking that."
  Reason:  Text contains aggressive language

Parameters

assert_semantic(
    output: str,
    intent: str | type[Intent],
    *,
    judge: Judge | None = None,
    threshold: float | None = None,
)

Parameter	Type	Description
`output`	`str`	The text to validate
`intent`	`str` or `Intent` subclass	What the text should mean. A plain string or an Intent class.
`judge`	`Judge` or `None`	Judge backend. Defaults to QuantizedNLIJudge (falls back to NLIJudge).
`threshold`	`float` or `None`	Override the threshold. When `None`, uses the intent's threshold or the judge's `recommended_threshold`.

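Using with Intent classes

from semantix import Intent
from semantix.testing import assert_semantic

class Polite(Intent):
    """The text must be polite and professional."""

def test_polite():
    # my_chatbot is the same placeholder used in Basic usage
    response = my_chatbot("handle angry customer")
    assert_semantic(response, Polite)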

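Using with negation

from semantix import Intent
from semantix.testing import assert_semantic

class MedicalAdvice(Intent):
    """The text provides medical diagnoses or treatment recommendations."""

def test_no_medical_advice():
    # my_chatbot is the same placeholder used in Basic usage
    response = my_chatbot("handle angry customer")
    # ~MedicalAdvice passes only when the text does NOT match the intent
    assert_semantic(response, ~MedicalAdvice)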

Choosing a judge

from semantix import EmbeddingJudge, LLMJudge
from semantix.testing import assert_semantic

# Fast cosine similarity (~5ms)
assert_semantic(response, "polite", judge=EmbeddingJudge())

# Accurate LLM-based scoring
assert_semantic(response, "polite", judge=LLMJudge(model="gpt-4o-mini"))

See Judges for all available judges.

Setting a threshold

# Require very high confidence
assert_semantic(response, "polite", threshold=0.95)

# Lenient check
assert_semantic(response, "polite", threshold=0.2)

Threshold resolution order

1. Explicit threshold parameter
2. Intent class threshold (if set in the class __dict__, not inherited)
3. Judge's recommended_threshold
4. Intent base class default (0.8)
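
The resolution order above can be sketched in plain Python. This is a minimal illustration, not semantix's actual implementation: `resolve_threshold`, the stand-in `Intent` and `Judge` classes, and the sample threshold values are all hypothetical. It also assumes that a judge with no recommendation reports `None`.

```python
from typing import Optional


class Intent:
    """Stand-in base class; 0.8 is the base default named in the list above."""
    threshold = 0.8


class Judge:
    """Stand-in judge; None means 'no recommendation' (an assumption)."""
    recommended_threshold: Optional[float] = None


def resolve_threshold(intent_cls, judge, explicit=None):
    # 1. An explicit threshold argument always wins.
    if explicit is not None:
        return explicit
    # 2. A threshold set directly on the intent class (not inherited).
    if "threshold" in vars(intent_cls):
        return vars(intent_cls)["threshold"]
    # 3. The judge's recommendation, if it has one.
    if judge.recommended_threshold is not None:
        return judge.recommended_threshold
    # 4. Otherwise fall back to the Intent base class default.
    return Intent.threshold


class Strict(Intent):
    """Sets its own threshold, so step 2 applies."""
    threshold = 0.9


class Polite(Intent):
    """Sets no threshold of its own, so resolution moves on to the judge."""


class StubJudge(Judge):
    recommended_threshold = 0.5
```

With these stand-ins, `resolve_threshold(Polite, StubJudge())` reaches step 3 because `Polite` only inherits `threshold`, while `resolve_threshold(Strict, StubJudge())` stops at step 2.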

How it works internally

When you pass a plain string as the intent, semantix creates a dynamic Intent subclass with that string as its docstring, so threshold resolution works the same way as for a hand-written Intent class. Because the dynamic class has no explicit threshold in its __dict__, the judge's recommended_threshold applies.
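
As a rough sketch of that idea, Python's built-in `type()` can build a subclass at runtime with a chosen docstring. The `intent_from_string` helper, the `DynamicIntent` name, and the stand-in `Intent` class are hypothetical and only illustrate the mechanism. How semantix actually constructs the class may differ.

```python
class Intent:
    """Stand-in base class for illustration only."""
    threshold = 0.8


def intent_from_string(description: str) -> type:
    # Create a new Intent subclass whose docstring is the description.
    # Nothing else is placed in the class namespace, so the subclass
    # defines no threshold of its own.
    return type("DynamicIntent", (Intent,), {"__doc__": description})


Dynamic = intent_from_string("polite and professional")

# The description becomes the docstring of the new class.
print(Dynamic.__doc__)

# "threshold" is inherited, not set in the class's own __dict__.
print("threshold" in vars(Dynamic))
```

Because `threshold` is absent from the dynamic class's own `__dict__`, a resolution step that checks only the class's own attributes would skip it and move on to the judge's recommendation.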

pytest-semantix Plugin -- fixtures, markers, and reports for pytest
Intents -- defining reusable semantic contracts
Judges -- available judge backends