How I Study AI - Learn AI Papers & Lectures the Easy Way

Bielik Guard: Efficient Polish Language Safety Classifiers for LLM Content Moderation

Intermediate

Krzysztof Wróbel, Jan Maria Kowalski et al.Feb 8arXiv

Bielik Guard is a pair of small but strong Polish language safety models that check text for five kinds of risky content: hate/aggression, vulgar language, sexual content, crime, and self-harm.

#Polish NLP#content moderation#safety classifier

Papers1

Bielik Guard: Efficient Polish Language Safety Classifiers for LLM Content Moderation