AI Moderation uses your configured AI provider (OpenAI, Claude, Gemini, DeepSeek, or Grok) to read each chat message and automatically flag content that violates the report-reason categories you choose. It can censor offending messages automatically or queue them for a moderator.
Navigate to Moderation > AI Moderation
AI Moderation does not work on its own — it relies on an AI provider configured in your AI settings.
Tip: You can route moderation to a cheaper, faster model than the one powering your chatbot — assign each task its own provider in AI Settings.
Go to Moderation > AI Moderation:
| Setting | Description |
|---|---|
| Enable AI Moderation | Set to Yes to turn it on. |
| Categories to Moderate | Tick the report-reason categories the AI should enforce (for example Spam, Harassment, Hate Speech). The AI only flags messages that clearly match one of these. |
| Action for Flagged Content | Censor & Auto-Resolve replaces the message and closes the report automatically. Flag for Review keeps the message visible and creates a report for a moderator. |
Click Update to save.
Note: If no categories are selected, AI Moderation stays inactive. The category names come from your report reasons and can be renamed or translated under Appearance > Language.
When the action is Flag for Review, flagged messages appear under Moderation > Flaged Content together with the reason the AI matched. See IP Access & Flagged Content.
AI Moderation handles text messages. To scan uploaded images, use Image Moderation. The two can run at the same time.