AI Moderation

AI Moderation uses your configured AI provider (OpenAI, Claude, Gemini, DeepSeek, or Grok) to read each chat message and automatically flag content that violates the report-reason categories you choose. It can censor offending messages automatically or queue them for a moderator.

Navigate to Moderation > AI Moderation

How It Works


Step 1 — Set Up an AI Provider (Required)

AI Moderation does not work on its own — it relies on an AI provider configured in your AI settings.

  1. Go to General Settings > AI and enable at least one provider by entering its API key. The AI Settings article lists exactly where to generate a key for each provider (OpenAI, Anthropic/Claude, Google Gemini, DeepSeek, Grok).
  2. In the same page's Task Configuration section, set the Content Moderation task to Default or to a specific provider. If this task is set to Disabled, AI Moderation will not run.

Tip: You can route moderation to a cheaper, faster model than the one powering your chatbot — assign each task its own provider in AI Settings.


Step 2 — Configure AI Moderation

Go to Moderation > AI Moderation:

SettingDescription
Enable AI ModerationSet to Yes to turn it on.
Categories to ModerateTick the report-reason categories the AI should enforce (for example Spam, Harassment, Hate Speech). The AI only flags messages that clearly match one of these.
Action for Flagged ContentCensor & Auto-Resolve replaces the message and closes the report automatically. Flag for Review keeps the message visible and creates a report for a moderator.

Click Update to save.

Note: If no categories are selected, AI Moderation stays inactive. The category names come from your report reasons and can be renamed or translated under Appearance > Language.


Reviewing Flagged Content

When the action is Flag for Review, flagged messages appear under Moderation > Flaged Content together with the reason the AI matched. See IP Access & Flagged Content.


Cost & Performance

AI Moderation vs Image Moderation

AI Moderation handles text messages. To scan uploaded images, use Image Moderation. The two can run at the same time.