Policy Authoring
Creating effective policies is key to successful content moderation. This section covers best practices specific to the Deep Mod platform for writing clear, maintainable policies that balance automation with accuracy.
Quick Reference
Platform Limits by Tier
|
Limit |
Free Tier |
Paid Tiers |
|---|---|---|
|
Rules per policy |
25 |
100 |
|
Active policies |
5 |
Unlimited |
|
Rule content length |
5-160 characters |
5-160 characters |
Additional Requirements:
-
Webhook must be configured at organization level before activating any policy
-
Policy descriptions: 10-500 characters (optional)
Policy Structure
Start with Clear Organization
Policy Naming: Use descriptive names that reflect the policy's purpose, like "Community Guidelines" or "Product Review Standards." Policy names must be 2-100 characters. Each policy gets a unique friendly identifier (URI) automatically generated from the name for API usage.
Rule Groups: Organize rules into logical categories:
-
Safety: User protection, harassment, harmful content
-
Legal: Compliance, copyright, privacy, regulatory requirements
-
Brand: Brand standards, messaging guidelines, competitor mentions
-
Spam: Promotional content, repetitive posts, unwanted links
This organization makes policies easier to understand, manage, and tune over time.
Rule Limits and Planning
Rule Limits:
-
Free tier: Up to 25 rules per policy
-
Paid tiers: Up to 100 rules per policy
Active Policy Limits:
-
Free tier: Maximum 5 active policies
-
Paid tiers: No hard limit
Start Small: Begin with 5-10 high-impact rules covering your most critical content issues. You can always add more rules as you learn how the system performs with your content.
Writing Effective Rules
Rule Content Guidelines
Rule Names: Keep names concise but descriptive (5-80 characters)
-
Good: "Personal Information Disclosure"
-
Avoid: "PII" or "Must not contain any personally identifiable information including but not limited to..."
Rule Content: Be specific and clear (5-160 characters)
-
Good: "text containing phone numbers, email addresses, or home addresses"
-
Good: "content promoting violence or illegal drug sales"
-
Avoid: "personal information" (too vague) or "bad content" (too broad)
Using Rule Presets
Deep Mod provides pre-written rule templates to help you get started quickly:
Available Presets: Financial information, personal data, HIPAA violations, and other common content categories
When to Use Presets:
-
Starting a new policy in a familiar domain
-
Adding standard compliance rules
-
Learning effective rule-writing patterns
Customization: You can modify preset content to match your specific needs after adding them to your policy.
Cloning Existing Policies
You can clone an existing policy to create a new one with the same structure:
-
Clone organization policies you've already configured
-
Clone from system-provided policy templates
-
Optionally override the name and description during cloning
This is useful when creating similar policies for different content types or regions.
Configuration Best Practices
Confidence Thresholds
Confidence thresholds determine how certain the system must be before flagging content. Thresholds are expressed as decimals (e.g., 0.75 = 75%).
Start Conservative: Begin with a 0.75-0.80 (75-80%) confidence threshold
-
Reduces false positives while you learn system behavior
-
Easier to lower threshold later than deal with user complaints
Adjust Based on Results:
-
Too many false positives? Increase threshold
-
Missing obvious violations? Decrease threshold
-
High ambiguous rate? Consider if human review is needed
Review Mode Selection
No Review Mode (noReview): Choose when:
-
You can tolerate some false negatives
-
Content volume is high
-
Processing speed is priority
-
You don't have review team capacity
Human Review Mode (humanReview): Choose when:
-
Accuracy is critical
-
Content decisions have significant consequences
-
You have trained reviewers available
-
Regulatory compliance requires human oversight
When using Human Review mode, reviewers can make granular per-rule decisions—approving or rejecting individual rule matches rather than just the overall content result.
Common Patterns and Anti-Patterns
Effective Rule Patterns
Specific Content Types:
-
"text containing credit card numbers or banking information"
-
"content promoting illegal drug sales or usage"
-
"messages containing threats of physical violence"
Brand Guidelines:
-
"content mentioning competitor brand names"
-
"text using inappropriate language for customer communications"
-
"promotional content without proper disclaimers"
Anti-Patterns to Avoid
Overly Broad Rules:
-
Avoid: "negative content" (too vague)
-
Better: "content expressing hostility toward specific individuals"
Conflicting Rules:
-
Don't create rules that contradict each other
-
Example: One rule allowing "educational content about sensitive topics" while another blocks "any mention of sensitive topics"
Context-Insensitive Rules:
-
Consider how legitimate content might trigger your rules
-
Educational, news, and creative content often discusses sensitive topics appropriately
Testing and Iteration
Policy Testing
Use Test Mode: Always test new policies and rule changes before activating
-
Test with representative content samples
-
Check for unexpected false positives
-
Verify confidence scores align with expectations
Gradual Rollout:
-
Test with small content samples
-
Review results and adjust rules/thresholds
-
Activate for live content monitoring
-
Monitor initial results closely
-
Fine-tune based on real-world performance
Ongoing Optimization
Regular Review: Monthly assessment of policy performance
-
Check false positive/negative rates
-
Review ambiguous case patterns
-
Analyze which rules trigger most frequently
Rule Refinement:
-
Merge similar rules that always trigger together
-
Split overly broad rules that catch too much benign content
-
Remove rules that never trigger or add no value
Documentation: Keep notes on why rules were added and any modifications made. This helps with future maintenance and team handoffs.
Webhook Integration
Prerequisites
Important: You must configure a webhook before activating any policy. Policy activation will fail without a valid webhook endpoint.
Configure your organization webhook in Organization Settings before attempting to activate policies.
Activation Requirements
Webhook Testing: Verify your webhook endpoint before activating policies:
-
Confirms your system can receive results
-
Validates response format handling
-
Tests error handling for failed deliveries
Result Handling
Plan for All Result Types: Ensure your webhook handler processes all result types appropriately:
|
Result |
Description |
|---|---|
|
|
Content passed all rules |
|
|
Content violated one or more rules |
|
|
Results unclear (requires human review) |
|
|
Moderation process was abandoned |
Metadata Usage: Include relevant metadata with moderation requests to help with content routing and analytics
Scaling Considerations
Multiple Policies
Policy Separation: Create separate policies for different content types or risk levels:
-
User-generated content vs. customer support messages
-
Public content vs. internal communications
-
Different geographic regions with varying standards
Shared Learning: Apply lessons learned from one policy to improve others
Tier Considerations: Remember that free tier accounts are limited to 5 active policies. Plan your policy architecture accordingly.
Team Collaboration
Clear Ownership: Assign responsibility for policy maintenance and updates
Change Management: Establish processes for:
-
Requesting policy changes
-
Testing and approving modifications
-
Communicating updates to relevant teams
Advanced Features
AI-Powered Policy Generation
Deep Mod can automatically generate policies from existing documents:
-
Upload PDF documents containing your content guidelines
-
The system extracts rules and organizes them into rule groups
-
Review and refine the generated policy before activating
This is useful when migrating existing moderation guidelines to the platform.
Policy Templates
System-provided templates offer pre-configured policies for common use cases:
-
Use templates as starting points for new policies
-
Clone and customize templates to match your specific requirements
-
Templates cannot be edited directly—clone them first
What's Next
-
Plan Usage & Limits - Monitor policy usage and manage plan limits
-
Interpreting Results - Analyze policy performance and optimize based on results
-
Human Review Workflows - Set up human oversight for ambiguous cases
Effective policy authoring is an iterative process. Start simple, test thoroughly, and refine based on real-world performance to build robust content moderation that serves your specific needs.
Need help with policy design? Contact our support team for guidance on rule writing, threshold tuning, and policy optimization strategies.