Content Moderation

Every post on Circus passes through layered checks before it reaches anyone's screen — hash matching, machine learning, and human review. Here's how it works.

Three layers

Every post passes through three lines of protection before it reaches anyone's screen.

First line

Known harmful content never gets in

Before anything is stored, every upload is checked against databases of known illegal imagery. Exact-match text violations are caught instantly.

Second line

AI reads every post before it goes live

Models scan text, images, and video in parallel. High-confidence violations are blocked automatically; borderline cases go to a person.

Final say

Real people make every tough call

Our Trust & Safety team handles anything the automated systems can't confidently decide. No permanent action is taken by a machine alone.

Automated detection

These systems run on uploads and published content — together, not instead of each other.

Image & video

AWS Rekognition

Detects nudity, graphic content, violence, and visually explicit material
Runs on images and video thumbnails before content goes live
High-confidence hits are blocked; borderline results queue for humans

Language

AWS Comprehend

Flags hate speech, toxic language, and PII in posts, comments, and bios
Works across multiple languages
Feeds risk scores into the review queue alongside member reports

Platform models

Circus-trained ML

Trained on platform-specific patterns — satire vs. harassment, creator context
Retrained on cases our reviewers have decided
Informs prioritization; never issues final enforcement alone

Hash matching

Known-violation database

Compares image and video uploads against law-enforcement CSAM hash lists
Runs before publication — matches never reach the feed
Triggers immediate hold, account suspension, and mandatory reporting

Human review

Automated systems flag likely violations at scale. People make the final call.

Reviews complex, borderline, and high-stakes cases
Confirms every permanent ban before it takes effect
Trained on our Content Guidelines and community policies
Operates around the clock with priority queuing for urgent safety content

Response time targets:

4 hrs Urgent safety reports
(risk to life)

72 hrs Standard violation reports

14 days Appeals (standard)

72 hrs Appeals (active suspension)

Content during review

Flagged or reported content enters one of three states while we decide.

Default

Visible, under review

Most flagged content stays visible while we review. Most reports do not result in removal.

Limited reach

Visibility-restricted

When context matters, we may reduce discovery reach without removing the post while we review.

High confidence

Held pending review

CSAM, graphic violence, or known terrorist imagery is held before publication and reviewed first.

Creators are notified when their content enters review and when a decision is made.

Child safety

Zero tolerance for CSAM

Hash-matching runs on every image and video upload, comparing against law enforcement databases of known child sexual abuse material.

Immediate content block — the material is never published
Automatic account suspension pending review
Mandatory report to relevant authorities, including NCMEC's CyberTipline as required by law before user uploads go live

Content that appears to sexualize minors — even without a hash match — is escalated to specialist reviewers and to the relevant authorities.

Creator tools

Creators manage their communities before anything reaches the Circus safety team.

Content

Pin & remove

Pin posts to keep important content visible. Remove any post from your community instantly — no queue.

Members

Warn, restrict, ban

Issue formal warnings, restrict posting privileges, or remove someone from your community permanently.

Comments

Keyword filters

Block comments containing specific words or phrases. Set controls per post or members-only commenting.

Transparency

We plan to publish transparency reporting on content removed, reports received, government requests, and enforcement actions. Published statistics will live on the safety hub when available — we do not commit to a fixed publication cadence before launch.

Urgent safety issues

For content involving imminent risk to life, contact us directly. Do not wait for the in-app report flow in urgent situations.

safety@circus.app →