sensitive content moderation#

overview#

plyr.fm allows artists to upload cover art and use their Bluesky avatars. some of this content may be inappropriate for general audiences (nudity, graphic imagery, etc.). rather than blocking uploads, we blur sensitive images by default and let users opt-in to view them.

this follows our core moderation philosophy: information, not enforcement. we flag content and let users decide what they want to see.

current implementation#

database schema#

-- tracks flagged images
CREATE TABLE sensitive_images (
    id SERIAL PRIMARY KEY,
    image_id VARCHAR,          -- R2 image ID (for uploaded images)
    url TEXT,                  -- full URL (for external images like Bluesky avatars)
    reason VARCHAR,            -- why flagged: 'nudity', 'violence', etc.
    flagged_at TIMESTAMPTZ,
    flagged_by VARCHAR         -- admin identifier
);

-- user preference
ALTER TABLE user_preferences ADD COLUMN show_sensitive_artwork BOOLEAN DEFAULT false;

images can be flagged by either:

image_id - for images uploaded to R2 (track artwork, album covers)
url - for external images (Bluesky avatars synced from PDS)

frontend architecture#

┌─────────────────────────────────────────────────────────────────┐
│                        page load                                 │
└─────────────────────────────────────────────────────────────────┘
                              │
        ┌─────────────────────┼─────────────────────┐
        │                     │                     │
        ▼                     ▼                     ▼
┌───────────────┐    ┌───────────────┐    ┌───────────────┐
│ +layout.server│    │ +layout.ts    │    │ +layout.svelte│
│ fetch flagged │    │ pass through  │    │ init client   │
│ images (SSR)  │    │ to pages      │    │ moderation    │
└───────────────┘    └───────────────┘    └───────────────┘
        │                                         │
        ▼                                         ▼
┌───────────────┐                        ┌───────────────┐
│ meta tags use │                        │ SensitiveImage│
│ SSR data      │                        │ component uses│
│ (link preview)│                        │ client store  │
└───────────────┘                        └───────────────┘

two-pronged approach:

SSR - +layout.server.ts fetches flagged images for meta tag filtering (link previews)
client - moderation store fetches same data for runtime blur effect

SensitiveImage component#

wraps any image that might need blurring:

<SensitiveImage src={imageUrl}>
    <img src={imageUrl} alt="..." />
</SensitiveImage>

the component:

checks if src matches any flagged image
applies CSS blur filter if flagged
shows tooltip on hover: "sensitive - enable in portal"
respects user's show_sensitive_artwork preference

matching logic#

function checkImageSensitive(url: string, data: SensitiveImagesData): boolean {
    // exact URL match (for external images)
    if (data.urls.includes(url)) return true;

    // R2 image ID extraction and match
    const r2Match = url.match(/r2\.dev\/([^/.]+)\./);
    if (r2Match && data.image_ids.includes(r2Match[1])) return true;

    const cdnMatch = url.match(/\/images\/([^/.]+)\./);
    if (cdnMatch && data.image_ids.includes(cdnMatch[1])) return true;

    return false;
}

API endpoint#

GET /moderation/sensitive-images

Response:
{
    "image_ids": ["abc123", "def456"],
    "urls": ["https://cdn.bsky.app/..."]
}

returns arrays for SSR compatibility (Sets don't serialize to JSON).

user experience#

default behavior#

all images matching sensitive_images table are blurred
tooltip on hover explains how to enable
link previews (og:image) exclude sensitive images entirely

opt-in flow#

user navigates to portal → "your data"
toggles "sensitive artwork" to enabled
show_sensitive_artwork preference saved to database
all sensitive images immediately unblur

current limitations#

manual flagging only#

the sensitive_images table is currently populated manually by admins. this is "whack-a-mole" moderation - we flag images as we discover them.

future improvements needed:

perceptual hashing - hash images at upload time, detect re-uploads of flagged content
AI detection - integrate NSFW detection API (AWS Rekognition, Google Vision, etc.)
user reporting - let users flag inappropriate content
artist self-labeling - let artists mark their own content as sensitive

no ATProto labels yet#

unlike copyright moderation, sensitive content flags don't emit ATProto labels. this is intentional for now - we're still figuring out the right taxonomy for content labels vs. copyright labels.

future work might include:

content-warning label type
integration with Bluesky's existing content label system
respecting labels from other ATProto services

moderation workflow#

current process#

admin discovers inappropriate image (user report, browsing, etc.)
admin identifies the image source:
- R2 upload: extract image_id from URL
- external: copy full URL
admin inserts row into sensitive_images via SQL or Neon console
image is immediately blurred for all users

example: flagging an R2 image#

-- image URL: https://pub-xxx.r2.dev/images/abc123.jpg
INSERT INTO sensitive_images (image_id, reason, flagged_by)
VALUES ('abc123', 'nudity', 'admin');

example: flagging a Bluesky avatar#

-- avatar URL from artist profile
INSERT INTO sensitive_images (url, reason, flagged_by)
VALUES (
    'https://cdn.bsky.app/img/avatar/plain/did:plc:xxx/bafkrei...@jpeg',
    'nudity',
    'admin'
);

overview.md - moderation philosophy and architecture
copyright-detection.md - automated copyright scanning
atproto-labeler.md - ATProto label emission