content-safety

Here are 29 public repositories matching this topic...

openguardrails / openguardrails

#1 OpenClaw security plugin — protect your OpenClaw with real-time defense against prompt injection, data leaks, and dangerous actions.

guardrails prompt-injection data-leakage-prevention llm-safety ai-gateway content-safety ai-agent-security ai-security-gateway enterprise-ai-guardrails openclaw openclaw-security

Updated Mar 5, 2026
TypeScript

roostorg / coop

Star

Review tool for online safety; provides a dashboard, queues, routing and automatic enforcement rules, and integrations.

trust-and-safety child-safety content-safety

Updated Mar 6, 2026
TypeScript

cristofima / TaskAgent-AgenticAI

Star

An intelligent task management assistant built with .NET, Next.js, Microsoft Agent Framework, AG-UI protocol, and Azure OpenAI, demonstrating Clean Architecture and autonomous AI agent capabilities

sql-server dotnet nextjs postgresql e2e-tests clean-architecture unit-tests agent-framework ai-agent vitest playright azure-open-ai content-safety ag-ui-protocol

Updated Feb 16, 2026
C#

NudeDetect is a Python-based tool for detecting nudity and adult content in images. This project combines the capabilities of the NudeNet library, EasyOCR for text detection, and the Better Profanity library for identifying offensive language in text.

Updated Jan 1, 2025
Python

withinJoel / Content-Moderation-Engine

Sponsor

Star

A JavaScript-based content safety system designed to detect and filter sensitive media in real-time, ensuring platform compliance and user protection.

javascript compliance filtering content-safety content-safety-checks

Updated Jan 27, 2026
JavaScript

Azure-Samples / rai-content-safety-workshop

Star

Step-by-Step tutorial that teaches you how to use Azure Safety Content - the prebuilt AI service that helps ensure that content sent to user is filtered to safeguard them from risky or undesirable outcomes

azure aiml workshop-materials responsible-ai content-safety

Updated Jul 1, 2024
Jupyter Notebook

RafaelParonis / jailbench

Star

🔍 Benchmark jailbreak resilience in LLMs with JailBench for clear insights and improved model defenses against jailbreak attempts.

python flask analytics openai alignment model-evaluation ai-safety security-testing red-teaming model-robustness anthropic litellm content-safety llm-jailbreaks tool-calling llm-benchmark ai-evals textual-tui

Updated Mar 7, 2026
Python

vibheksoni / jailbench

Star

Benchmark LLM jailbreak resilience across providers with standardized tests, adversarial mode, rich analytics, and a clean Web UI.

Updated Aug 12, 2025
Python

sammydeprez / presentations

Star

Technical presentations with hands-on demos

python machine-learning ai jupyter-notebook presentations educational ai-agents azure-ai responsible-ai azure-openai llm prompt-engineering langchain content-safety langgraph

Updated Dec 1, 2025
Jupyter Notebook

Napiersnotes / AlignmentVirusV6

Star

Production-Grade LLM Alignment Engine (TruthProbe + ADT)

alignment ai-safety llm safe-ai content-safety

Updated Jan 10, 2026
Python

OrenGrinker / contentSafetyFilter

Star

A Chrome extension that uses Claude AI to protect users under 18 from inappropriate content by analyzing webpage content in real-time.

nodejs chrome-extension typescript chrome-extensions claude-ai claude-api content-safety claude-3-sonnet

Updated Nov 30, 2024
TypeScript

cristofima / Demo-AzureAIContentSafety

Star

Content moderation (text and image) in a social network demo

angular dot-net azure-storage content-moderation content-safety

Updated Jan 5, 2025
C#

joemathew2004 / Study-Buddy

Star

Study Buddy is a user-friendly AI-powered web app that helps students generate safe, factual study notes and Q&A on any topic. It features user accounts, study history, and strong content safety filters—making learning interactive and secure.

python learning education flask ai study chatbot project webapp qna groq content-safety

Updated Jul 5, 2025
HTML

khanovico / sentinelshield-ai-guard

Star

SentinelShield: Advanced AI content moderation combining Llama Prompt Guard 2, rule-based filtering, and real-time analysis. Protect your applications from harmful content, prompt injection attacks, and inappropriate material with sub-second response times.

python nlp security machine-learning ai moderation content-filtering ai-safety content-moderation fastapi hate-speech-detection prompt-injection content-safety llama-guard prompt-security

Updated Aug 7, 2025
Python

frank-bridges / profanity-checker

Star

profanity checker text moderation

python scraper checker text-processing profanity profanity-filter content-moderation text-sanitization content-safety

Updated Dec 14, 2025

putmanmodel / SteroLLOYD

Star

Public app demo showing LLOYD working with GPT-2.

ai-safety interpretability tone-analysis drift-detection guardrails streamlit content-safety semantic-drift

Updated Aug 10, 2025

amafjarkasi / hsx-context-hygiene-engine

Star

Context hygiene & risk adjudication for LLM pipelines: secrets, PII, prompt-injection, policy redaction & tokenization.

nodejs cli redaction security typescript compliance tokenization policy-engine secret-scanning data-sanitization pii-redaction llm prompt-injection llm-security content-safety context-hygiene

Updated Sep 4, 2025
TypeScript

CharlyProgrammer / Text-content-analyzer

Star

Azure safety content example using python for text analysis

python ai azure azure-sdk-for-python content-safety

Updated Sep 22, 2024
Python

melroyanthony / llm-guardrails

Star

Responsible AI toolkit for LLM applications: PII/PHI redaction, prompt injection detection, bias scoring, content safety filters, and output validation. Framework-agnostic Python library with FastAPI demo.

python compliance hipaa gdpr bias-detection fastapi guardrails responsible-ai pii-redaction llm prompt-injection content-safety

Updated Feb 17, 2026
Python

Tailwind-Stocker / Impact-Analyzer

Star

Impact Analyzer is a web app that helps you detect toxicity and analyze nuance in your writing before publishing, ensuring your content is respectful, clear, and aligned with your intent.

sentiment-analysis web-app toxicity-detection content-safety nuance-analysis content-review

Updated May 29, 2024
HTML

Improve this page

Add a description, image, and links to the content-safety topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the content-safety topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

content-safety

Here are 29 public repositories matching this topic...

openguardrails / openguardrails

roostorg / coop

cristofima / TaskAgent-AgenticAI

darkwaves-ofc / nude-detect

withinJoel / Content-Moderation-Engine

Azure-Samples / rai-content-safety-workshop

RafaelParonis / jailbench

vibheksoni / jailbench

sammydeprez / presentations

Napiersnotes / AlignmentVirusV6

OrenGrinker / contentSafetyFilter

cristofima / Demo-AzureAIContentSafety

joemathew2004 / Study-Buddy

khanovico / sentinelshield-ai-guard

frank-bridges / profanity-checker

putmanmodel / SteroLLOYD

amafjarkasi / hsx-context-hygiene-engine

CharlyProgrammer / Text-content-analyzer

melroyanthony / llm-guardrails

Tailwind-Stocker / Impact-Analyzer

Improve this page

Add this topic to your repo