Skip to content

TrustThink AI Cybersecurity Hub

Nemo Guardrails

TrustThink AI Cybersecurity Hub

Home
Cheatsheet
MCP Fact Sheet
AI Fundamentals
AI Fundamentals
- Models & Architectures
  Models & Architectures
  - Convolutional Neural Networks (CNNs)
    Convolutional Neural Networks (CNNs)
    
    Overview
    
    ResNet
    
    EfficientNet
    
    ConvNeXt
    
    MobileNet
  - Classical ML
    Classical ML
    
    Overview
    
    Linear Models
    
    Tree-Based Models
    
    Support Vector Machines (SVMs)
    
    k-Nearest Neighbors (kNN)
  - Diffusion Models
    Diffusion Models
    
    Overview
  - Generative Adversarial Networks (GANs)
    Generative Adversarial Networks (GANs)
    
    Overview
  - Recurrent Neural Networks (RNNs)
    Recurrent Neural Networks (RNNs)
    
    Overview
  - Transformers
    Transformers
    
    Overview
- Datasets & Benchmarks
  Datasets & Benchmarks
Security Guidance
Security Guidance
- Agentic AI
  Agentic AI
  - Agent Name Service
  - Model Context Protocol
    Model Context Protocol
    
    MCP Fact Sheet
    
    CSA MCP Security
  - OWASP AIVSS 0.5
  - MAESTRO
  - Taxonomy of Failure Modes in Agentic AI Systems
  - OWASP Top 10 for Agentic Applications
- Generative AI
  Generative AI
- General Guidance
  General Guidance
Governance
Governance
- Regulations
  Regulations
- Standards
  Standards
  - AIUC 1
  - ISO/IEC 5259 Series
    ISO/IEC 5259 Series
    
    ISO/IEC 5259-1:2024
    
    ISO/IEC 5259-2:2024
    
    ISO/IEC 5259-3:2024
    
    ISO/IEC 5259-4:2024
    
    ISO/IEC 5259-5:2025
  - ISO/IEC 8183:2023
  - ISO/IEC 23894:2023
  - ISO/IEC 42001:2023
  - ISO/IEC 42005:2025
  - ISO/IEC TR 24027:2021
  - ISO/PAS 8800:2024
Tools
Tools
- 3D Common Corruptions
- AI Incident Database
- AI Risk Repository
- Captum
- Garak
- AI-Enabled Medical Device List
- ModelScan
- Nemo Guardrails Nemo Guardrails
  Table of contents
- Promptfoo
- PyRIT
- LIME
- SHAP
- Spikee
- What-If Tool

NeMo Guardrails

Publisher: NVIDIA
Status: active
Version: 0.15.0
Release Date: 2025-08-08
Date Added: 2025-08-25
Source URL: https://github.com/NVIDIA/NeMo-Guardrails

Summary

Open-source toolkit for securing LLM-powered apps and AI agents. NeMo Guardrails enforces "rails" that filter inputs/outputs, identify jailbreaks and prompt injection attempts, manage conversation state, and fact-check answers. Useful for stopping data leaks, preventing rogue agent actions, and providing audit trails for security teams.

Key Takeaways

Block unsafe prompts before they reach the model or agent.
Catch jailbreaks and prompt injection attempts.
Stop unsafe outputs — detect hallucinations and fact-check results.
Write custom guardrails (Colang) for rules like PII masking or restricted actions.
Log and trace activity for audits, monitoring, and incident response.
Integrates with LangChain (agent framework).

/code/security-tools/nemo-guardrails

Additional Sources

arXiv Paper — NeMo Guardrails: A Toolkit for Controllable and Safe LLM Applications with Programmable Rails
NVIDIA NeMo Guardrails Documentation

Tags

guardrails, filtering, llm-security, agentic-ai, monitoring, jailbreak, prompt-injection, fact-checking, audit-trails, safety, alignment

License

Apache-2.0