News

Bugcrowd Launches RL Environments for AI Security Training

News | 27.05.2026

Building AI models capable of real-world security reasoning has long demanded years of infrastructure investment. Bugcrowd now offers a faster path.

Training AI to perform meaningful security tasks is far more complex than it appears. Most existing tools rely on synthetic data that fails to capture how real vulnerabilities behave, resulting in models that perform well in controlled benchmarks but struggle against actual software flaws. Bugcrowd has addressed this gap directly with the launch of Reinforcement Learning (RL) Environments — enterprise-grade training infrastructure built on real, open-source vulnerabilities.

What was announced

On May 21, 2026, Bugcrowd announced the general availability of RL Environments, a new product category designed to help AI developers build models that can find, exploit, and fix real software vulnerabilities. The offering is built on technology from Bugcrowd's acquisition of Mayhem Security and is already in active use by leading LLM providers.

The platform delivers hundreds of thousands of training environments, each derived from authentic open-source vulnerabilities with real source code and verifiable outcomes. No synthetic approximations. No customer data. No security researchers involved at any stage of the training pipeline.

The gap between what AI agents are trained on and what they encounter in the real world is where security breaks down. Our RL Environments give frontier teams the infrastructure to build AI that learns security from real vulnerabilities, not approximations of them.

Dave Gerry, CEO, Bugcrowd

Frontier AI teams can begin training on production-grade security environments within weeks, eliminating what would otherwise require years of specialized engineering work.

Why this matters for CEE

CISOs and IT security leaders across Central and Eastern Europe are under increasing pressure to evaluate and deploy AI-assisted security tooling. The challenge is not just acquiring AI — it is understanding whether the AI has been trained on data that reflects real-world threat conditions. Bugcrowd RL Environments provide a transparent answer to that question.

For CEE organizations evaluating AI security platforms from LLM providers, this announcement signals a measurable shift in training quality. Models built on RL Environments are trained end-to-end — from bug detection through exploitation and patching — rather than stopping at surface-level pattern recognition. This directly impacts the reliability of AI-assisted vulnerability management, penetration testing support, and automated code auditing at the enterprise level.

As AI security tooling becomes a procurement category in its own right, understanding how models are trained — and on what data — becomes a due diligence requirement for security teams in the region.

Technical details

  • Training scope: Hundreds of thousands of environments built from real open-source vulnerabilities with verifiable outcomes
  • Coverage: End-to-end training across bug detection, exploitation verification, patching, and code audit
  • Feedback mechanism: Immediate, objective scoring at every action step — core to the reinforcement learning cycle
  • Data integrity: All environments derived exclusively from open-source software; no customer data, no proprietary code, no human researchers in the loop
  • Infrastructure: Ready to use without additional setup; no additional platform development required by AI teams
  • Foundation technology: Built on Mayhem Security acquisition, extending autonomous code and API testing capabilities upstream
  • Target users: Large language model providers and frontier AI research teams requiring security-capable agent training at scale
  • Companion framework: ExploitBench — a benchmarking framework for measuring the exploitation capabilities of AI models across the full exploitation ladder

Softprom and Bugcrowd

Softprom is the official distributor of Bugcrowd in the CEE region. We help IT security teams, solution integrators, and enterprise procurement teams evaluate and deploy Bugcrowd's preemptive security platform — including the latest RL Environments offering for AI security infrastructure.

This content was prepared as part of the Softprom DistriFlow project — an automated system for monitoring and adapting vendor news. Original source: original article.