Stable Baselines3
# With extra dependencies (Tensorboard, etc.)
LLM Evaluation
Evaluated by: xiaomi/mimo-v2-flash:free
Last evaluated: March 29, 2026
Prompt Preview
---
name: stable-baselines3
description: Production-ready reinforcement learning algorithms (PPO, SAC, DQN, TD3, DDPG, A2C) with scikit-learn-like API. Use for standard RL experiments, quick prototyping, and well-documented algorithm implementations. Best for single-agent RL with Gymnasium environments. For high-performance parallel training, multi-agent systems, or custom vectorized environments, use pufferlib instead.
license: MIT license
metadata:
skill-author: K-Dense Inc.
---
# Stable...
Full prompt length: 9507 characters
Tools & Technologies
- python