Update Colab notebook: 1.5B model, scaled rewards, tuned hyperparameters ee8c2d4 Running nihalaninihal commited on 9 days ago
Fix critical RL reward function exploits and training hyperparameters 803c93e nihalaninihal Claude Opus 4.6 commited on 9 days ago
Align with Advanced Llama 3.2 GRPO LoRA reference notebook pattern c7d253a nihalaninihal Claude Opus 4.6 commited on 9 days ago
Fix VALID_TARGETS_FOR_ATTACK and attacker heuristic/prompt inconsistencies 3ffb78a nihalaninihal Claude Opus 4.6 commited on 9 days ago
Fix format_comparison_metrics_html to accept run_comparison() dict directly d52b449 nihalaninihal Claude Opus 4.6 commited on 9 days ago
Add run_demo_episode wrapper to demo.py for dict-based episode results fcf34b9 nihalaninihal Claude Opus 4.6 commited on 9 days ago
Fix Gradio 6 deprecation warning: move theme/css out of Blocks constructor af292c9 nihalaninihal commited on 9 days ago
Align train.py and Colab notebook with official Unsloth+OpenEnv GRPO patterns e09a415 nihalaninihal Claude Opus 4.6 commited on 9 days ago
Update metrics format with drift/oversight tracking, add colab training notebook 5e0f2b1 nihalaninihal commited on 9 days ago
Fix requirements: add pandas>=2.0, set gradio>=6.0.0 consistently 173a3e9 nihalaninihal commited on 9 days ago
Fix theme in Blocks constructor for HF Spaces compatibility ca88c10 nihalaninihal commited on 9 days ago
Add drift-specific metrics: drift events, detection, adaptation rate eb9e808 nihalaninihal commited on 9 days ago
Add oversight accuracy and explanation quality metrics to dashboard 33b6c02 nihalaninihal commited on 9 days ago
Improve HeuristicOversight explanations with specific data references 62aabbf nihalaninihal commited on 9 days ago
Add structured explanation quality scoring for oversight agent 5efcc1b nihalaninihal commited on 9 days ago
Fix schema drift renames to target actual Customer model fields 197e7c5 nihalaninihal commited on 9 days ago
Fix window_ticks policy enforcement in billing refund validation aea9d7d nihalaninihal commited on 9 days ago
Add master improvement plan with prioritized fixes for hackathon submission 7f33a54 nihalaninihal commited on 9 days ago
Add multi-agent GRPO training for all 3 agents (worker, attacker, oversight) 389e3bf nihalaninihal Claude Opus 4.6 commited on 9 days ago
Add comprehensive gap analysis and 4-hour action plan for hackathon submission ea3624f nihalaninihal Claude Opus 4.6 commited on 9 days ago
Add randomized attacker, security metrics engine, and updated Gradio dashboard 69a7e43 nihalaninihal Claude Opus 4.6 commited on 9 days ago
Add episode metrics computation and HTML formatting for SentinelOps Arena 23f3257 nihalaninihal Claude Opus 4.6 commited on 9 days ago
Replace HeuristicAttacker with RandomizedAttacker for probabilistic attacks 1f6f2a5 nihalaninihal Claude Opus 4.6 commited on 9 days ago
Improve Gradio UI layout with sidebar controls, sub-tabs, and styled score widgets e85e584 nihalaninihal Claude Opus 4.6 commited on 9 days ago
Revamp Gradio app with Gradio 6, custom cybersecurity theme, and rich visualizations f20603d nihalaninihal Claude Opus 4.6 commited on 9 days ago
Remove hackathon_env template, rewrite train.py for SentinelOpsArena 0e5a0a6 nihalaninihal Claude Opus 4.6 commited on 9 days ago
Implement Phase 3 (HTTP server) and Phase 4 (demo + Gradio app) fa00f5a nihalaninihal Claude Opus 4.6 commited on 10 days ago
Implement Phase 2: environment core with MCPEnvironment base 6c20e91 nihalaninihal Claude Opus 4.6 commited on 10 days ago
Implement Phase 1: models, enterprise systems, attacks, rewards a4e6593 nihalaninihal Claude Opus 4.6 commited on 10 days ago
Refine build plan with devil's advocate corrections dc8bc66 nihalaninihal Claude Opus 4.6 commited on 10 days ago
Add phased build plan and setup guide for SentinelOps Arena 707377e nihalaninihal Claude Opus 4.6 commited on 10 days ago
Update SentinelOps Arena with detailed 14-hour implementation plan 5f590b1 nihalaninihal Claude Opus 4.6 commited on 10 days ago
Add SentinelOps Arena project specification af942b1 nihalaninihal Claude Opus 4.6 commited on 10 days ago
Initial project setup for OpenEnv Hackathon ccb5f4e nihalaninihal Claude Opus 4.6 commited on 10 days ago