Description

This repository contains the model for Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning.

Official Implementation

https://github.com/akatigre/MASA-RL

Citation

@article{kim2025meta,
  title={Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning},
  author={Kim, Yoonjeon and Jang, Doohyuk and Yang, Eunho},
  journal={arXiv preprint arXiv:2510.03259},
  year={2025}
}
Downloads last month
19
Safetensors
Model size
8B params
Tensor type
BF16
·
Video Preview
loading

Model tree for jadohu/Qwen3-8B-MASA

Base model

Qwen/Qwen3-8B-Base
Finetuned
(299)
this model
Quantizations
1 model

Dataset used to train jadohu/Qwen3-8B-MASA

Collection including jadohu/Qwen3-8B-MASA