This is a standard Llama-based Nano model (~100M parameters). It was trained using the robust Python script.
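
As a rough sanity check on the ~100M figure, the sketch below counts the weights of a Llama-style decoder from its shape. Every dimension in `LlamaShape` (vocabulary size, hidden size, layer count, and so on) is a hypothetical value chosen to land near 100M; none of them are taken from this model's actual configuration. The names `LlamaShape` and `count_parameters` are also illustrative only.

```python
from dataclasses import dataclass


@dataclass
class LlamaShape:
    # All defaults are hypothetical; the real model's values are not stated.
    vocab_size: int = 32_000
    hidden_size: int = 768
    num_layers: int = 12
    intermediate_size: int = 2_048
    num_heads: int = 12
    num_kv_heads: int = 12       # fewer than num_heads means grouped-query attention
    tie_embeddings: bool = True  # share input embedding and output head weights


def count_parameters(shape: LlamaShape) -> int:
    """Count weights in a Llama-style decoder (no biases, RMSNorm, gated MLP)."""
    d = shape.hidden_size
    head_dim = d // shape.num_heads
    kv_dim = head_dim * shape.num_kv_heads

    embedding = shape.vocab_size * d
    # Query and output projections are d x d; key and value are d x kv_dim.
    attention = 2 * d * d + 2 * d * kv_dim
    # Gated MLP has three matrices: gate, up, and down projections.
    mlp = 3 * d * shape.intermediate_size
    # Two RMSNorm weight vectors per layer (pre-attention and pre-MLP).
    norms = 2 * d
    per_layer = attention + mlp + norms

    final_norm = d
    lm_head = 0 if shape.tie_embeddings else shape.vocab_size * d
    return embedding + shape.num_layers * per_layer + final_norm + lm_head


if __name__ == "__main__":
    total = count_parameters(LlamaShape())
    print(f"{total / 1e6:.1f}M parameters")
```

With these assumed dimensions the count comes out slightly above 100M. Untying the embeddings or changing the vocabulary size shifts the total noticeably, since the embedding table is a large share of a model this small.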