interesting model

#8
by DUOWEN - opened

Identity misconception of this model. it believes that it is deepseek v3 when asked in chinese
image

Doesn't it mean that it probably shared a dataset?

This is very usual, even Claude says it is DeepSeek when asked in Chinese. Most models don't have a good sense of their 'identity'.

AI does not need self-consciousness at all. We don't need to deliberately make a difference between MiniMax/DeepSeek/Claude/GPT or more. As long as they all train with the whole Internet as data, their behavior will converge.

Using the "whole internet" as dataset doesn't make the model think it is Deepseek. They obviously used the Deepseek API to train the model (or extracted it from the files). I can't think of any other reason. Which can be a problem because this type of info synchronization could end up in cooptation.

We don't need to deliberately make a difference between MiniMax/DeepSeek/Claude/GPT or more.

Yes, we do. That's the whole point of having different models in the first place.

there are a lot of things to consider: 1. how was the model trained to answer such a question, 2. what is the context given by the program that runs it, 3. what takes precedence given those

Sign up or log in to comment