Tags: Text Generation, Transformers, PyTorch, code, gpt2, custom_code, Eval Results (legacy), text-generation-inference
Instructions to use bigcode/santacoder with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
  - Transformers
How to use bigcode/santacoder with Transformers:
```python
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="bigcode/santacoder", trust_remote_code=True)

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("bigcode/santacoder", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained("bigcode/santacoder", trust_remote_code=True)
```
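Once loaded, generation follows the usual Transformers pattern. A minimal sketch, mirroring the `generate`/`decode` example from the model card (see the README diff below); the prompt, device selection, and `max_new_tokens` value here are illustrative:

```python
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

checkpoint = "bigcode/santacoder"
device = "cuda" if torch.cuda.is_available() else "cpu"

tokenizer = AutoTokenizer.from_pretrained(checkpoint, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(checkpoint, trust_remote_code=True).to(device)

# Complete a Python function signature.
inputs = tokenizer.encode("def print_hello_world():", return_tensors="pt").to(device)
outputs = model.generate(inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0]))
```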
- Notebooks
  - Google Colab
  - Kaggle
- Local Apps
  - vLLM
How to use bigcode/santacoder with vLLM:
Install from pip and serve the model:
```sh
# Install vLLM from pip:
pip install vllm

# Start the vLLM server:
vllm serve "bigcode/santacoder"

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "bigcode/santacoder",
    "prompt": "Once upon a time,",
    "max_tokens": 512,
    "temperature": 0.5
  }'
```
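The same endpoint can also be called from Python. A minimal client sketch, assuming the `openai` package (`pip install openai`, not part of the original instructions) and the server running on its default port 8000; the prompt and sampling settings are illustrative:

```python
from openai import OpenAI

# vLLM exposes an OpenAI-compatible API; no real key is needed for a local server.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

response = client.completions.create(
    model="bigcode/santacoder",
    prompt="def fibonacci(n):",
    max_tokens=128,
    temperature=0.5,
)
print(response.choices[0].text)
```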
Use Docker:

```sh
docker model run hf.co/bigcode/santacoder
```
  - SGLang
How to use bigcode/santacoder with SGLang:
Install from pip and serve the model:
```sh
# Install SGLang from pip:
pip install sglang

# Start the SGLang server:
python3 -m sglang.launch_server \
  --model-path "bigcode/santacoder" \
  --host 0.0.0.0 \
  --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "bigcode/santacoder",
    "prompt": "Once upon a time,",
    "max_tokens": 512,
    "temperature": 0.5
  }'
```
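As with vLLM, the SGLang server can be queried from Python. A minimal sketch using the `requests` package (an assumption, not part of the original instructions); prompt and parameters are illustrative:

```python
import requests

# Query the SGLang server started above via its OpenAI-compatible completions API.
response = requests.post(
    "http://localhost:30000/v1/completions",
    json={
        "model": "bigcode/santacoder",
        "prompt": "def hello_world():",
        "max_tokens": 64,
        "temperature": 0.5,
    },
)
print(response.json()["choices"][0]["text"])
```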
Use Docker images:

```sh
docker run --gpus all \
  --shm-size 32g \
  -p 30000:30000 \
  -v ~/.cache/huggingface:/root/.cache/huggingface \
  --env "HF_TOKEN=<secret>" \
  --ipc=host \
  lmsysorg/sglang:latest \
  python3 -m sglang.launch_server \
    --model-path "bigcode/santacoder" \
    --host 0.0.0.0 \
    --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "bigcode/santacoder",
    "prompt": "Once upon a time,",
    "max_tokens": 512,
    "temperature": 0.5
  }'
```
  - Docker Model Runner

How to use bigcode/santacoder with Docker Model Runner:
```sh
docker model run hf.co/bigcode/santacoder
```
add note on fim tokens #42
by loubnabnl (HF Staff), opened
README.md CHANGED

````diff
@@ -243,6 +243,7 @@ inputs = tokenizer.encode(input_text, return_tensors="pt").to(device)
 outputs = model.generate(inputs)
 print(tokenizer.decode(outputs[0]))
 ```
+Make sure to use `<fim-prefix>, <fim-suffix>, <fim-middle>` and not `<fim_prefix>, <fim_suffix>, <fim_middle>` as in StarCoder models.
 
 ### Load other checkpoints
 We upload the checkpoint of each experiment to a separate branch as well as the intermediate checkpoints as commits on the branches. You can load them with the `revision` flag:
````
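To illustrate the note added above, here is a minimal fill-in-the-middle sketch, reusing the `tokenizer`, `model`, and `device` from the Transformers example earlier; the function body is illustrative. The key point is that SantaCoder's FIM tokens are hyphenated:

```python
# Note the hyphenated tokens: <fim-prefix>, not <fim_prefix> as in StarCoder models.
input_text = (
    "<fim-prefix>def print_one_two_three():\n"
    "    print('one')\n"
    "<fim-suffix>\n"
    "    print('three')<fim-middle>"
)
inputs = tokenizer.encode(input_text, return_tensors="pt").to(device)
outputs = model.generate(inputs)  # the model fills in the middle segment
print(tokenizer.decode(outputs[0]))
```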