not-lain commited on
Commit
c2da129
·
verified ·
1 Parent(s): 5428fa2

not-lain/finetuned_deepseek_ocr

Browse files
README.md CHANGED
@@ -1,9 +1,14 @@
1
  ---
2
- library_name: transformers
3
  license: mit
4
  base_model: deepseek-ai/DeepSeek-OCR
5
  tags:
6
- - generated_from_trainer
 
 
 
 
 
7
  model-index:
8
  - name: finetuned_deepseek_ocr
9
  results: []
@@ -34,12 +39,12 @@ More information needed
34
 
35
  The following hyperparameters were used during training:
36
  - learning_rate: 0.0005
37
- - train_batch_size: 2
38
  - eval_batch_size: 8
39
  - seed: 42
40
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
41
  - lr_scheduler_type: linear
42
- - training_steps: 3
43
 
44
  ### Training results
45
 
@@ -47,7 +52,8 @@ The following hyperparameters were used during training:
47
 
48
  ### Framework versions
49
 
 
50
  - Transformers 4.46.3
51
  - Pytorch 2.6.0+cu124
52
  - Datasets 4.3.0
53
- - Tokenizers 0.20.3
 
1
  ---
2
+ library_name: peft
3
  license: mit
4
  base_model: deepseek-ai/DeepSeek-OCR
5
  tags:
6
+ - base_model:adapter:deepseek-ai/DeepSeek-OCR
7
+ - lora
8
+ - sft
9
+ - transformers
10
+ - trl
11
+ pipeline_tag: text-generation
12
  model-index:
13
  - name: finetuned_deepseek_ocr
14
  results: []
 
39
 
40
  The following hyperparameters were used during training:
41
  - learning_rate: 0.0005
42
+ - train_batch_size: 4
43
  - eval_batch_size: 8
44
  - seed: 42
45
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
46
  - lr_scheduler_type: linear
47
+ - training_steps: 8
48
 
49
  ### Training results
50
 
 
52
 
53
  ### Framework versions
54
 
55
+ - PEFT 0.17.1
56
  - Transformers 4.46.3
57
  - Pytorch 2.6.0+cu124
58
  - Datasets 4.3.0
59
+ - Tokenizers 0.20.3
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b286d1de8ead082ffe7f4d74bdb4fa3570ae09031c382169ecab15c1f03bb9dd
3
  size 2958552
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5b495ec84536d6424b2d80c560fe16eb1ffd638e871c98668ce4f122f8115bec
3
  size 2958552
runs/Oct30_17-20-43_368ad1980789/events.out.tfevents.1761844845.368ad1980789.6156.5 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4861e14d1685ec7b7d5a1534232b5a82654c6e95ffcfca427dfcc6035550041f
3
- size 9149
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:31c15e392639fc44aecc2ad2c096a2535a7e19148e27e655a40f4b890fc3bc75
3
+ size 9704