facebook/wav2vec2-large-960h-lv60-self

@@ -24,7 +24,7 @@ model-index:
     metrics:
     - name: Test WER
       type: wer
-      value: 1.9
+      value: 1.86
   - task:
       name: Automatic Speech Recognition
       type: automatic-speech-recognition
@@ -38,7 +38,7 @@ model-index:
     metrics:
     - name: Test WER
       type: wer
-      value: 3.9
+      value: 3.88
 ---
 # Wav2Vec2-Large-960h-Lv60 + Self-Training
@@ -85,9 +85,9 @@ To transcribe audio files the model can be used as a standalone acoustic model a
  transcription = processor.batch_decode(predicted_ids)
  ```
-  ## Evaluation
- This code snippet shows how to evaluate **facebook/wav2vec2-large-960h-lv60-self** on LibriSpeech's "clean" and "other" test data.
+## Evaluation
+This code snippet shows how to evaluate **facebook/wav2vec2-large-960h-lv60-self** on LibriSpeech's "clean" and "other" test data.
 ```python
 from datasets import load_dataset
@@ -110,7 +110,7 @@ def map_to_pred(batch):
         logits = model(input_values, attention_mask=attention_mask).logits
     predicted_ids = torch.argmax(logits, dim=-1)
-    transcription = processor.batch_decode(predicted_ids)
+    transcription = processor.batch_decode(predicted_ids)[0]
     batch["transcription"] = transcription
     return batch
@@ -123,4 +123,4 @@ print("WER:", wer(result["text"], result["transcription"]))
 | "clean" | "other" |
 |---|---|
-| 1.9 | 3.9 |
+| 1.86 | 3.88 |
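
A note on the `[0]` change in `map_to_pred`: `processor.batch_decode` returns a list with one string per decoded sequence. Assuming `map_to_pred` receives a single example per call, indexing with `[0]` stores a plain string in `batch["transcription"]`, so the final `wer(...)` call compares strings with strings. The sketch below illustrates the metric behind the table, assuming the usual word-level edit-distance definition of WER. It is not the implementation used by the `wer` function in the snippet; `word_error_rate` and the sample sentences are hypothetical.

```python
def word_error_rate(references, hypotheses):
    """Corpus-level WER: total word edits divided by total reference words.

    Hypothetical helper for illustration only.
    """
    total_edits = 0
    total_words = 0
    for ref, hyp in zip(references, hypotheses):
        ref_words, hyp_words = ref.split(), hyp.split()
        # Word-level Levenshtein distance, keeping one DP row at a time.
        prev = list(range(len(hyp_words) + 1))
        for i, ref_word in enumerate(ref_words, start=1):
            curr = [i]
            for j, hyp_word in enumerate(hyp_words, start=1):
                cost = 0 if ref_word == hyp_word else 1
                curr.append(min(
                    prev[j] + 1,         # deletion: reference word missing
                    curr[j - 1] + 1,     # insertion: extra hypothesis word
                    prev[j - 1] + cost,  # substitution or exact match
                ))
            prev = curr
        total_edits += prev[-1]
        total_words += len(ref_words)
    return total_edits / total_words


# A decoded batch of size one is a one-element list; [0] yields the string.
decoded = ["THE CAT SAT ON A MAT"]  # stand-in for a batch_decode result
transcription = decoded[0]

references = ["THE CAT SAT ON THE MAT", "HELLO WORLD"]
hypotheses = [transcription, "HELLO WORLD"]

# One substitution over eight reference words.
print("WER:", round(100 * word_error_rate(references, hypotheses), 2))  # 12.5
```

Reported as a percentage with two decimals, the same calculation would give figures in the form of the updated table (1.86 and 3.88) rather than the one-decimal values it replaces.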