Automatic Speech Recognition
Transformers
Safetensors
English
whisper
zen
zenlm
hanzo
zen3
speech
audio
whisper-like
Instructions to use zenlm/zen3-asr with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use zenlm/zen3-asr with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="zenlm/zen3-asr")# Load model directly from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq processor = AutoProcessor.from_pretrained("zenlm/zen3-asr") model = AutoModelForSpeechSeq2Seq.from_pretrained("zenlm/zen3-asr") - Notebooks
- Google Colab
- Kaggle
Zen3 Asr
Zen3 automatic speech recognition model for multilingual transcription.
Overview
Built on Zen MoDE (Mixture of Distilled Experts) architecture with 1.5B parameters.
Developed by Hanzo AI and the Zoo Labs Foundation.
Quick Start
from transformers import AutoModelForSpeechSeq2Seq, AutoProcessor
import torch
model_id = "zenlm/zen3-asr"
processor = AutoProcessor.from_pretrained(model_id)
model = AutoModelForSpeechSeq2Seq.from_pretrained(model_id, torch_dtype=torch.float16, device_map="auto")
# Load audio
import librosa
audio, sr = librosa.load("audio.wav", sr=16000)
inputs = processor(audio, sampling_rate=sr, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs)
print(processor.batch_decode(outputs, skip_special_tokens=True)[0])
Model Details
| Attribute | Value |
|---|---|
| Parameters | 1.5B |
| Architecture | Zen MoDE |
| Context | 30s audio |
| License | Apache 2.0 |
License
Apache 2.0
- Downloads last month
- 10