•1 min read•from Machine Learning
Best current methods for finetuning whisper on domain specific vocabulary? [P]
Hey everyone,
I’m wondering whether there are any newer or more effective methods for fine tuning whisper on domain specific speech. I’m working on a project where the model needs to reliably detect certain specific words and technical terms. The vocabulary and context are mostly in spanish.
Does anyone have experience with a similar use case? Roughly how many hours of labeled audio would be needed before seeing the model converged?
I know about lora, qlora, and spectrum, but Im curious if there are any newer or better ways to adapt whisper to specific vocabulary.
any help is welcome!
[link] [comments]
Want to read more?
Check out the full article on the original site
Tagged with
#natural language processing for spreadsheets
#generative AI for data analysis
#rows.com
#Excel alternatives for data analysis
#financial modeling with spreadsheets
#Whisper
#Fine-tuning
#Domain-specific
#Vocabulary
#Labeled audio
#Spanish
#Technical terms
#Speech recognition
#Lora
#QLoRA
#Spectrum
#Model convergence
#Audio data
#Machine Learning
#Deep Learning