NagaTranslate: Building a translation and voice pipeline for low-resource Nagaland creoles (Whisper, VITS, LLMs) [P]
![NagaTranslate: Building a translation and voice pipeline for low-resource Nagaland creoles (Whisper, VITS, LLMs) [P]](/_next/image?url=https%3A%2F%2Fpreview.redd.it%2Fbu6xsk4hvx9h1.jpg%3Fwidth%3D140%26height%3D63%26auto%3Dwebp%26s%3D0a4c589616d351d10d735940874706494e48d408&w=3840&q=75)
| Hello r/MachineLearning , I wanted to share the architecture and challenges behind a project I’ve been building called NagaTranslate. The goal is to build a translation and speech pipeline for the low-resource languages of Nagaland, India (currently supporting Nagamese, Ao, and Sema). Since Nagamese and other native Naga languages were primarily oral languages (though recent times have seen a surge in print and digital media in local dialects) with very little standard parallel data, this has been an interesting challenge in low-resource NLP. I’d love to share the technical setup and get your feedback on the architecture and how to improve the pipeline under strict resource constraints. The Architecture & Models 1. Text Translation
2. Speech Synthesis (TTS)
3. Speech Recognition (ASR)
Technical Questions & Challenges I’d Love Advice On:
I’d appreciate any insights, feedback on the methodology, or pointers to similar low-resource architectures you've found successful. [link] [comments] |
Want to read more?
Check out the full article on the original site