About the Role
We are looking for a motivated Intern/Trainee AI TTS/STT Engineer to join our AI team and help develop Bangla Voice AI solutions. Candidates should have hands-on experience through projects, research, internships, freelancing, or personal work involving Speech-to-Text (STT), Text-to-Speech (TTS), and conversational AI technologies, preferably for Bangla language applications.
Responsibilities:
- Assist in developing and improving Bangla Speech-to-Text (STT) systems.
- Assist in developing and improving Bangla Text-to-Speech (TTS) systems.
- Work on AI-powered voice conversational systems and voice assistants.
- Integrate STT, TTS, and LLM components into end-to-end voice AI workflows.
- Prepare and manage speech datasets for training and evaluation.
- Conduct model testing, performance evaluation, and optimization.
- Collaborate with AI Engineers and Software Developers to deploy Voice AI solutions.
- Research and evaluate emerging technologies in speech and conversational AI
Qualifications:
- Bachelor's degree (completed or final year) in CSE, EEE, AI, Data Science, or a related field.
- Hands-on experience with at least one AI Voice Conversational System (academic, personal, freelance, internship, or open-source project).
- Must be able to explain their contribution to the project during the interview.
- Strong Python programming skills.
- Basic knowledge of Machine Learning and Deep Learning.
- Familiarity with Speech-to-Text (STT) and Text-to-Speech (TTS).
- Understanding of conversational AI workflows.
Preferred Skills:
- Speech-to-Text (STT): Whisper, Wav2Vec2, NeMo, DeepSpeech
- Text-to-Speech (TTS): Coqui TTS, XTTS, VITS, FastSpeech, Tacotron
- AI Development: PyTorch, TensorFlow, LLM Integration
- Deployment & APIs: FastAPI, Docker
- Audio Processing & Dataset Preparation
What Will Make You Stand Out:
- Experience building a Bangla voice assistant or voice chatbot.
- Experience integrating STT, TTS, and LLMs into a conversational system.
- Experience working with Bangla speech datasets.
- Demonstrable GitHub repositories, research papers, thesis work, or project portfolios related to Voice AI.
- Understanding of real-time voice interaction systems.
Job Benefits:
- Opportunity to work in a young, dynamic, and fun environment with one of the best teams in the country.
- Opportunity to learn immensely in a short time.
- Hands-on experience working with top brands.
- Free breakfast, lunch, and tea; occasional ice cream, pizzas, and snacks!
- Console gaming and a lot more!
Job Details:
- Office workdays: Sunday to Thursday (5 Days); subject to periodic changes based on project requirements.
- Office working hours: 10:00 am to 7:00 pm (1 hour break)
- Office location: Uttara.
- Salary range: BDT 8000-12000
Application Deadline:
- June 30, 2026 (Preference will be given to those who will apply early).









