Intern/Trainee- AI TTS/STT Engineer (Bangla Voice AI)

About the Role

We are looking for a motivated Intern/Trainee AI TTS/STT Engineer to join our AI team and help develop Bangla Voice AI solutions. Candidates should have hands-on experience through projects, research, internships, freelancing, or personal work involving Speech-to-Text (STT), Text-to-Speech (TTS), and conversational AI technologies, preferably for Bangla language applications.

Responsibilities:

  • Assist in developing and improving Bangla Speech-to-Text (STT) systems.
  • Assist in developing and improving Bangla Text-to-Speech (TTS) systems.
  • Work on AI-powered voice conversational systems and voice assistants.
  • Integrate STT, TTS, and LLM components into end-to-end voice AI workflows.
  • Prepare and manage speech datasets for training and evaluation.
  • Conduct model testing, performance evaluation, and optimization.
  • Collaborate with AI Engineers and Software Developers to deploy Voice AI solutions.
  • Research and evaluate emerging technologies in speech and conversational AI

Qualifications:

  • Bachelor's degree (completed or final year) in CSE, EEE, AI, Data Science, or a related field.
  • Hands-on experience with at least one AI Voice Conversational System (academic, personal, freelance, internship, or open-source project).
  • Must be able to explain their contribution to the project during the interview.
  • Strong Python programming skills.
  • Basic knowledge of Machine Learning and Deep Learning.
  • Familiarity with Speech-to-Text (STT) and Text-to-Speech (TTS).
  • Understanding of conversational AI workflows.

Preferred Skills:

  • Speech-to-Text (STT): Whisper, Wav2Vec2, NeMo, DeepSpeech
  • Text-to-Speech (TTS): Coqui TTS, XTTS, VITS, FastSpeech, Tacotron
  • AI Development: PyTorch, TensorFlow, LLM Integration
  • Deployment & APIs: FastAPI, Docker
  • Audio Processing & Dataset Preparation

What Will Make You Stand Out:

  • Experience building a Bangla voice assistant or voice chatbot.
  • Experience integrating STT, TTS, and LLMs into a conversational system.
  • Experience working with Bangla speech datasets.
  • Demonstrable GitHub repositories, research papers, thesis work, or project portfolios related to Voice AI.
  • Understanding of real-time voice interaction systems.

Job Benefits:

  • Opportunity to work in a young, dynamic, and fun environment with one of the best teams in the country.
  • Opportunity to learn immensely in a short time.
  • Hands-on experience working with top brands.
  • Free breakfast, lunch, and tea; occasional ice cream, pizzas, and snacks!
  • Console gaming and a lot more!

Job Details:

  • Office workdays: Sunday to Thursday (5 Days); subject to periodic changes based on project requirements.
  • Office working hours: 10:00 am to 7:00 pm (1 hour break)
  • Office location: Uttara.
  • Salary range: BDT 8000-12000

Application Deadline:

  • June 30, 2026 (Preference will be given to those who will apply early).

Life at Notionhive, where people & culture come together