← Back to Blog
How-To 11 min read • 2026-05-31

🎓 How to Train AI Voice Bot in Nepali, Hindi, or English (2026 Guide)

Complete guide to training AI voice agents in multiple languages including Nepali, Hindi, and English. Covers prompt engineering, knowledge base structure, and testing strategies.

Key Insight

AI voice bots don't need traditional ML training. They learn from your system prompt + knowledge base. The skill is in writing prompts that produce natural, accurate Nepali/Hindi/English conversation while respecting language-specific nuances like honorifics, code-switching, and formality levels.

You Don't "Train" — You Configure

Modern AI voice bots (powered by Gemini Live, GPT-4o, etc.) don't require ML training. The model is already trained on massive datasets. Your job is to configure it for your specific use case via:

  1. System prompt — personality, rules, conversation flow
  2. Knowledge base — facts about your business
  3. Examples — sample conversations showing desired behavior
  4. Voice/language settings — which voice, which language

Structuring Your System Prompt

A well-structured Nepali voice bot prompt has these sections:

1. Identity (Who Are You?)

तिमी TalkC AI हौ — XYZ Company को आधिकारिक भ्वाइस एजेन्ट।
तिमी मानिस जस्तै कुरा गर — robot होइन, मानिस।

2. Language Rules

- Default: नेपाली
- Caller ले English बोले: English मा switch गर
- Caller ले Hindi बोले: Hindi मा reply (तर तिम्रो default नेपाली नै हो)
- नेपाली-English mix ("balance check garnu") = normal नेपाली, language switch नगर

3. Tone and Style

- हजुर, हस्स, हुन्छ नि — natural Nepali fillers use गर
- Caller को tone match गर — angry → empathetic, happy → cheerful
- SHORT reply — 1-2 sentence then STOP, let caller talk
- Repetition नगर — same phrase दुई पटक नभन

4. Conversation Flow

═══ GREETING ═══
"नमस्ते हजुर! XYZ Company मा स्वागत छ। कसरी मद्दत गरौं?"

═══ ANSWER STYLE ═══
- Caller ले प्रश्न सोध्यो? सिधा उत्तर देऊ
- Answer दिसकेपछि stop — "अरु केही?" बारम्बार नसोध

═══ ESCALATION ═══
- "मान्छेसँग कुरा गर्न चाहन्छु" सुनियो? → human transfer
- 2 पटक try गरेर solve भएन? → escalate

5. Knowledge Base Section

═══ TIMRO COMPANY ═══
- Name: XYZ Restaurant
- Hours: 11 AM - 10 PM daily
- Address: Thamel, Kathmandu
- Phone: 01-1234567
- Speciality: Newari thali

═══ MENU ═══
- Veg Thali: Rs 350
- Newari Khaja Set: Rs 550
- Mo:mo (10 pcs): Rs 150
- ...

Hindi-Specific Considerations

Honorifics

Common Hindi Code-Switching

English Configuration

English is "easier" for AI but has its own challenges in South Asia:

Knowledge Base Best Practices

Structure Information Hierarchically

Keep It Conversational

Write in the language and style the AI should respond. Don't paste technical documents:

Include Real Phrases

If callers ask things in specific ways, include those phrases:

And paste the answer in same language nearby.

Testing Strategy

1. Internal Testing (Day 1-2)

2. Soft Launch (Day 3-7)

3. Full Launch (Day 8+)

Common Mistakes to Avoid

1. Prompts That Are Too Long

5000+ token prompts cause latency. Keep under 3000 tokens. Use knowledge base for facts, prompt for behavior.

2. Translation Instead of Native

Don't write in English then translate. Write directly in the target language. Awkward phrasing kills user experience.

3. No Anti-Repetition Rules

Without explicit rules, AI will repeat phrases like "अरु केही?" every turn. Always add: "Same phrase दुई पटक नभन।"

4. No Voice/Personality

Generic "AI assistant" prompts produce generic AI. Add personality: warm/professional/playful — match your brand.

5. Missing Escalation Rules

AI will try to handle everything if you don't tell it when to give up. Add specific triggers for human transfer.

Advanced: Function/Tool Calling

For dynamic data (real-time order status, appointment booking), define tools the AI can call:

{
  "name": "check_order_status",
  "description": "Look up order by order number",
  "parameters": {
    "order_number": "string"
  }
}

AI decides when to call the function based on conversation context.

Continuous Improvement

Frequently Asked Questions

Can I train AI voice bot myself or do I need an expert?

For basic deployments, anyone with good writing skills can configure an AI voice bot. For complex use cases (tool calling, multi-step workflows), you may benefit from prompt engineering experience.

How long is a good system prompt?

1500-3000 tokens (roughly 1000-2000 words). Longer prompts slow response time. Keep behavior rules in prompt, push facts to knowledge base.

Should the prompt be in Nepali/Hindi or English?

Write prompts in the target language for best results. Mixed prompts (English instructions, Nepali examples) also work. Avoid translating English prompts — write natively.

How do I handle multiple languages simultaneously?

Modern voice models handle code-switching naturally. Set primary language, allow switching based on caller. Test mid-call language switches explicitly.

Can AI learn from each call automatically?

Not in real-time. You manually update prompts/KB based on insights. Some platforms offer call analytics to identify improvement opportunities. True online learning is risky for production voice bots.

Ready to see TalkC.ai in action?

Get a personalized demo of TalkC.ai's voice AI platform. See how we handle 22,000+ calls/month for Yango Nepal, OCR Nepal, and government offices — same-day setup, 70+ languages.

Book a Demo →
T
TalkC.ai Team
team@talkc.ai • Kathmandu, Nepal