March 8, 2021

4 DOs and 3 DON’Ts for Training a Chatbot NLP Model

Florian Treml, Senior Director, Engineering

This article was originally published on Botium’s blog on March 8, 2021, prior to Cyara’s acquisition of Botium. Learn more about Cyara + Botium

A quick summary of 7 important DOs and DON’Ts when training an NLP model for a chatbot. They are best applied before starting a project, but can also help to build a mindset for quality training data in all chatbot project phases.

DOs and DON’Ts

✅ DO: think in problem space, not in solution space

Users typically think in problem space, not in solution space, and so should you. As a quick example, consider the case of a user who ordered a shirt in an online shop and wants to know when it is expected to arrive. Consider this question:

  • when will my shirt arrive

This is a question from problem space: it describes the problem the user wants solved. These, in contrast, are from solution space:

  • what is the estimated shipping time
  • show me the order status

They describe how your business will react to the problem.

Benefit: the chatbot and the users speak the same language


❌ DON’T: overload your intents with too many problems

As a rule of thumb, each intent should handle at most 3–6 user problems as described above. For each problem, you should provide at least 3 user examples. Focus on the essence of the intent: the solution your chatbot can provide for your users.
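The rule of thumb above can be checked mechanically. The following is a minimal sketch, assuming a hypothetical training-data layout of intent → problem → user examples; the names `check_intents`, `training_data`, and the two limits are illustrative and not part of any particular NLP tool.

```python
# Hypothetical layout: intent name -> problem name -> list of user examples.
MAX_PROBLEMS_PER_INTENT = 6   # upper end of the 3 to 6 rule of thumb
MIN_EXAMPLES_PER_PROBLEM = 3  # at least 3 user examples per problem


def check_intents(training_data):
    """Return warnings for intents that break the rule of thumb."""
    warnings = []
    for intent, problems in training_data.items():
        if len(problems) > MAX_PROBLEMS_PER_INTENT:
            warnings.append(
                f"{intent}: {len(problems)} problems, maximum is {MAX_PROBLEMS_PER_INTENT}"
            )
        for problem, examples in problems.items():
            if len(examples) < MIN_EXAMPLES_PER_PROBLEM:
                warnings.append(
                    f"{intent}/{problem}: {len(examples)} example(s), "
                    f"minimum is {MIN_EXAMPLES_PER_PROBLEM}"
                )
    return warnings


training_data = {
    "delivery_status": {
        "when_will_it_arrive": [
            "when will my shirt arrive",
            "where is my package",
            "my order is still not here",
        ],
        "late_delivery": ["my shirt is late"],
    },
}

for warning in check_intents(training_data):
    print(warning)
```

A check like this could run whenever the training data changes, so that overloaded intents are flagged before the model is retrained.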

Benefit 1: content stays maintainable and focused

Benefit 2: separation of concerns makes dialog building straightforward


✅ DO: clear separation of intents vs entities

To our surprise, it is still very common to intermix the concepts of intents and entities, and we strongly suggest that you stop doing it. Consider a real-life example of a fashion store that has trained an NLP model with the 3 intents

  • order_tshirt
  • order_pants
  • order_socks

In this case, there is room for exactly 1 intent (order) and 3 entities (shirt, pants, socks). Data scientists training the NLP model may not notice a real difference, but your developers will be grateful when coding dialog flow and fulfillment based on the NLP model output.
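To illustrate why developers benefit, here is a small sketch of fulfillment code working on the two styles of model output. The dictionary shape (`intent`, `entities`, `name`, `value`) and the `handle` function are assumptions made for this example; real NLP engines each have their own output format.

```python
# Intermixed: the product is baked into the intent name,
# so every new product needs a new intent and a new code branch.
intermixed_output = {"intent": "order_tshirt"}

# Separated: one intent, with the product carried as an entity.
separated_output = {
    "intent": "order",
    "entities": [{"name": "product", "value": "shirt"}],
}


def handle(nlu_output):
    """Route on the intent, then read the product from the entities."""
    if nlu_output["intent"] != "order":
        return "Sorry, I did not understand that."
    products = [
        entity["value"]
        for entity in nlu_output.get("entities", [])
        if entity["name"] == "product"
    ]
    if not products:
        return "What would you like to order?"
    return f"Ordering: {', '.join(products)}"


print(handle(separated_output))
```

With the separated design, adding a new product only means adding an entity value; the `handle` function stays unchanged.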

Benefit: maintainable and clearly defined NLP model output


❌ DON’T: repeat sentence patterns in training data

When asking how much training data is sufficient, you have to resist the general answer "the more the better". Training examples that all follow the same pattern, such as

  • order me a shirt
  • order me some shirts
  • order me shirts

do not help your NLP model with classification in the best case, and in the worst case they can even hurt by overfitting it (although, to be honest, a state-of-the-art pre-trained NLP model usually prevents this out of the box).
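Repeated patterns like these can be spotted with a rough heuristic. The sketch below reduces each example to a pattern key by dropping filler words and stripping plural endings, then groups examples that share a key. The filler list and the plural rule are deliberately naive assumptions for illustration, not a linguistically sound normalizer.

```python
import re

# Assumption: a tiny hand-picked list of filler words; tune this per project.
FILLER_WORDS = {"a", "an", "the", "some", "me", "my"}


def pattern_key(utterance):
    """Reduce an utterance to a rough pattern: lowercase, no fillers, naive plural stripping."""
    tokens = re.findall(r"[a-z']+", utterance.lower())
    kept = [token for token in tokens if token not in FILLER_WORDS]
    # Naive singularization: strip a trailing "s" from words longer than three letters.
    return tuple(t[:-1] if len(t) > 3 and t.endswith("s") else t for t in kept)


def find_repeated_patterns(examples):
    """Group examples that collapse to the same key; return only groups with duplicates."""
    groups = {}
    for example in examples:
        groups.setdefault(pattern_key(example), []).append(example)
    return [group for group in groups.values() if len(group) > 1]


examples = [
    "order me a shirt",
    "order me some shirts",
    "order me shirts",
    "need a new shirt",
]
print(find_repeated_patterns(examples))
```

Here the three "order me ..." examples collapse into one group, which suggests keeping one of them and replacing the others with differently structured sentences.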

Benefit: keeps your training data small and focused


✅ DO: vary sentence structure and key terms

Instead of repeating the same patterns, vary the sentence structure to teach the NLP model the different ways a user might express the problem. Here are some good training examples:

  • order me a shirt
  • need a new shirt
  • dress me up with a fancy new shirt

Depending on the domain, it may even make sense to use a thesaurus, but (importantly) only at the entity and key-term level; a state-of-the-art NLP model will learn everything else by itself. Country-specific variations deserve special attention here.
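One way to picture a thesaurus at the entity level is a lookup from synonyms, including regional variants, to a canonical entity value. The sketch below assumes a hypothetical `ENTITY_THESAURUS` with made-up entries and a simple whitespace tokenizer that ignores punctuation; many NLP engines offer their own entity synonym feature, which would replace this code in practice.

```python
# Hypothetical entity thesaurus: canonical value -> synonyms and regional variants.
ENTITY_THESAURUS = {
    "pants": ["trousers", "slacks"],
    "shirt": ["tee", "t-shirt", "top"],
}


def build_lookup(thesaurus):
    """Map every synonym (and the canonical value itself) to the canonical value."""
    lookup = {}
    for canonical, synonyms in thesaurus.items():
        lookup[canonical.lower()] = canonical
        for synonym in synonyms:
            lookup[synonym.lower()] = canonical
    return lookup


def resolve_entities(utterance, lookup):
    """Return the canonical entity values mentioned in the utterance, in order."""
    words = utterance.lower().split()
    return [lookup[word] for word in words if word in lookup]


lookup = build_lookup(ENTITY_THESAURUS)
print(resolve_entities("need new trousers", lookup))
```

The sentence structure around the entity is left untouched; only the key term is covered by the thesaurus.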

Benefit: makes classification robust for variations


❌ DON’T: train the model with misspelled data (but prepare for it)

This one is obvious: especially for entity resolution, some kind of spellchecking is a must, not only in training but also at live inference. For intent classification too, a model trained on misspelled data will in the worst case learn rubbish.
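As a rough illustration of preparing for misspellings at inference time, the sketch below corrects a user token against a known entity vocabulary with fuzzy matching from Python's standard `difflib` module. The vocabulary, the `correct_entity` name, and the cutoff of 0.75 are assumptions for this example; a production chatbot would more likely use a dedicated spellchecker.

```python
import difflib

# Assumption: the entity values the chatbot knows about.
KNOWN_ENTITIES = ["shirt", "pants", "socks"]


def correct_entity(token, vocabulary=KNOWN_ENTITIES, cutoff=0.75):
    """Return the closest known entity for a possibly misspelled token, or None."""
    matches = difflib.get_close_matches(token.lower(), vocabulary, n=1, cutoff=cutoff)
    return matches[0] if matches else None


print(correct_entity("shrt"))    # close enough to a known entity
print(correct_entity("banana"))  # nothing similar in the vocabulary
```

The training data itself stays clean, while the correction step absorbs typos in live user input before entity resolution.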

Benefit: makes classification robust for real user input


✅ DO: edit and use real user input as training data

While you shouldn’t blindly copy and paste real user input into your training data, it is without doubt the most valuable source of training data and of future improvements to your chatbot’s understanding. As long as unsupervised learning for NLP tasks is still in its infancy, some kind of manual intervention and editing process is a must for establishing continuous learning.
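A manual editing process needs a way to decide which real user inputs a human should look at. The sketch below is one possible approach, assuming conversation logs that record the predicted intent and a confidence score: it queues low-confidence utterances that are not yet in the training data. The `LoggedUtterance` fields and the threshold of 0.7 are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class LoggedUtterance:
    text: str
    predicted_intent: str
    confidence: float


REVIEW_THRESHOLD = 0.7  # assumption: below this, a human should take a look


def select_for_review(logs, known_examples, threshold=REVIEW_THRESHOLD):
    """Pick low-confidence utterances that are not already training examples."""
    seen = {example.lower().strip() for example in known_examples}
    queue = []
    for entry in logs:
        normalized = entry.text.lower().strip()
        if entry.confidence < threshold and normalized not in seen:
            queue.append(entry)
            seen.add(normalized)  # avoid queueing the same text twice
    return queue


logs = [
    LoggedUtterance("when will my shirt arrive", "delivery_status", 0.95),
    LoggedUtterance("is my parcel lost", "delivery_status", 0.41),
    LoggedUtterance("Is my parcel lost ", "delivery_status", 0.43),
]
for entry in select_for_review(logs, known_examples=["when will my shirt arrive"]):
    print(entry.text, entry.predicted_intent, entry.confidence)
```

A reviewer would then correct spelling, confirm or fix the intent, and only afterwards add the edited utterance to the training data.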

Benefit: improves the quality of your NLP model with each interaction


Action Plan

You can find information about how Botium can help on our Wiki and in our Blog:

  • NLP Quality Metrics
  • Test Set Insights
  • Confusion Matrix and Embeddings
  • Articles about NLU/NLP in our Wiki
