• Skip to primary navigation
  • Skip to main content
  • Skip to footer
Cyara

Cyara

Cyara Customer Experience Assurance Platform

  • Login
  • Contact Us
  • Request a demo
  • Search
  • Login
  • Contact us
  • Request a demo
  • Why Cyara
    • Cyara Agentic Platform
    • Cyara partner network
    • Cyara Academy
  • Products
    • ValidationBuild your CX stack with confidence – every layer, validated early
          • AI bot validationValidate conversational AI, GenAI, agentic AI chat, and voice bots
          • Telco infrastructureValidate carrier connectivity and routing for global calling and SMS
          • Network & endpointsValidate WebRTC media paths and agent desktop connectivity
    • ReadinessDeploy your CX journeys with confidence – at scale, through change
          • Agentic journey assuranceAssure end-to-end agentic and hybrid journeys before go-live
          • Load and performanceAssure CX journeys through load, peak, and scale
          • Human agent readinessAssure inbound and outbound agent paths before go-live
    • ObservabilityRun your CX operations with confidence – continuous monitoring, proactive resolution
          • Agentic AI trust & governanceMonitor AI agent hallucination, compliance, and misuse
          • Omnichannel observabilityMonitor end-to-end CX journey experience across channels
          • Human agent monitoringMonitor live agent connectivity and experience in real-time
    • Learn about the Cyara Agentic Platform
  • Resources
    • CX Assurance blog
    • Customer success showcase
    • CX use cases
    • Events & upcoming webinars
    • On-demand webinars
    • Resource library
  • About Us
        • About Cyara

        • About Cyara
        • Leadership
        • Careers
        • Legal statements, policies, & agreements
        • Services

        • Cyara Academy
        • Consulting services
        • Customer success services
        • Technical support
        • News

        • Press releases
        • Media coverage
        • Cyara awards
        • Partners

        • Partners

Blog / CX Assurance

September 30, 2020

Ensure Chatbot Training Phrase Quality with Botium

Florian Treml, Senior Director, Engineering

This article was originally published on Botium’s blog on September 30, 2020, prior to Cyara’s acquisition of Botium. Learn more about Cyara + Botium

This article shows you how to analyze and evaluate the quality of the training phrases for your chatbot intents with Botium. The purpose of this analysis is to avoid confusing the agent with phrases irrelevant to the intents supplied to, or more relevant to, other intents.

Botium first generates semantic embeddings of the training phrases by using the TensorFlow Hub Universal Sentence Encoder module and visualizes them in a 2D map. Based on the similarity between the training phrases, the average similarity between the intents is computed (separation), as well as the average similarity of phrases within an intent (cohesion). This approach helps to identify training phrases that might confuse your chatbot – based on the similarity in the embedding space.

Downloading Chatbot Training Phrases to Botium

All data science projects start with slicing and dicing data. Botium includes a Test Case Wizard, which downloads the training phrases from your chatbot provider with a single click – IBM Watson, Google Dialogflow, SAP Conversational AI, and more (see a list of supported chatbots engines in the Botium Wiki). Or you can decide to import one of the supported file formats instead – including JSON, YAML, Excel, and more (see Botium Wiki for details).

Test Case Wizard - Conversation Model Downloader

Analyze Training Phrases

Now navigate to the NLP Training Quality section to launch the analysis job in the background. Depending on the size of your training data this will take a few minutes.

Similarity Visualization

The semantic similarity of the training phrases is now visualized on a 2D map: The closer the points, the closer the semantical similarity. Hovering over a data point will show you the similarity in question. You can decide to show and hide training phrases for individual intents be selecting/deselecting the intents one by one.

Intent Similarity

Hint: Having all the training phrases for all intents shown on one map is confusing. For getting the most out of it, first check the following sections for similarities and then activate/deactivate the intents in question on the map.

Utterance Similarity

Training phrases in different intents that have high similarity value can be confusing to the NLU engine and could lead to directing the user input to the wrong intent.

NLP Training Data - Intent Similarity

Intent Separation

Given two intents, the average distance between each pair of training phrases in the two intents is shown.

NLP Training Data - Intent Separation

Intent Cohesion

Cohesion is the average similarity value between each pair of training phrases in the same intent. That value is computed for each intent. The higher the intent cohesion value, the better the intent training phrases.

NLP Training Data - Intent Cohesion

Improve Chatbot Training Phrases

To improve the quality of the training phrases for your intents, consider the following approaches:

  • Find the phrases in different intents with high similarity in the Utterance Similarity table, and change or remove them
  • For intents with low cohesion, add more meaningful training phrases
  • For intent pairs with low separation, investigate training phrases

Botium provides tools for these steps. See the Botium Wiki. Learn more about Cyara Botium here.

Read more about: Chatbot assurance, Chatbots, Cyara Botium, Quality Assurance

Related Posts

chatbot testing

June 25, 2026

Better Chatbot Testing, Better Performance: A Guide for CX Teams

Discover why modern chatbot testing platforms are essential for conversational AI testing, chatbot performance, and reliable CX.

Topics: AI chatbot testing, AI-Powered CX, Chatbot assurance, Chatbot testing

chatbot testing

June 11, 2026

Silent AI Failures in CX: When Bots Respond Correctly but Still Frustrate Users

Learn how to reduce risk, customer frustrations, and deliver better CX with AI and chatbot testing solutions.

Topics: AI chatbot testing, AI-Powered CX, Automated testing, Chatbot assurance, Chatbot testing, Customer experience (CX)

conversational AI testing

August 28, 2025

Automated Testing for Conversational AI: A Game-Changer in Customer Support

The rise of AI-powered CX offer many key benefits... and risks. Learn how to ensure CX quality with a conversational AI testing solution.

Topics: AI chatbot testing, Artificial intelligence (AI), Automated testing, Chatbots, Conversational AI, Conversational AI Testing, Customer experience (CX)

Footer

Cyara
Leader Enterprise Best Est. ROI Enterprise Easiest To Use Enterprise
  • LinkedIn
  • YouTube
  • Products
    • Cyara Agentic Platform
    • Validation
      • Botium
      • Voice Assure
      • testRTC
    • Readiness
      • Velocity
      • Cruncher
      • testRTC
    • Observability
      • AI Trust
      • Pulse 360
      • Pulse
      • Number Trust
      • ResolveAX
  • Resources
    • CX Assurance Blog
    • Events & upcoming webinars
    • On-demand webinars
    • Customer success showcase
    • Resource library
  • Company
    • About us
    • Leadership
    • Careers
    • Press releases
    • Media coverage
    • Cyara awards
    • Partners
    • Legal
  • Support
    • Cyara Academy
    • Support sites

Copyright © 2006–2026 Cyara® Inc. The Cyara logo, names and marks associated with Cyara’s products and services are trademarks of Cyara. All rights reserved. Privacy Statement