• Skip to primary navigation
  • Skip to main content
  • Skip to footer
Cyara

Cyara

Cyara Customer Experience Assurance Platform

  • Login
  • Contact Us
  • Request a demo
  • Search
  • Login
  • Contact us
  • Request a demo
  • Why Cyara
    • Cyara Agentic Platform
    • Cyara partner network
    • Cyara Academy
  • Products
    • ValidationBuild your CX stack with confidence – every layer, validated early
          • AI bot validationValidate conversational AI, GenAI, agentic AI chat, and voice bots
          • Telco infrastructureValidate carrier connectivity and routing for global calling and SMS
          • Network & endpointsValidate WebRTC media paths and agent desktop connectivity
    • ReadinessDeploy your CX journeys with confidence – at scale, through change
          • Agentic journey assuranceAssure end-to-end agentic and hybrid journeys before go-live
          • Load and performanceAssure CX journeys through load, peak, and scale
          • Human agent readinessAssure inbound and outbound agent paths before go-live
    • ObservabilityRun your CX operations with confidence – continuous monitoring, proactive resolution
          • Agentic AI trust & governanceMonitor AI agent hallucination, compliance, and misuse
          • Omnichannel observabilityMonitor end-to-end CX journey experience across channels
          • Human agent monitoringMonitor live agent connectivity and experience in real-time
    • Learn about the Cyara Agentic Platform
  • Resources
    • CX Assurance blog
    • Customer success showcase
    • CX use cases
    • Events & upcoming webinars
    • On-demand webinars
    • Resource library
  • About Us
        • About Cyara

        • About Cyara
        • Leadership
        • Careers
        • Legal statements, policies, & agreements
        • Services

        • Cyara Academy
        • Consulting services
        • Customer success services
        • Technical support
        • News

        • Press releases
        • Media coverage
        • Cyara awards
        • Partners

        • Partners

Blog / CX Assurance

March 26, 2021

Upskilling Test Engineers for Chatbot Projects

Florian Treml, Senior Director, Engineering

This article was originally published on Botium’s blog on March 26, 2021, prior to Cyara’s acquisition of Botium. Learn more about Cyara + Botium

With Botium, we are currently defining the industry standard for testing chatbots. In our support and developer channels, we are regularly receiving questions like:

  • I have to test a Whatsapp chatbot, can you help me to set up Appium for it ?
  • For our client I have to test a chatbot embedded in their app, can I test it with Botium ?
  • I have troubles with testing the customer support chatbot on our website, Selenium says
  • … and so on
several chatbots, one smashed with a hammer

If you have a hammer, every problem looks like a nail.

The conclusion to draw from these questions is: the test engineers learned how to test websites with Selenium and smartphone apps with Appium in the past, and now they try to apply this valuable knowledge again — neglecting the fact that chatbots are a new kind of apps that require new kind of tools (like Botium).

You can read about the most important differences in one of my previous blog posts.

With Selenium and Appium, we are talking about End-2-End testing (E2E) — simulating the full user experience on a graphical user interface. Those tests

  1. are extremely slow in execution, as they are basically running in realtime, and even for a medium-size chatbot project there typically is a 5-figure number of test cases for having a satisfying test coverage — running those tests in an E2E scenario will take hours in the best case
  2. require a high amount of computing resources or access to expensive browser/device cloud services
  3. are flaky as the required infrastructure is error-prone as well
  4. cannot provide a holistic view of the test object quality, as some important assertions as the pure NLP performance are technically not possible at all with E2E testing.

So here are my recommendations for test engineers on how to get going when asked for testing a chatbot.

API First

The most important metric for a chatbot is: is it able to do a meaningful conversation with a client? In every chatbot project team, there are conversation designers which, well, design the conversations that will make up the final user experience. The chatbot engine is trained (or coded) to provide the logic for these conversations.

chat flow diagram

And this is the place to start testing: make sure that the conversations are working as designed, from a content perspective. You can read more about conversation flow testing in the Botium docs.

One important skill to have is knowing BotiumScript, the scripting language to define conversation flow test cases.

Testing the NLP engine

Most chatbots have some kind of natural language processing (NLP) component as part of the processing pipeline — it enabled users to communicate with the chatbot in natural language, and that’s what actually makes up a chatbot. As a test engineer, it is your job to explore the limits of the NLP engine, and this requires basic skills in machine learning concepts, such as

  • intents, entities, and prediction confidence
  • accuracy, sensitivity, specificity, precision, recall, F1-score
  • confusion matrix

You can read about it in my blog series Quality Metrics for NLU/Chatbot Training Data.

E2E Smoketest

Testing the end-user experience on the user interface level is an important part of a testing strategy. When doing it right you now have the confidence the conversation flow and the NLP component are doing their work, so it is now time to add some user interface testing to the mix. The recommendation is to

  • do a small number of test cases, which cover all of the possible user interaction elements
  • do those tests on a mix of representative browser versions/operating systems/smartphone devices, both virtual and physical

The good news is that here test engineers can shine with the existing knowledge on Selenium and Appium!

Read on in the Botium Wiki how to set up this with Botium!

Non-Functional Testing

Finally, there are also non-functional tests like performance tests and security tests to add to the test mix. As opposed to the other test types those are typically done on certain milestones in the project.

Summary

A new generation of apps such as chatbots requires a new generation of testing tools, like Botium. Test engineers have to develop additional skills for testing conversational interfaces like chatbots.

blog thumbnails

Read more about: Chatbot testing, Chatbots, Cyara Botium

Related Posts

chatbot testing

June 11, 2026

Silent AI Failures in CX: When Bots Respond Correctly but Still Frustrate Users

Learn how to reduce risk, customer frustrations, and deliver better CX with AI and chatbot testing solutions.

Topics: AI chatbot testing, AI-Powered CX, Automated testing, Chatbot assurance, Chatbot testing, Customer experience (CX)

conversational AI testing

August 28, 2025

Automated Testing for Conversational AI: A Game-Changer in Customer Support

The rise of AI-powered CX offer many key benefits... and risks. Learn how to ensure CX quality with a conversational AI testing solution.

Topics: AI chatbot testing, Artificial intelligence (AI), Automated testing, Chatbots, Conversational AI, Conversational AI Testing, Customer experience (CX)

chatbot testing

July 31, 2025

How Chatbot Testing Helps You Accelerate CX Innovation and Deliver Exceptional Interactions

Poorly optimized chatbots leave you vulnerable to a wide range of risks. Start accelerating CX innovation confidently with chatbot testing.

Topics: AI chatbot testing, Artificial intelligence (AI), Automated testing, Chatbot assurance, Chatbot testing, Contact center, Customer experience (CX)

Footer

Cyara
Leader Enterprise Best Est. ROI Enterprise Easiest To Use Enterprise
  • LinkedIn
  • YouTube
  • Products
    • Cyara Agentic Platform
    • Validation
      • Botium
      • Voice Assure
      • testRTC
    • Readiness
      • Velocity
      • Cruncher
      • testRTC
    • Observability
      • AI Trust
      • Pulse360
      • Pulse
      • Number Trust
      • ResolveAX
  • Resources
    • CX Assurance Blog
    • Events & upcoming webinars
    • On-demand webinars
    • Customer success showcase
    • Resource library
  • Company
    • About us
    • Leadership
    • Careers
    • Press releases
    • Media coverage
    • Cyara awards
    • Partners
    • Legal
  • Support
    • Cyara Academy
    • Support sites

Copyright © 2006–2026 Cyara® Inc. The Cyara logo, names and marks associated with Cyara’s products and services are trademarks of Cyara. All rights reserved. Privacy Statement