Voice transcription – What is the expected latency and level of accuracy for voice transcription?

Within Genesys Cloud, audio is transcribed in near real time, within seconds, and is accessible through our Notifications APIs.  The full interaction transcript becomes available in the Interaction Details UI immediately after the call, usually within 15 seconds.

  • Expected latency: approximately 3–5 seconds with this toggle enabled, compared to 35–40 seconds without it.
  • There is no additional cost for customers who use this feature.

For more information, see Genesys Cloud supported languages, and How do I increase the accuracy of voice transcription?, Configure voice transcription.


  • This field is for validation purposes and should be left unchanged.
  • If you still have questions you can ask the community for help.