Server sizing

Server sizing can be different for various speech transcription providers. When using the Verint Speech Transcription, the Verba Speech Analytics Server runs the transcription process and requires additional resources. In the case of 3rd party speech transcription providers, the transcription process runs in the cloud or on on-premise 3rd party servers. The Verba Speech Analytics Service has the following jobs (with impact):

Execute the configured Speech Transcription policies which query the database to create a list of call records for the transcription process (low)
Download the audio files from the storage target (low)
Optionally transcode the audio to the format supported by the transcription provider (medium)
Run the transcription directly on the server in the case of Verint Speech Transcription (high) or in the cloud or on 3rd party server (no impact)
Insert plain text transcription to the database (low)
Upload transcript file to the storage target (low)

Server Roles

The Verba Speech Analytics Service can be enabled on the following server roles:

Speech Analytics Server
Media Repository Server

Do not enable the service on any other server role. If the service is enabled on the Media Repository Server, the service has to be configured to limit the number of simultaneous transcription processes to 1 (Speech Analytics / Transcription / Task Processing Threads Count), this will use 1 core for transcription.

The service may be enabled on multiple servers, in this case, the system will automatically handle the load balancing between servers.

Verint Speech Transcription

When the Verint Speech Transcription engine is used, stand-alone Verba Speech Analytics Servers have to be deployed. The speech transcription process cannot be running on other server roles. This is due to the extensive resource usage nature of the speech transcription service which might interfere with other processes on the servers.

This table outlines typical server sizing and recommended hardware and software configurations:

Server Role	Speech Analytics Server
Server Platform	Industry standard PC servers Physical or virtual
CPU	Intel Xeon 2.4 GHz or higher Up to 16 CPU cores or vCPUs 2 cores/vCPUs must be "reserved" for the OS, only the rest can be used for application sizing Numbers only applicable when Receive-side scaling (RSS) is enabled in the OS
CPU	1 vCPU for every 200 hours of calls transcribed in 24 hours Individual results may vary due to the talk time in recordings, the storage codec, and the language model used
Memory	4GB RAM + 2GB / language model
Hard disk	System partition (OS and applications): 80 GB or more Media partition: not required, media files are only temporarily stored on the server Always use redundant disks with RAID and have separate Media and System volumes
Network	All server clocks must be synchronized, typically either with the domain controller or time server All servers must have the latest Time Zone configuration Custom time zones are not supported
Network	1x Gigabit Ethernet
Operating system	Microsoft Windows Server 2012 R2 Microsoft Windows Server 2016 Microsoft Windows Server 2019 Standard Edition, latest service packs installed We support the English versions of Microsoft server software
Virtualization	VMware and Hyper-V are recommended, see Virtualization.
Antivirus	Make sure your Antivirus software does not scan database, media and log folders.
Power	Redundant Power Supply UPS recommended

3rd party Speech Transcription

In the case of 3rd party speech transcription providers, the Verba Speech Analytics Service initiates the transcription process, which runs in the cloud or on on-premise 3rd party servers. For that reason, the Verba Speech Analytics Service can be run on dedicated Verba Speech Analytics Servers or on servers with Media Repository / Application Server roles.

For server sizing, use the standard Media Repository / Application Server guidelines at Server sizing and requirements.

Storage requirement

AVAILABLE IN 9.7.5 AND ABOVE

Speech transcription is supported on any storage target, except EMC Centera and Hitachi Content Platform (which do not allow uploading individual files after setting the retention on the object). In the case of WORM storage, the system automatically applies the retention period on the uploaded transcription files to match the original retention of the conversation.

Transcription policies only run on conversations stored on storage targets, which means files must be uploaded to the storage first.

Before version 9.7.5 the transcription only works when the media is stored on SMB storage. If the media has to be stored on a WORM or non-SMB storage, the files should be hosted temporarily on SMB and moved to the final storage target after transcription.

Installation

Installing the Verba Speech Analytics Server

For installing the server, refer to Installing a Verba Speech Analytics Server.

Enabling the Speech Analytics Service

Step 1 - On the Verba Web Interface, go to System / Servers. Select your server and click on the Service Activation tab.
Step 2 - Activate the Verba Speech Analytics Service by clicking on the icon.

Step 3 - Click on the Service Control tab.
Step 4 - Start the Verba Speech Analytics Service by clicking on the icon.

Configuration

Refer to Configuring and running transcription in order to configure speech transcription. You can find additional, integration specific information at:

VFC Capture (Verba) 9.9

Deploying transcription