Overview 

VIDIZMO offers tools and features for conducting a detailed analysis of your media or evidence in your Portal. When transcribing audio and video files, the VIDIZMO Audio Indexer identifies and separates different speakers based on their voices through speaker diarization. 


VIDIZMO accomplishes this by analyzing the unique characteristics of the speakers' voices, segmenting them, and assigning their transcriptions a speaker prefix (e.g., Speaker 1: ). The transcriptions generated reflect the identified speakers and their words, allowing you to quickly determine which sentences each speaker spoke. 


The VIDIZMO Audio Indexer for diarization only analyzes the voice segments in the audio, making the feature language independent. This means the application can identify different speakers regardless of the language used in your audio or video files. 


Prerequisites 

  • Access to the VIDIZMO Audio Indexer application must be available in your portal. 


Speaker Diarization Steps 

  1. Navigate to the audio or video you want to process.


        2. Click process on its overflow menu. 



        3. Select 'Generate AI Insights' on the 'Process' window.  

        4. Add 'Transcriptions/CC' to the Insights tab.  


Note: Speaker diarization is performed whenever transcriptions are generated via this AI Insight. You can also generate transcriptions for your content in several other ways, refer to: How to Generate Transcriptions and Translations using VIDIZMO Audio Indexer.  


        5. Click 'Start' to begin the processing  



        6. Once the processing is complete, click your audio or video file to open its playback page.



        7. The transcriptions segmented by speakers for your file are present in the Transcription tab.  


 
To learn more about transcriptions using VIDIZMO Audio Indexer, refer to Understanding Transcriptions and Translations Generation via VIDIZMO Audio Indexer