Back to Portfolio
WHO - Voice Identification System
AI/ML Project

WHO - Voice Identification System

CNN-based voice identification system using spectrograms to identify speakers from 3-second audio clips with data augmentation.

Project Overview

WHO is a sophisticated voice identification system built using Convolutional Neural Networks (CNN) that can accurately identify speakers from just 3-second audio clips using spectrogram analysis.

The system employs advanced data augmentation techniques on over 500 audio samples to improve model robustness and accuracy in speaker identification tasks.

This project strengthened expertise in CNN-based audio processing and demonstrated the practical application of deep learning in biometric identification systems.

Key Features

  • 3-second audio clip identification
  • CNN-based architecture
  • Spectrogram analysis
  • Data augmentation pipeline
  • High accuracy speaker recognition
  • Real-time processing capabilities

Technologies Used

PythonTensorFlowCNNAudio ProcessingSpectrogramsData Augmentation

Project Details

Client

Personal Project

Timeline

February 2024

Role

ML Engineer & Audio Processing Specialist

© 2026 Samarth Borade. All rights reserved.

0%