Projects
Profanity Detection
A robust multimodal system for detecting and rephrasing profanity in both speech and text, leveraging advanced Natural Language Processing (NLP) models to ensure accurate filtering while preserving conversational context.
FoodVision
FoodVision is a deep learning-based food detection system that utilizes YOLOv8 to identify and classify various food items in images. The system is capable of detecting 55 different food classes with a focus on fruits and vegetables, making it useful for dietary monitoring and nutritional analysis.
Speech Emotion Recognition using Wav2Vec2
This project implements a Speech Emotion Recognition (SER) system using the Wav2Vec2 pre-trained model. The system analyzes speech audio and classifies it into four emotional categories: Neutral, Happy, Sad, and Angry. Based on the paper "3-D Convolutional Recurrent Neural Networks With Attention Model for Speech Emotion Recognition".