SRAVAN REDDY

Applied ML and Computer Vision Engineer

Specialized in Computer Vision, NLP and Deep Learning

Explore My Work

About Me

Artificial Intelligence & Machine Learning

Exploring the frontiers of AI technology to solve complex real-world problems

Computer Vision & Deep Learning

Developing intelligent systems that can see, understand, and interpret visual data

IoT & Embedded Systems

Creating connected devices and smart systems for the future

Data Science & Analytics

Transforming raw data into actionable insights and intelligent solutions

B.Tech student in Computers and Communication Engineering at Amrita Vishwavidyapeetham (2021-2025). Passionate about AI, machine learning, and IoT, with innovative projects including surgical assistance systems and crop prediction models.

Skilled in Python, computer vision, and NLP with hands-on experience in developing speech-to-text systems, RAG models, and voice-controlled applications. Currently interning at Loksun.ai to enhance expertise in ML and AI. Holds certifications in Google Data Analytics and Python for Data Science, reflecting commitment to data-driven solutions. Aims to leverage technology to solve real-world challenges in AI and IoT.

reddyshravan0403@gmail.com

GitHub

Experience

Machine Learning Intern

Loksun.ai • April 2025 - Present

Developed a multi-Indian language speech-to-text system using Vosk for offline processing and Whisper for GPU-powered transcription, automating manual transcription tasks
Built a RAG model for the Indian Constitution using LangChain and Gemini 1.5 Flash, delivering accurate, context-aware legal responses

Education

B.Tech in Computers and Communication Engineering

Amrita Vishwavidyapeetham • 2021-2025

CGPA: 7.38

Specialized in Computer Science with focus on AI, Machine Learning, and IoT. Completed coursework in Data Structures, Algorithms, Computer Vision, Natural Language Processing, and Deep Learning. Actively participated in research projects and technical competitions.

Projects

Voice-Controlled Surgical Assistance System

Engineered a voice-controlled system integrating Google Speech-to-Text and YOLOv11 for surgical instrument detection with 97% accuracy. Developed a comprehensive dataset of surgical tools containing over 6,000 original images, training specialized models that reduced command misinterpretation by 40% in clinical environments.

Python OpenCV Machine Learning YOLOv11

Jan 2024 - May 2024

Crop Prediction

Built predictive models for 22 Indian crops using historical data, achieving 20% reduction in prediction error. Enhanced model accuracy by implementing advanced data preprocessing techniques, including feature scaling, outlier detection, and dimensionality reduction.

Python Machine Learning Google Colab

Aug 2023 - Nov 2023

Health Monitoring System

Designed an IoT-based system to monitor temperature, pulse, and BMI using Arduino and ThingSpeak, improving data accessibility by 75%. Integrated ThingSpeak cloud IoT platform to enable live uploading and remote access of patient health data, ensuring authorized personnel can monitor real-time health conditions from any location.

Python Sensors IoT Arduino

Feb 2022 - May 2022

Speech-to-Text System (Indian Languages)

Developed a multilingual speech-to-text system using Vosk for offline transcription and Whisper for GPU-accelerated processing. Automated transcription workflows across Indian languages, improving speed and accuracy while supporting dialect variations.

Python Whisper Vosk Offline Processing

May 2024 - Jul 2024

View on GitHub

RAG on Indian Constitution

Built a Retrieval-Augmented Generation (RAG) model using LangChain and Gemini 1.5 Flash to answer constitutional queries. Combined contextual retrieval with generative reasoning for accurate legal insights.

Python LangChain Gemini 1.5 RAG

Feb 2024 - Apr 2024

View on GitHub

Surgical Tools Dataset

Created a comprehensive dataset of 6,000 high-quality images for surgical tool recognition, including 9 categories of tools. Captured under diverse conditions with overlapping configurations to train robust computer vision models for clinical applications.

Computer Vision Dataset Creation Image Processing Medical AI

Jan 2025 - Feb 2025

View Dataset

Courses & Certifications

Data Base Management System

NPTEL

Issued Mar 2024

Comprehensive course covering database design, SQL, normalization, and database management principles. Learned advanced concepts in relational database management systems and optimization techniques.

Credential ID: NPTEL24CS21S553400346

Google Data Analytics

Coursera

Issued Sep 2023

Professional certificate program covering data analysis, visualization, and statistical analysis using tools like R, SQL, Tableau, and Spreadsheets. Gained expertise in data cleaning, processing, and creating actionable insights.

Credential ID: KPXB9RH6KEWA

Managing Emotions in Times of Uncertainty & Stress

Yale University

Issued Sep 2023

Evidence-based strategies for managing stress and emotional well-being during challenging times. Developed skills in emotional regulation, mindfulness, and resilience building.

Credential ID: F3WQF5ZJEBNY

Python for Data Science