SRAVAN REDDY

Applied ML and Computer Vision Engineer

Specialized in Computer Vision, NLP and Deep Learning

Explore My Work

About Me

AI Engineering

Artificial Intelligence & Machine Learning

Exploring the frontiers of AI technology to solve complex real-world problems

Computer Vision

Computer Vision & Deep Learning

Developing intelligent systems that can see, understand, and interpret visual data

IoT Engineering

IoT & Embedded Systems

Creating connected devices and smart systems for the future

Data Science

Data Science & Analytics

Transforming raw data into actionable insights and intelligent solutions

B.Tech student in Computers and Communication Engineering at Amrita Vishwavidyapeetham (2021-2025). Passionate about AI, machine learning, and IoT, with innovative projects including surgical assistance systems and crop prediction models.

Skilled in Python, computer vision, and NLP with hands-on experience in developing speech-to-text systems, RAG models, and voice-controlled applications. Currently interning at Loksun.ai to enhance expertise in ML and AI. Holds certifications in Google Data Analytics and Python for Data Science, reflecting commitment to data-driven solutions. Aims to leverage technology to solve real-world challenges in AI and IoT.

Experience

Machine Learning Intern
Loksun.ai • April 2025 - Present
  • Developed a multi-Indian language speech-to-text system using Vosk for offline processing and Whisper for GPU-powered transcription, automating manual transcription tasks
  • Built a RAG model for the Indian Constitution using LangChain and Gemini 1.5 Flash, delivering accurate, context-aware legal responses

Education

B.Tech in Computers and Communication Engineering
Amrita Vishwavidyapeetham • 2021-2025
CGPA: 7.38

Specialized in Computer Science with focus on AI, Machine Learning, and IoT. Completed coursework in Data Structures, Algorithms, Computer Vision, Natural Language Processing, and Deep Learning. Actively participated in research projects and technical competitions.

Projects

Voice-Controlled Surgical Assistance System

Engineered a voice-controlled system integrating Google Speech-to-Text and YOLOv11 for surgical instrument detection with 97% accuracy. Developed a comprehensive dataset of surgical tools containing over 6,000 original images, training specialized models that reduced command misinterpretation by 40% in clinical environments.

Python OpenCV Machine Learning YOLOv11

Jan 2024 - May 2024

Crop Prediction

Built predictive models for 22 Indian crops using historical data, achieving 20% reduction in prediction error. Enhanced model accuracy by implementing advanced data preprocessing techniques, including feature scaling, outlier detection, and dimensionality reduction.

Python Machine Learning Google Colab

Aug 2023 - Nov 2023

Health Monitoring System

Designed an IoT-based system to monitor temperature, pulse, and BMI using Arduino and ThingSpeak, improving data accessibility by 75%. Integrated ThingSpeak cloud IoT platform to enable live uploading and remote access of patient health data, ensuring authorized personnel can monitor real-time health conditions from any location.

Python Sensors IoT Arduino

Feb 2022 - May 2022

Speech-to-Text System (Indian Languages)

Developed a multilingual speech-to-text system using Vosk for offline transcription and Whisper for GPU-accelerated processing. Automated transcription workflows across Indian languages, improving speed and accuracy while supporting dialect variations.

Python Whisper Vosk Offline Processing

May 2024 - Jul 2024

View on GitHub

RAG on Indian Constitution

Built a Retrieval-Augmented Generation (RAG) model using LangChain and Gemini 1.5 Flash to answer constitutional queries. Combined contextual retrieval with generative reasoning for accurate legal insights.

Python LangChain Gemini 1.5 RAG

Feb 2024 - Apr 2024

View on GitHub

Surgical Tools Dataset

Created a comprehensive dataset of 6,000 high-quality images for surgical tool recognition, including 9 categories of tools. Captured under diverse conditions with overlapping configurations to train robust computer vision models for clinical applications.

Computer Vision Dataset Creation Image Processing Medical AI

Jan 2025 - Feb 2025

View Dataset

Courses & Certifications

Data Base Management System
NPTEL
Issued Mar 2024
Comprehensive course covering database design, SQL, normalization, and database management principles. Learned advanced concepts in relational database management systems and optimization techniques.
Credential ID: NPTEL24CS21S553400346
Google Data Analytics
Coursera
Issued Sep 2023
Professional certificate program covering data analysis, visualization, and statistical analysis using tools like R, SQL, Tableau, and Spreadsheets. Gained expertise in data cleaning, processing, and creating actionable insights.
Credential ID: KPXB9RH6KEWA
Managing Emotions in Times of Uncertainty & Stress
Yale University
Issued Sep 2023
Evidence-based strategies for managing stress and emotional well-being during challenging times. Developed skills in emotional regulation, mindfulness, and resilience building.
Credential ID: F3WQF5ZJEBNY
Python for Data Science
NPTEL
Issued Sep 2022
Comprehensive Python programming course focused on data science applications. Covered NumPy, Pandas, Matplotlib, and statistical analysis techniques for data manipulation and visualization.
Credential ID: NPTEL22CS74S13070141