About Skills Projects Experience Contact

Seamless integration of form and function!!

I'm Shivam Kushwaha, from Mumbai university and aspiring to be aiml researcher.

I'm passionate about AI and Machine Learning with a enthusiasm for solving complex problems through research, development, and competitive programming. Always on the lookout for new challenges.
I'm particularly drawn to the intersection of mathematics, deep learning, and real-world applications, with a keen interest in building scalable, intelligent software systems. Whether through developing new projects, optimizing algorithms, I am committed to continuous growth and contributing meaningfully to the evolving world of technology.

My Tech Stack Includes:

AI/ML enthusiast with a passion for development.

Do you want to start a project together?

Merging technical precision with creative problem-solving!!

Skills

Here are some of my skills on which I have been working on for the past 2 years.

Machine Learning

Python

Pytorch

Tensorflow

Keras

Jupyter

Google Colab

Scikit-learn

Frontend

React

Nextjs

HTML

CSS

JavaScript

Tailwind

Backend

Node Js

Python

Flask

Django

MySQL

MongoDB

DevOps

AWS

Google Cloud

Others

Pycharm

GitHub

VS Code

Experience

My work experience as a software engineer and working on different companies and projects

Machine Learning Intern

FairAds AI

July 2025 - Dec 2025

At FairAds AI, I dove deep into the world of computer vision and real-time AI systems. My primary focus was building a sophisticated gender and age estimation pipeline using the DETR (Detection Transformer) architecture combined with MiViOLo for demographic analysis. I curated and annotated a custom dataset, then fine-tuned the model using PyTorch, experimenting with various data augmentation techniques and hyperparameter optimization strategies to push the boundaries of accuracy.

Beyond model development, I architected a production-grade event-driven microservices architecture using Apache Kafka for real-time data streaming. The system was designed to handle high-throughput scenarios, with services deployed and orchestrated using Docker and Coolify. I implemented robust monitoring and logging with Prometheus and Grafana, ensuring system reliability and observability across all services.

One of the most exciting challenges was developing a multi-agent AI orchestration system using the Model Context Protocol (MCP) and CrewAI framework. I designed specialized agents for data analysis, report generation, and business intelligence, each with distinct roles and capabilities. The system leveraged LangChain for agent coordination and FastAPI for the backend server, creating an automated pipeline that transformed raw business data into actionable insights.

Data Science Intern

AgenixAI

Jan 2025 - June 2025

My journey at AgenixAI was all about pushing the boundaries of enterprise AI and document intelligence. I spearheaded the development of a comprehensive Retrieval-Augmented Generation (RAG) pipeline that revolutionized how the company handled document queries. Using Llama Cloud Parser for advanced document parsing and LangChain for orchestration, I built a system that could understand context and retrieve relevant information with remarkable precision. The pipeline integrated seamlessly with OpenAI's GPT-4 for generation tasks.

I architected an enterprise-grade semantic search engine from the ground up, leveraging Pinecone vector database for efficient similarity search at scale. The system used OpenAI's ada-002 embeddings to transform documents into high-dimensional vectors, implementing sophisticated metadata filtering and hybrid search capabilities. I optimized the indexing strategy and query pipeline using Python and FastAPI, ensuring lightning-fast retrieval even with massive document collections.

Perhaps the most challenging project was building a multimodal document intelligence system that could process diverse document formats. I integrated GPT-4 Vision for visual understanding and Unstructured.io for document preprocessing, creating a unified pipeline that handled PDFs, images, scanned documents, and more. The system employed OCR techniques, layout analysis, and entity extraction using custom spaCy models. I also implemented data validation pipelines with Pydantic and deployed the entire system on AWS using Docker containers.

Research Intern

Lokmanya Tilak College of Engineering, Navi Mumbai

Aug 2023 - Dec 2023

During my research internship at Lokmanya Tilak College of Engineering, I worked under Prof. Sanjivani Deokar on Conditional Diffusion Models, I focused on advancing techniques for high-fidelity image generation. The project centered on developing innovative approaches to enhance the quality and accuracy of generated images through diffusion-based methods.

A key achievement of this research was the development of a novel attention-based conditioning mechanism, which resulted in an 11% improvement in FID scores compared to existing baseline approaches. To optimize the training process, I implemented sophisticated techniques including mixed-precision training and gradient accumulation, which significantly enhanced the overall computational efficiency of the model training pipeline.

Projects

I have worked on some very cool projects integrating machine learning models with interactive UI. Here are some of my projects.

Advance ATS using AI

A sophisticated AI powered Application Tracking System to efficiently select suitable candidates based on job descriptions and role. Features online assessment for evaluation!!

Check Out Live

Advance ATS using AI

Diffusion model

A diffusion model based Image generation tool based on DDPM sampler, Fine tuned for Different styles.

Check Out Live

Diffusion model

Machine Translation

Implementation of Transformer paper from - "Attention is All you need" paper, trained on custom dataset for English to Hindi translation.

Check Out Live

Machine Translation

Book Recommender System

A collaborative filtering based Book Recommender Engine designed to suggest books based on user preferences.

Check Out Live

Book Recommender System

Customer churn prediction

Developed a deep learning model using Tensorflow to predict customer churn by analyzing banking behavior patterns and demographic data through neural networks.

Check Out Live

Customer churn prediction

Multi Agent - Market research system

A Multi agent AI system which analyzes and do market research of any businesses and propose a AI use case proposal & solutions to enhance and expand the businesses by integrating the modern AI solutions.

Check Out Live

Multi Agent - Market research system