Welcome! I'm Tanmay Kapure, a dedicated and results-oriented Data Scientist with a passion for uncovering insights and driving innovation through data. Currently pursuing a Master of Science in Data Science at the New Jersey Institute of Technology, where I blend a robust academic foundation with hands-on industry experience. Through high-impact roles— including working on projects for reputed organizations like U.S Army and NASA —I've consistently driven measurable improvements in efficiency and accuracy. My expertise spans advanced statistical modeling, machine learning, and data visualization, complemented by proficiency in cutting-edge tools like AWS, Transformers, and anomaly detection systems. Passionate about solving complex challenges, I thrive on turning large datasets into strategic solutions and leading multidisciplinary projects that push the boundaries of innovation.
0 + Projects completed
Professional Data Scientist driving business strategies through data-driven insights. Proven expertise in data science, statistical analysis, machine learning algorithms and project management.
Worked with Senior Professors and Researchers of U.S Army to research and utlizie advanced methods to predict fatigue strength of Metals
The CSTR is an international leader in solar and terrestrial physics, with interest in understanding the effects of the Sun on the geospace environment.
A profitable startup based in singapore, which provides software solutions and services that help companies
Jubilant Biosys is a Contract Research, Development and Manufacturing Organization in India providing comprehensive drug discovery and research services worldwide.
My academic journey and achievements.
GPA 3.91 / 4.0
GPA 3.74 / 4.0
Below are my personal projects in Python, ML, Gen AI and Full Stack Development.
Built an end-to-end RAG system leveraging LangChain, FastAPI, and Pinecone, enabling context-aware responses from LLMs. Enhanced document processing & chunking using sentence transformers and made a Gradio webapp to interact with model. Orchestrated the webapp on Hugging Face Spaces with Docker & CI/CD, ensuring scalability & real-time query handling.
Created browser agent to navigate, extract info from web as per natural language instructions using Langgraph, Gemini, Playwright. Ability to bypass captcha, scrape websites, Human in the Loop feature to seamlessly fulfill browser tasks. Smart LLM assisted navigation to successfully handle dynamically changing web layouts.
A full stack Voice AI based end to end web application agent to simulate mock interview using VAPI AI and Gemini API's. Intelligent Chatbot that gives feedback based on candidate's performance and helps them evaluate their preparation for Interviews. Responsive and beautiful frontend based on Next.js and Tailwind CSS, while PostgreSQL for database and Firebase for authentication, hosted on Vercel.
Compared the performance of Quantum Machine Learning algorithms vs the classical ML agorithms for solving the backorders problem and Optimal vehicle route prodection in supply chain management
Performed exoplanet exploration using NASA's keper telescope dataset from NASA's exoplanet archive
Predicted various sentimens using BILSTM-RNN with an accuracy of 94 % on emotions data
We Used RoBERTa LLM for converting Natural language sentences into SQL queries
Developed an ML model to perform association rule mining in various retail datasets
Developed ML model to predict shipment price in logistics of a pharma company with an accuracy of 95%
Below are the details to reach out to me!
Newark, NJ