- Developed a FastAPI service implementing advanced prompt engineering with LLMs (Gemini/GPT-4) to deliver real-time, context-aware insights, streamlining complex research workflows for pharmaceutical KAMs.
- Engineered a batch-processing pipeline to automate data ingestion from 50+ sources (REST APIs, FTP, SQL), replacing manual form entry and reducing CRM onboarding time by 2-3 weeks per source.
- Led a team of 4 to build a scalable Data Change Request system using Azure Functions and SQL Stored Procedures to handle high data volumes, processing 4,000+ requests and saving ~40 hours of manual cleaning.
- Designed the data warehouse architecture and implemented data pipelines across various platforms such as Snowflake, Databricks, dbt and Python for top-tier pharmaceutical clients, accelerating commercial reporting and ad-hoc statistical analysis by over 50%.
About
I am an aspiring software engineer with a passion for systems and software engineering.
Currently, I am a graduate student at Stony Brook University majoring in computer science with the intention to delve deeper into systems and work on some cool research along the way.
Previously, I was a senior data engineer at ZS Associates developing data products and web apps for pharmaceutical organizations.
Work Experience
- Built a Node.js backend and Angular front-end for real-time contracting scenario analysis, processing tens of thousands of rows via configurable algorithms to optimize pharma contract selection and boost ROI by more than 20%.
- Developed a full-stack graph visualization tool (Neo4j/D3.js) to map complex health system hierarchies (working with Veeva/IQVIA data), providing clear visibility into organized customer groups for sales targeting, segmentation and reporting.
- Developed robust data transformation processes across platforms such as Snowflake, Databricks, dbt and Python centralizing & standardizing client data management to enhance downstream analytics and improve turnaround time by more than 75%.
- Engineered a scalable Spark pipeline to process 100TB+ of complex, nested JSON datasets, utilizing strategic partitioning to optimize compute costs.
- Designed and deployed Course and Event management applications with Python/Django, enhancing the user experience for 100+ users with JavaScript-driven features like interactive quizzes and charts.
Co-curriculars
- Worked as a teaching assistant for CSE 337 - Scripting Languages, which involves proctoring and grading student examinations.
- Also conducted regular office hours to clarify students' doubts and grade scripting assignments.
- Research Assistant implementing the consensus protocol powering a decentralized Proof-Of-Agreement (PoA) based blockchain network.
- Working with DistAlgo - a language for building high-performance distributed algorithms.
- Consistently created engaging social media content using Adobe After Effects and CorelDraw, resulting in a 40% increase in follower count and online engagement
- Publicised the cultural fest in public areas and other colleges in the city and enrolled participants for various competitions
- Worked as a volunteer in Aarohan's Run for a Cause marathon which involved participation registration, distributing goodies, setting up directions along the route, etc.