Applied Research Scientist
ServiceNow
Applied Research Scientist | May 2024 – Present
At ServiceNow, I work on various NLP projects for multiple clients, focusing on data curation, training SLMs and LLMs, and evaluating their performance. I have developed in-house frameworks to enhance core functionalities.
Thoughtworks
Senior Data Scientist | September 2020 – May 2024
At Thoughtworks, I led the development of AI and NLP solutions, focusing on domain-specific models and revenue forecasting, which significantly enhanced client outcomes.
Python
Supervised & Unsupervised Learning Deep Learning Transfer Learning Ensemble Methods Gradient Boosting (XGBoost, LightGBM, CatBoost) Neural Networks
CNNs RNNs LSTMs Transformer Models (BERT, GPT) Attention Mechanisms
LLM’s finetuning and alignment Text Classification NER Machine Translation Sequence-to-Sequence Models Word Embeddings Speech Recognition
Data Cleaning Feature Engineering Data Visualization Predictive Modelling Statistical Modelling Time Series Analysis A/B Testing
TensorFlow PyTorch Keras Scikit-learn Pandas NumPy SciPy Hugging Face OpenCV NLTK SpaCy
Docker Kafka CI/CD ONNX Fast API MLflow Model Monitoring
AWS GCP Azure
Master of Science – Data Science
University of Arizona
2022 - 2024
Bachelor of Technology – Computer Science
DIT University
2016 - 2020
Note: The above list includes some of my key publications. For a complete list, please visit my Google Scholar profile.
This project creates a streamlined NLP pipeline for Indian legal documents, featuring Named Entity Recognition (NER), rhetorical role identification, and summarization. The pipeline is carefully designed and trained, with rigorous data curation and annotation, and evaluated using robust metrics.
Jugalbandi is a free and open platform that combines the power of Large Language Models such as ChatGPT and Indian language translation models such as those under the Government of India's Bhashini mission to power conversational AI solutions in any domain.
Aalap (Assistant for Legal and Paralegal functions in India) is an instructions fine-tuned version of Mistral 7B that can perform specific legal tasks in the Indian context. This research model intends to show that we can develop tasks for the legal domain and teach LLMs to do them at an affordable cost.