Hi, I am Vivek Aryan.
Data Scientist | ML Engineer | AI Engineer
Welcome to my Data Science Portfolio! I am a skilled data scientist proficient in ML and AI with more than 3.5 years of professional experience.
Explore my projects showcasing expertise in predictive modeling, NLP, computer vision, and more.
Currently seeking full-time opportunities from May 2024.
Contact
+1 (346) 303-8568
Location
Houston, Texas
About Me
My introduction
I am a Data Science master's student with one year of focused Generative AI research experience, particularly in the Large
Language Models space, and over 2.5 years of professional industry experience analyzing data and providing recommendations
in product- and service-based companies. Skilled in machine learning, statistics, data visualization, and generative AI. This
blend of academic rigor and practical industry exposure positions me as a versatile professional capable of navigating the
complexities of data science. Highly dedicated problem solver, goal-oriented, and an efficient team player. Self-driven and
a fast learner.
What I may lack in years of experience, I compensate for with my ability to grasp new tools and techniques quickly. This is evident
in my journey from Civil Engineer to Business Intelligence Engineer and ultimately to Data Scientist, gaining experience in
generative AI and computer vision along the way.
Domain Knowledge
Industries and domains I worked in
Food Tech
Ad Tech
Fin Tech
E-Commerce
Explainable AI
User Experience and Customer Experience
Technical Skills
My technical level
Programming
Python: Advanced
PyTorch: Advanced
HTML: Proficient
CSS: Proficient
JavaScript: Proficient
Database
MySQL: Advanced
Redshift: Advanced
Snowflake: Advanced
Vector Database [FAISS, Weaviate]: Proficient
Graph Database [NebulaGraph, Neo4j]: Proficient
Analytical Tools
Tableau [Certified]: Advanced
Power BI: Advanced
Microsoft Excel: Advanced
Microsoft PowerPoint: Advanced
Machine Learning Algorithms
Linear Regression
Logistic Regression
KNN
Decision Trees
Random Forest
Support Vector Machines
Apriori Algorithm
Dimensionality Reduction [PCA, SOM, t-SNE]
Deep Learning/Artificial Intelligence
Recurrent Neural Networks
LSTM
Transformers
Large Language Models
Convolutional Neural Networks
Object Detection
Image Segmentation
Pose Detection
Prompt Engineering
Retrieval Augmented Generation
Knowledge Graphs
DevOps Tools
Git: Advanced
Docker: Proficient
AWS: Proficient
Data Analysis & ML/DL Dependencies
NumPy
Pandas
SQL
OpenCV
Pillow
TensorFlow
PyTorch
Scikit-Learn
Experience
My personal journey
M.S. in Data Science
University of Houston - Main Campus, Houston, Texas
GPA: 3.93
B.Tech in Civil Engineering
Manipal Institute of Technology, Manipal, India
AI Research Assistant
Aiceberg - Houston, Texas
- Fine-tuned large language models (such as Llama 2) on a custom dataset using the LoRA and QLoRA fine-tuning methods, and quantized the models.
- Grounded the fine-tuned LLM with external non-parametric knowledge through RAG and Knowledge Graphs, and generated quality responses through prompt engineering to tackle LLM hallucination.
- Conducted research and experiments across multiple frameworks (LangChain, LlamaIndex), embedding models (SOTA embedders), and vector storage databases (from simple methods to FAISS) to optimize both the speed and quality of processes.
- Researched techniques to enhance the interpretability and explainability of text-generative AI. Developed an end-to-end commercial integration from a cybersecurity perspective.
- Developed and experimented with novel feature extraction techniques using vector embeddings and prompt engineering.
Business Analyst
Meesho - Bangalore, India
- Generated leads through data mining and measured the impact/performance of webinars and 1:1 training through A/B testing.
- Provided business recommendations to improve supplier engagement.
- Developed and maintained analytical dashboards utilized by stakeholders to track the L0 metrics of the Supplier Activation charter.
Business Analyst
Swiggy - Bangalore, India
- Implemented A/B testing and normalizations to measure the impact/performance of in-house products or features on the Chatbot (CRM) and formulated the necessary business recommendations.
- Improved the CPO by 15% by changing the nomenclature of a bot disposition. Reduced 95th percentile customer wait times during peak hours by 60% by balancing the load.
- Used Power BI to develop and maintain analytical dashboards to monitor KPIs, identify trends, and track company initiatives and agents' performance.
- Contributed to the formulation of various metrics (active agents) and the enhancement of a bot efficacy metric.
- Conducted driver analysis on key metrics to identify potential improvement areas in the Swiggy Chatbot flow.
- Collaborated with enterprise data warehouse, data governance, and business teams on data quality issues, as well as the architecture of data repositories or fact tables under my purview.
Product Data Analyst Trainee
Capital Float - Bangalore, India
- Used tools such as Redshift, Python, Excel, and Power BI to define metrics, drive roadmaps, and provide data-driven insights to improve the Unsecured Business Loan product and drive growth.
- Facilitated Root Cause Analysis (RCA) on incidents. Produced statistics and reports to demonstrate performance.
- Performed data mining to drive growth. Designed a one-stop source schema in the database that increased efficiency.
Data Analyst Intern
InMobi - Bangalore, India
- Worked on structured data and performed statistical analysis to identify patterns and trends for the iDSP product.
- Delivered insights and inferences that helped boost the business side of the company. Used tools such as Microsoft Excel, Python, and SQL, and worked on customer relations.