AVIKSHITH REDDY YELAKONDA DATA SCIENCE · MACHINE LEARNING · AI

Reliable models. Real business impact.

I build end-to-end ML and AI systems, including RAG pipelines, NLP workflows, and chatbots, covering data ingestion, evaluation, and deployment, with a focus on reliability, scalability, and measurable business impact.

Dallas, Texas · MS Computer Science (AI/ML), SMU

End-to-End Machine Learning & AI Workflows

overview

I build production-ready systems across Data Science, ML, and AI

I design, develop, and deploy machine learning and AI solutions end-to-end — from data preparation and modeling to deployment, monitoring, and decision support. My work blends strong statistical foundations with modern ML, NLP, and GenAI workflows.

Data Science Foundations

  • EDA, feature engineering, statistical modeling, forecasting, and hypothesis testing
  • Model evaluation, validation, explainability, and experiment-driven analysis

ML & AI Engineering

  • Supervised ML, time-series models, and experimentation (A/B testing)
  • NLP and LLM-based systems including RAG pipelines, embeddings, and prompt workflows

Production & Impact

  • ETL/ELT pipelines with Airflow, dbt, Spark/Databricks, and cloud platforms
  • Model deployment, tracking, and monitoring with Docker, MLflow, and CI/CD

Python · SQL · Data Science · Machine Learning · LLMs & RAG · Docker · MLflow

Skills

stack

Programming Languages

Core
Python
SQL
R
C++
Bash
SAS

Machine Learning & AI

Modeling
Regression & Classification
Clustering & Segmentation
Time-Series Forecasting
Model Training & Evaluation
Feature Engineering
A/B Testing & Experimentation

NLP & Generative AI

LLMs
LLMs & Prompt Engineering
Retrieval-Augmented Generation (RAG)
Embeddings & Semantic Search
Transformer Models
Vector Databases
Unstructured Document Processing

ML Frameworks & Libraries

Libraries
scikit-learn
PyTorch
TensorFlow
Hugging Face
Explainability (SHAP, LIME)
pandas & NumPy

Data Engineering

Pipelines
ETL / ELT Pipelines
Spark / PySpark
Databricks
Data Quality & Validation

Cloud & Data Platforms

Scale
AWS
Azure
GCP
Snowflake
BigQuery
Redshift

MLOps & Deployment

Production
Docker
MLflow
CI/CD Pipelines
Model Versioning & Monitoring
Reproducible ML Workflows
Batch & Inference Pipelines

Analytics & Visualization

Insights
Power BI
Tableau
Looker
Matplotlib
Seaborn
Excel

Developer Tools

Collaboration
Git
GitHub
JIRA
Streamlit
REST APIs
Agile Workflows

Featured Projects

work

Consumer Affairs Lead Conversion Case Study

Built a predictive modeling pipeline (Logistic Regression, XGBoost), achieving ROC-AUC ≈ 0.68 and 2.3–2.4× lift in top-decile leads; shipped Power BI dashboards for stakeholders.

Python · scikit-learn · XGBoost · Power BI
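
For readers unfamiliar with the metric, top-decile lift is the conversion rate among the highest-scored 10% of leads divided by the conversion rate across all leads. The sketch below is only an illustration of that definition, not the project's actual code: the function name `top_decile_lift` and the toy data are hypothetical, and the decile size is assumed to be the lead count floor-divided by 10.

```python
def top_decile_lift(converted, scores):
    """Conversion rate in the top 10% of scored leads divided by the overall rate.

    converted: list of 0/1 outcomes (1 = lead converted)
    scores:    list of model scores, higher = more likely to convert
    """
    if len(converted) != len(scores) or not converted:
        raise ValueError("converted and scores must be non-empty and equal length")

    overall_rate = sum(converted) / len(converted)
    if overall_rate == 0:
        raise ValueError("lift is undefined when no lead converted")

    # Rank leads from highest to lowest score.
    ranked = sorted(zip(scores, converted), key=lambda pair: pair[0], reverse=True)

    # Assumption: the top decile is the first len // 10 leads (at least one).
    n_top = max(1, len(ranked) // 10)
    top_rate = sum(outcome for _, outcome in ranked[:n_top]) / n_top

    return top_rate / overall_rate


if __name__ == "__main__":
    # Hypothetical toy data: 20 leads with strictly decreasing scores.
    toy_scores = [i / 20 for i in range(20, 0, -1)]
    # The top 2 leads both converted; 4 of 20 converted overall.
    toy_converted = [1, 1, 0, 1, 0, 0, 0, 0, 0, 1] + [0] * 10
    print(top_decile_lift(toy_converted, toy_scores))
```

A lift of 2.3 to 2.4 read this way means the top-scored tenth of leads converted at roughly 2.3 to 2.4 times the overall rate.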

AI Assignments – Intelligent Agent Systems

Designed AI agents for search and optimization (maze, n-Queens, Connect-4) using BFS/DFS/A*, Hill-Climbing, Simulated Annealing, and Minimax+Alpha-Beta.

Python · NumPy · Matplotlib · AI Search

University Course Teaching Assistant RAG

Built a multi-tenant RAG teaching assistant where professors upload course PDFs/PPTX and students get grounded Q&A scoped to their professor and course.

Python · Streamlit · OpenAI API · FAISS · RAG · Docker

ML Assignments – Predictive Modeling and Neural Nets

Compared supervised learners (LogReg, PCA+MLP, CNN, Wide and Deep). Tuned optimizers and evaluated across datasets.

Python · scikit-learn · TensorFlow · Keras

Revenue Growth Campaign Optimization

Designed A/B tests, analyzed KPIs, and improved marketing ROI by 22% with budget allocation recommendations.

Python · A/B Testing · KPI Analysis

Stock Price Time-Series Forecasting

Forecasted Shopify and Alibaba using ARIMA and exponential smoothing to support investor risk profiling.

Python · ARIMA · Forecasting

Super Bowl Ad Analysis

Performed data wrangling, EDA, and visualization to uncover trends in ad spending, brand presence, and engagement.

Python · pandas · Seaborn · Matplotlib

Columbus Trash and Recycling RAG Assistant

Developed an end-to-end RAG system using LangChain and OpenAI LLMs to answer from city PDFs and pages.

Python · LangChain · OpenAI API · ChromaDB

Financial Education LLM — TinyLlama Fine-Tune

Ingested CFPB guides, generated Q&A pairs, fine-tuned TinyLlama with LoRA, and shipped a chatbot.

Python · PyTorch · Hugging Face · PEFT (LoRA)

NYC Taxi Demand Forecasting

Predicted hourly demand (92%+ accuracy) with weather and trip features using tuned models for planning.

Python · LightGBM · XGBoost · CatBoost

Experience

timeline

Data Scientist — PioneerSoft Corporation

Aug 2025 – Present · United States
  • Designed and deployed end-to-end ML systems across ingestion, feature engineering, model training, evaluation, and production monitoring, supporting reliable decision-making in regulated environments.
  • Built and validated predictive and statistical models using Python and SQL, applying feature engineering, cross-validation, and offline evaluation to ensure model stability as data distributions evolved.
  • Developed NLP and LLM-powered workflows—including RAG pipelines, embeddings, and API-based integrations—to extract insights from unstructured documents and automate downstream analytics.
  • Engineered scalable data and feature pipelines with strong data quality and governance controls, enabling consistent experimentation and repeatable model training.
  • Operationalized ML systems using Docker, MLflow, CI/CD, and cloud platforms, enabling reproducible experimentation, controlled releases, and production monitoring across environments.
  • Partnered with engineering and business stakeholders to translate ambiguous analytical requirements into production-ready ML solutions, balancing technical rigor with real-world impact.

Python · SQL · ML Systems · LLMs · RAG · MLOps · MLflow · Docker · CI/CD · Data Pipelines · Cloud

Cox Copy Center — Support Part-time

SMU Cox School of Business · Sep 2024 – May 2025 · Dallas, TX · On-site
  • Maintained secure access to confidential materials (exam papers, keys, staff resources).
  • Handled high-volume print/copy requests under tight deadlines for professors and staff.
  • Served as the point of contact for real-time staff support in a fast-paced academic environment.
  • Assisted with building security procedures and safe closing operations after hours.
  • Supported exam proctoring while upholding academic integrity.

Communication · Customer Service · Team Coordination · Time Management

Student Ambassador — RLSH SMU Part-time

Southern Methodist University · Sep 2023 – Aug 2024 · Dallas, TX · On-site
  • Served as a front-desk representative supporting student engagement and housing operations.
  • Maintained accurate housing records and ensured policy clarity through consistent documentation.
  • Resolved routine and urgent concerns with confidentiality and professionalism.
  • Led one-on-one and group engagement to improve residential experience.
  • Served as interim Program Coordinator for a Dorm Leadership Team.

Leadership · Event Planning · Problem Solving · Slack

Data Analyst — Yogin

Jan 2021 – May 2023 · 2 yrs 5 mos · India · Hybrid
  • At Yogin, I worked across the full data analysis lifecycle, supporting business and product teams with data-driven insights, forecasting, and scalable analytics workflows. My role focused on transforming raw, multi-source data into reliable analytical datasets and models that informed planning, performance tracking, and operational decision-making.
  • I performed extensive data cleaning, exploratory data analysis (EDA), and feature engineering using Python and SQL to uncover trends, behavioral drivers, and performance patterns across large datasets. Building on this foundation, I developed statistical and machine learning models for forecasting, segmentation, and classification, applying validation, hypothesis testing, and evaluation techniques to ensure results were interpretable and dependable.
  • To support repeatable analytics and reporting, I designed and maintained ETL pipelines to ingest, validate, and transform data for downstream analysis and modeling. I also built dashboards and visual reports to communicate KPIs, experiment results, and insights clearly to non-technical stakeholders, helping teams move from raw data to actionable decisions.
  • In addition to core analytics and ML work, I supported early AI-oriented workflows by integrating machine learning models into analytical pipelines and experimenting with NLP-based techniques for text analysis and unstructured data. Throughout my time at Yogin, I collaborated closely with cross-functional teams to translate analytical requirements into scalable, maintainable solutions, emphasizing data quality, interpretability, and long-term usability.

Python · SQL · EDA · Feature Engineering · Forecasting · Machine Learning · ETL Pipelines · Data Visualization · Power BI · Tableau · NLP

Education & Achievements

profile

B.Tech (Computer Engineering)

2019 – 2023

CVR College of Engineering — Hyderabad

Master's (Computer Science, AI/ML)

2023 – 2025

Southern Methodist University — Dallas

Leadership Roles

  • Coordinator Head for the college annual fest; drove cross-team collaboration and seamless execution.
  • Program Coordinator for the SMU residential leadership team; led peer engagement and strengthened community culture.
  • Coordinator for Street Cause at CVR College of Engineering; managed volunteers and executed social impact campaigns.
  • Board Member, SMU International Student Board; represented student interests and fostered global inclusivity.

Recognition

  • Nominee, Unsung Hero Award — SMU RLSH Year-End Student Banquet.

Certifications

  • British Airways: Data Science (Forage, May 2025)
  • Deloitte AU: Data Analytics (Forage, May 2025)
  • Quantium: Data Analytics (Forage, May 2025)
  • Tata: Data Visualisation (Forage, May 2025)
  • Data Analyst Certification, OneRoadmap (Apr 2025)
  • Google Project Management, Coursera

Hackathons & Events

  • Hackathon, SMU
  • Career Development Programme (Oracle), CVR

Let's Connect

reach

I'm actively seeking opportunities in Data Science and AI. Let's discuss how I can contribute to your team.