Causal Impact Measurement
Built experimentation and uplift modeling workflows to identify high-response segments and quantify true incremental impact.
+65% relative conversion lift · p < 0.001I build production-grade machine learning, analytics, and generative AI systems that turn messy data into decision-ready products: predictive models, RAG pipelines, experimentation platforms, and monitored deployment workflows.
My work centers on three strengths: predictive modeling for decisioning, LLM and RAG systems for knowledge workflows, and data/ML platforms that make models repeatable, observable, and deployable.
Built experimentation and uplift modeling workflows to identify high-response segments and quantify true incremental impact.
+65% relative conversion lift · p < 0.001Improved regulated decision support by building supervised models for borrower behavior and portfolio risk analytics.
+14% default prediction accuracyHandled imbalanced conversion data with calibrated ranking, explainability, and business-focused prioritization.
ROC‑AUC ≈ 0.683 · 2.35× top-decile liftDeveloped forecasting and analytics systems with evaluation diagnostics, reporting layers, and stakeholder-ready outputs.
MAPE ≈ 20.8% · 35% reporting effort reducedTranslate business questions into modeling problems with clear metrics, interpretable outputs, and measurable operational impact.
Build user-facing AI systems with retrieval, ranking, generation, and evaluation loops that prioritize grounded outputs and practical usefulness.
Build reproducible data and model pipelines with monitoring, lineage, and deployment practices that support reliable production delivery.
This section ranks the strongest ML signals in my profile by scanning portfolio project content, resume text, and GitHub repository metadata. It highlights where my portfolio shows the most depth, not just a list of tools.
Scores are weighted by repeated evidence across projects, experience, and recent repositories.
A word-cloud style view of the highest-frequency ML themes appearing across the portfolio footprint.
Each theme is tied back to concrete portfolio, resume, or repository evidence.
A full portfolio of machine learning, analytics, and applied AI systems spanning predictive modeling, experimentation, recommendation, OCR, time-series forecasting, and retrieval-augmented generation. Each project is framed around a business problem, a technical approach, and an operational outcome.
Built an interactive reinforcement-learning sandbox that makes Q-learning legible to non-specialists through environment controls, policy updates, and live value-map visualization.
Designed a reviews intelligence system that clusters recurring issues, attaches grounded evidence, scores outputs with RAGAS, and routes executive alerts so teams can move from raw feedback to action quickly.
Automated a high-friction expense workflow with OCR, field extraction, confidence scoring, and human review, reducing manual handling while improving auditability and downstream analytics readiness.
Built a finance-focused LLM assistant that combines fine-tuning with retrieval over SEC filings to deliver cited answers, explain key metrics, and support analyst research workflows.
Built a multi-course teaching assistant that uses hybrid retrieval and course-isolated knowledge stores to deliver citation-grounded answers with lower hallucination risk and better student self-service.
Built an experimentation workflow for incremental-impact measurement, combining power analysis, uplift modeling, and segment targeting to identify the customers most likely to convert under treatment.
Compared classical and deep-learning forecasting approaches for short-horizon retail demand planning, using feature engineering and error diagnostics to support staffing and inventory decisions.
Built a lead-scoring pipeline for imbalanced conversion data, using feature engineering, calibrated XGBoost, and SHAP-based interpretation to prioritize outreach by expected business value.
Evaluated marketing-channel performance with statistical testing and time-aware regression analysis to inform budget allocation and choose the higher-return campaign strategy.
Built a recommendation and ranking workflow over user-item interactions to generate personalized Top-N suggestions and support experimentation around engagement and discovery quality.
Combined churn prediction and segmentation to help retention teams identify at-risk customers, prioritize intervention, and tailor actions to higher-value behavioral segments.
Developed an imbalance-aware fraud detection pipeline that generates review-ready risk scores, helping teams surface suspicious transactions while controlling false positives.
Standardized noisy EHR data into modeling-ready datasets and built predictive baselines that support clinical risk analysis and more reliable downstream healthcare analytics.
Analyzed equity time series with indicator engineering and visual diagnostics to compare trend, volatility, and portfolio behavior across multiple public-market assets.
Applied dimensionality reduction and clustering to uncover patient-risk groupings in partially labeled clinical data, enabling more targeted downstream analysis.
Built a preference-based property matching system that scores candidate listings, ranks them for fit, and automates recommendation delivery to renters.
Created reusable scraping pipelines that collect job-market listings, normalize them into structured datasets, and support downstream labor-market analysis.
GPA: 3.5 / 4.0. Graduate focus in applied machine learning, AI systems, and production-oriented data science.
Built the technical foundation in software, analytics, and machine learning that shaped later AI/ML specialization.
I’m pursuing Data Scientist, ML Engineer, and Applied AI roles where I can own analytics and machine learning systems from raw data through deployment-ready outputs.