Nzubechukwu Ohalete add
images/headshot.jpg
Data Science · Machine Learning · Trustworthy AI

Nzubechukwu Ohalete (Nzube)

PhD candidate in Data Science & Analytics at Kennesaw State University. I build and evaluate large language model systems for survey research, demographic bias auditing, and trustworthy AI.

Atlanta, Georgia
01

About

As AI systems increasingly stand in for human respondents in research, they raise the hard question of whether we can trust what they produce, and whether they carry hidden biases. My research answers that by building autonomous AI agents that generate synthetic respondents and complete surveys, then rigorously measuring how faithfully they reflect real populations.

Alongside the academic work, I’ve partnered with Delta, Blue Cross Blue Shield of TN, Atlanticus, and Southern Company on applied machine learning, and presented at INFORMS and JSM. I’m a builder at heart, and I love playing with data.

02

Education

Ph.D.

Kennesaw State University

Data Science & Analytics
Dissertation: Investigating Demographic Bias and Autonomous Survey Completion in Large Language Models.
4.0 / 4.0 · Kennesaw, GA · in progress
M.S.

Bowling Green State University

Applied Statistics (Business Analytics)
Thesis: A Study of Online Auction Processes Using Functional Data Analysis.
3.92 / 4.0 · Bowling Green, OH · April 2022
B.S.

University of Nigeria

Mathematics
First Class Honors. Thesis on exact solutions of the Navier–Stokes equations.
4.65 / 5.0 · Nsukka, Nigeria · July 2016
03

Publications

2025
COSTAR-A: A Prompting Framework for Enhancing Large Language Model Performance on Point-of-View Questions
Ohalete, N., Gittner, K., & Matheny, L. · arXiv:2510.12637
arXiv
2025
Comparing Different Machine Learning Models for Credit Risk Prediction
Ohalete, N., Muritala, F., & Ray, H. · Joint Statistical Meetings (JSM), Nashville, TN
DOI
2022
A Study of Online Auction Processes Using Functional Data Analysis
Ohalete, N. · Master's thesis, Bowling Green State University · OhioLINK ETD
OhioLINK
04

Industry Research

Delta Air Lines presentation add
images/delta.jpg

Delta Air Lines

Customer Experience

Analyzed over 32M calls, 13M messages, and 60M intent records to reduce repeat customer contacts. A CatBoost model reached 77% accuracy and drove a modeled 13% reduction.

Southern Company presentation add
images/southern.jpg

Southern Company

Churn Forecasting

Forecast early customer churn across 64 engineered features. A random forest reached 96% accuracy and 96% F1.

1st place · CDSA Industry Competition

Blue Cross Blue Shield of TN

Healthcare

Evaluated continuous glucose monitoring across over 500K member records and claims, with case-control matching on 7,500 pairs. Type I patients on CGM saw significantly greater HbA1c reductions than matched controls.

Atlanticus

Credit Risk

Modeled credit risk on about 1M records, applying SMOTE for class imbalance and trimming 458 features to 116. A random forest reached 88% accuracy and 81% precision.

3rd place · CDSA Industry Competition
05

Selected Data Science Projects

06

Selected Talks

INFORMS 2025 presentation add
images/informs.jpg
INFORMS 2025 · Atlanta, GA

Bridging the Gap: COSTAR-A for Smarter Prompting in Localized Language Models

INFORMS Annual Meeting · October 2025
JSM 2025 presentation add
images/jsm.jpg
JSM 2025 · Nashville, TN

Optimizing Credit Risk Classification Using Machine Learning Techniques

Joint Statistical Meetings · August 2025
Panelist — "AI in Higher Education: Evolving Use Cases and Ethical Implications." USG Ethics Awareness Week, Kennesaw State University. November 2025.
07

Teaching

Over a decade teaching mathematics, statistics, and data science across the U.S. and Nigeria, as both instructor of record and teaching assistant. Roles marked IoR were taught as instructor of record.

CourseLevelRole & Term
Introduction to Data Science
DATA 1501
Bachelor'sIoR Spring 2026 · TA Fall 2023
Python for Data Science
DS 7140
Master'sTA · Spring 2025
Statistical Computing
STAT 7020
Master'sTA · Fall 2023, Spring 2024
Data Mining in Business Analytics
STAT 4440
Bachelor'sTA · Spring 2022
Data Mining
STAT 6440
Master'sTA · Spring 2022
Regression Analysis
STAT 5020
Bachelor's, Master'sTA · Fall 2021
Linear and Integer Programming
OR 6610
Master'sTA · Fall 2021
Elementary Differential Equations I & II
MTH 203 / 204
Bachelor'sIoR 2019–2021
Elementary Mathematics I & II
MTH 101 / 102
Bachelor'sIoR 2019–2021
08

Honors

Outstanding Student Award add
images/award.jpg
2026

Outstanding Student Award

Top doctoral student in the PhD in Data Science & Analytics, Kennesaw State University.

3MT Award add
images/award3mt.jpg
2026

Three Minute Thesis — First Place

Graduate College, Kennesaw State University.

2022

James A. Sullivan Outstanding Graduate Student Award

A $400 award to the top graduate student in the M.S. in Applied Statistics program at BGSU.