Portfolio

Projects & Research

Research and engineering projects across AI safety, LLM evaluation, LiDAR processing, NLP, and mobile development

Featured

SAE Jailbreak Detection Benchmark

SAEGuardBench

SAE features consistently hurt jailbreak detection. Raw activation probes achieve 0.949 AUROC vs 0.712 for SAE-based methods.

PythonPyTorchTransformerLens+3

Featured

LLM Tool-Use Evaluation

CompToolBench

Evaluation framework testing where 18 LLMs fail across four complexity levels. Found the Selection Gap: 13.2pp lower accuracy on single-tool selection than multi-step composition.

PythonLiteLLMpytest+3

Featured

Backdoor Survival in Model Merging

MergeSafe

Backdoors survive model merging. LoBAM amplification brings attack success to 83-99%. Built a pre-merge scanner with 100% recall.

PythonPyTorchHuggingFace+2

Featured

12-Year Concept Drift Study

TrafficLM

Measuring how website fingerprinting models decay over 12 years. Deep Fingerprint CNN achieves 94.4% but drops under concept drift.

PythonPyTorchscikit-learn+2

Encrypted Traffic Classification

Traffic Fingerprinting

94.1% accuracy classifying encrypted website traffic using only packet-size features across 5,001 samples from 50 websites.

Pythonscikit-learnStreamlit+2

EM Side-Channel Detection

RADAR-Rowhammer

EM side-channel rowhammer detection across 5 architectures, 7 attack patterns, and 3 DRAM platforms. 92.7% accuracy.

PythonPyTorchtorchaudio+2

Adaptive Learning Platform

IELTSLab

21K+ LOC monorepo with 5 containerized microservices. ML sidecar with faster-whisper speech recognition and adaptive testing.

Next.jsReact NativeFastAPI+2

Bilingual Academic Publishing

AnthroCircle

Bilingual academic publishing platform serving 120+ articles and 5,000+ daily readers. Migrated from WordPress to Next.js 15.

Next.jsPostgreSQLPrisma+2

Dual-task NLP Framework

PolyHope

80% F1 score using RoBERTa for hope speech detection.

PyTorchRoBERTaTransformers+1

TTU Parking Intelligence

RaiderPark

Parking intelligence app for Texas Tech with on-device ML (TF Lite), LightGBM + Temporal Fusion Transformer ensemble, and PostGIS geospatial queries.

React NativeTensorFlow LiteLightGBM+4

Python Framework

LiDAR Traffic Safety

Real-time 3D point cloud processing with DBSCAN clustering.

Pythonscikit-learnmatplotlib+1

TxDOT Project

LiDAR Pipeline

10TB+ real-time data processing for infrastructure safety.

PythonCUDABig Data+1

View All on GitHub

Research & Projects — Md A Rahman

SAEGuardBench — Do SAE Features Help Detect Jailbreaks?

CompToolBench — LLM Tool-Use Evaluation

MergeSafe — How Backdoors Survive Model Merging

TrafficLM — A 12-Year Concept Drift Study in Website Fingerprinting

Traffic Fingerprinting — Encrypted Network Traffic Classification

RADAR-Rowhammer — EM Side-Channel Attack Detection

IELTSLab — Adaptive Learning Platform with Speech ML

PolyHope — Hope Speech and Sarcasm Detection

Projects & Research

SAEGuardBench

CompToolBench

MergeSafe

TrafficLM

Traffic Fingerprinting

RADAR-Rowhammer

IELTSLab

AnthroCircle

PolyHope

RaiderPark

LiDAR Traffic Safety

LiDAR Pipeline