AI & Machine LearningApr 2026 — Jun 2026

Colon Histopathology AI Classifier

Machine Learning Engineer & Data Scientist

Scroll

01 — Overview

End-to-end ML research pipeline automating colon cell cancer detection and cell-type classification from 27×27 RGB histopathology patches. Rigorous patient-level evaluation on unseen clinical data.

Exceeded diagnostic targets (≥0.90 cancer detection, ≥0.60 cell-type) on genuinely unseen patients — not just new images from seen patients.

02 — Gallery

03 — Key Contributions

Designed, trained, and evaluated 12 distinct models across two diagnostic tasks using classical ML (SVM, Random Forest) and custom PyTorch CNNs.

Implemented Test-Time Augmentation (TTA), dynamic learning rate scheduling, and per-epoch data augmentation.

Conducted controlled transfer learning experiments with CIFAR-10 pretraining, documenting negative transfer between natural-image and medical domains.

Architected strict patient-level train/validation/test splits preventing data leakage from staining and scanner artifacts.

Achieved 0.91 Macro F1 on binary cancer detection and 0.75 Macro F1 on 4-class cell-type classification on held-out unseen patients.

Performed comprehensive EDA using PCA, t-SNE, and K-Means clustering to analyze latent space and validate decision boundaries.

04 — Features

Patient-level evaluation preventing clinical data leakage

12 models across cancer detection and cell-type classification

CNN + transfer learning + TTA optimization

Macro F1 as primary metric for class imbalance

Controlled experiments isolating augmentation and ensemble effects

05 — Links

GitHub Repository→Kaggle Notebook→

← Previous

Autonomous Xiangqi-Playing UR5e Cobot

Internbot & interbotRAG