Data Scientist · AI Researcher

Thanya
Ramanathan

Mechanical engineer turned AI practitioner. I build production-ready systems at the intersection of NLP, generative models, and computer vision — currently as a Data Scientist at Tesco. MS AI from NTU Singapore.

2
IEEE Publications
4+
AI Domains
5+
Projects
MEng
+ MS AI · NTU
01

About

In 2019 I set out to become a mechanical engineer — then programming stole my heart. What started as curiosity about code became a full pivot into AI, driven by the belief that the most interesting engineering problems today sit at the intersection of mathematics, computation, and real-world data.

I completed my Master's in Artificial Intelligence at Nanyang Technological University in Singapore (2023–24), where my thesis explored how large language models can power next-generation recommendation systems. Before that, I spent six months at Amazon India as a Business Research Analyst, bridging data analysis with business strategy.

Today I'm a Data Scientist at Tesco, working on ML systems that operate at retail scale. Outside of work you'll find me reading, cooking something experimental, chasing an adrenaline rush, or dancing.

Research interests

  • Natural Language Processing
  • Generative Models & GANs
  • Computer Vision
  • Reinforcement Learning
  • Recommendation Systems
  • Unsupervised Domain Adaptation
02

Background

Aug 2024 — Present
Tesco
Data Scientist

MoE text classification: Implemented a Mixture of Experts architecture for hierarchical text classification — 97% accuracy on noisy datasets, cutting misclassification by 17% vs manual classification. Graph personalization engine: Built a graph-based personalization engine using multi-relational customer trajectory modelling, projected to surface £45M in incremental revenue in 2026. Fraud detection pipeline: Led end-to-end design and deployment of a fraud detection automation pipeline, achieving 80% reduction in manual case review.s

NLP ML Deployment Python
2023 — 2024
Nanyang Technological University · Singapore
MS in Artificial Intelligence

Focused on NLP, generative models, and recommendation systems. Thesis: Next Point-of-Interest Recommendation using Large Language Models — combining GETNext with LLM-generated semantic embeddings for personalised POI prediction.

Research LLMs Singapore
Jan — Jun 2023
Amazon India
Business Research Analyst Intern

Conducted data-driven research to surface insights supporting business strategy. Collaborated cross-functionally to translate analytical findings into actionable recommendations.

Analytics Business Research
2019 — 2023
Undergraduate — Mechanical Engineering
Bachelor of Engineering

Engineering fundamentals shaped how I think about systems, constraints, and optimisation — skills that transferred naturally to machine learning. Published two IEEE papers during this period.

IEEE Publications × 2 Research
03

Expertise

AI / ML domains

NLP Generative Models Computer Vision Reinforcement Learning Unsupervised Domain Adaptation Recommendation Systems Statistical Modelling Deep Learning

Tools & technologies

Python PyTorch HuggingFace Sentence-Transformers Scikit-learn SQL Pandas / NumPy Git Agile
04

Work

Transplay Logistics
Project · Reinforcement Learning · Healthcare AI

Transplant Logistics — NHS UK OpenEnv

Full RL environment simulating national organ allocation decisions, calibrated to real NHS Blood and Transplant 2022/23 data. An LLM agent acts as transplant coordinator — enforcing blood compatibility, HLA typing, cold ischaemia limits, and clinical urgency across 15 UK transplant centres. LLaMA-3.3-70b scores 0.917 mean vs 0.539 GPT-4o-mini baseline. Supports GRPO fine-tuning via TRL.

Reinforcement Learning LLM Agent GRPO FastAPI Healthcare OpenEnv
View on GitHub
Stanford RNA 3D Folding
Kaggle Competition · Structural Biology · 2026

Stanford RNA 3D Folding — Part 2

Predicted RNA 3D structure from sequence using Template-Based Modelling (TBM) combined with NVIDIA's RNAPro — a 500M-parameter diffusion model built on AlphaFold 3's architecture with a RNA-specific template embedder and RibonanzaNet2 as a frozen sequence encoder. Achieved public leaderboard score of 0.489 (vs 0.554 oracle baseline), with detailed ablations documenting that template quality is the single most critical pipeline variable.

Diffusion Models Structural Biology RNAPro Template-Based Modelling Kaggle
View on GitHub
Image Translation and UDA project
Project · Computer Vision

Image Translation & Unsupervised Domain Adaptation

Used GANs to bridge the domain gap between labelled source datasets and unlabelled target domains — enabling models trained on synthetic data to generalise to real-world scenarios without costly target-domain annotations.

GANs Computer Vision Domain Adaptation PyTorch
View on GitHub
Solar Irradiance Prediction paper
IEEE Publication · Research

Solar Irradiance Prediction

Benchmarked SVM and LSTM architectures for solar irradiance prediction at specific geographic coordinates using NASA POWER historical data. Compared model accuracy across latitudinal and longitudinal positions.

LSTM SVM Time Series IEEE
Read on IEEE
MDMTSP paper
IEEE Publication · Bachelor's Thesis

Multi-Depot MTSP via Genetic Algorithm & RL

Tackled the NP-hard closed-path multi-depot travelling salesman problem with a genetic algorithm (elitist selection, 2-opt mutation, OX crossover), further refined using reinforcement learning to converge on feasible, optimised solutions.

Genetic Algorithm Reinforcement Learning Optimisation IEEE
Read on IEEE
05

Contact

Let's build something interesting together.

Open to data science roles, research collaborations, and interesting conversations about AI. Based in Bengaluru — open to remote & relocation.

Send a message ↗