I love exploring new things!!

About

Hi, I'm Jannat, a PhD researcher in Computer Science at VCU. I work on vision models and vision-language models, mostly around making visual and scientific information more accessible.

Most of what I do comes down to building efficient, low-compute multimodal models, with blind and low-vision users in mind from the start rather than as an afterthought. I co-authored a paper at BioASQ 2025 (CLEF) on biomedical information extraction.

Before this I did a bit of everything: machine learning models, multimodal AI accessibility, efficient models, RAG pipelines, full-stack apps, and some NLP on the side.

Off the clock: I believe learning new skills keeps my brain active, so I'm always picking up something new. Tennis is my all-time favorite, though, because the ball makes my body move.

Education

Ph.D., Computer Science

Jan 2026 – Present

Virginia Commonwealth University, Richmond, VA

Advisor: Dr. Tomasz Arodz

M.S., Computer Science

Aug 2024 – Dec 2025

Virginia Commonwealth University, Richmond, VA

Advisor: Dr. Bridget T. McInnes · GPA: 3.9

Master of Computer Applications

Jul 2023 – May 2024

CHRIST (Deemed to be University), Bangalore, India

GPA: 3.7

B.S., Applied Physical Science

Jul 2019 – May 2022

University of Delhi, New Delhi, India

GPA: 3.6 · Emphasis in Mathematics and Physics

Research Interests

Machine Learning, Multimodal AI, and Natural Language Processing

  • Vision–language models and information extraction from scientific figures
  • Robustness of multimodal models to noisy, real-world inputs
  • Accessibility of visual data for blind and low-vision users
  • Retrieval-augmented generation and biomedical text mining

Publications

Taylor, S., Dil, C., Shah, A., Jannat, Oldham, C., Upadhyay, A., Varughese, J., Yazbeck, N., & McInnes, B. T. NLP@VCU at BioASQ 2025: Information Extraction on the GutBrainIE Dataset. BioASQ Workshop, Conference and Labs of the Evaluation Forum (CLEF), 2025.

Leadership & Volunteering

Vice President, Computer Science Club

University of Delhi

2021 – 2022

Volunteer, Enactus Society

University of Delhi

2020 – 2022

Core Member, Computer Science Club

University of Delhi

2019 – 2021

My Professional Journey

Jan 2026 – Present

Graduate Research Assistant (Ph.D.)

Bioinformatics & Machine Learning Lab, Virginia Commonwealth University

Building and comparing vision and vision-language models (ViT, DePlot) for chart understanding.

Sep 2025 – Dec 2025

Grading Assistant

CMSC Advanced Algorithms, Virginia Commonwealth University

Graded assignments and exams for a graduate-level advanced algorithms course.

Sep 2025 – Dec 2025

Teaching Assistant

ENGR 101 Introduction to Engineering, Virginia Commonwealth University

Supported first-year students with Arduino and circuit theory fundamentals and with their theremin projects.

Jun 2025 – Jul 2025

Graduate Student Software Engineer

Engineering Career Services, Virginia Commonwealth University

Built and deployed a student-alumni mentoring interface using Google Sheets and Apps Script.

Jan 2025 – Dec 2025

Research Assistant

NLP Lab, Virginia Commonwealth University

Developed Retrieval-Augmented Generation (RAG) pipelines with LLMs for relationship extraction from 1,500+ biomedical abstracts.

Sep 2024 – Dec 2024

Teaching Assistant

EGRE 246 Programming Using C, Virginia Commonwealth University

Mentored 20+ students weekly, guiding them in debugging, data structures, and algorithmic thinking.

Oct 2021 – Feb 2022

Software Engineer Intern

Raceme Tenders LLP, Delhi, India

Migrated a legacy PHP/MySQL site to AWS in a three-person team, reducing server downtime by 20%.

Featured Projects

AI/ML, web development, and automation projects showcasing my technical skills and problem-solving approach.

RAG Research Platform
Completed

AI-powered research blogging platform with real research paper integration, vector search capabilities, and a beautiful animated UI.

React · Node.js · MongoDB · +4

GutBrainIE
Completed

Designed a robust relation extraction pipeline for gut-brain axis literature.

Python · PyTorch · HuggingFace · +2

Tennis Video Analysis System
Completed

Analyzes tennis videos for player and ball tracking and court line detection using modern computer vision.

Python · YOLOv8 · OpenCV · +1

RideShare
Completed

Real-time carpooling app connecting colleagues and friends who need a ride to campus with those who can provide one.

React Native · Firebase · Node.js · +1

Crawler-Python Script
Completed

Automation script that downloads thousands of tender files from the government tender website to local disk.

Python · Selenium · Web Scraping · +1

Next.js · React.js · Node.js · Express.js · JavaScript · Tailwind CSS · HTML5 · CSS3

PostgreSQL · MongoDB · Firebase · MySQL

Python · TensorFlow · PyTorch · Scikit-Learn · Hugging Face · OpenCV · NLP · Pandas

GitHub · Postman · Ollama · LlamaIndex

Want to Build Something Fun?

I'm always excited to collaborate on cool projects, discuss innovative ideas, or just chat about the latest in AI/ML!

(And yes, if you want to play tennis, I'm definitely up for that too! 🎾)