Shreyas Rajesh

PhD student, UCLA

Shreyas Rajesh

About

I’m a PhD student at UCLA advised by Prof. Vwani Roychowdhury. I’m broadly interested in language modeling and information retrieval. My current focus is on building better information representation systems for LLMs (memory systems) to enable more dynamic, adaptable representations. I’m also interested in applying advances in language modeling to problems in the sciences and in health. I’ve also spent consecutive summers (and a bit more!) at NVIDIA building on-device agents for gamers. At NVIDIA, I worked across the entire stack from curating proprietary tool-use/reasoning data, parameter-efficient fine tuning of Small Language Models, Retrieval mechanisms for system info QA, and a host of optimizations to keep everything working in a tiny form factor!

Research interests

  • Language modeling and retrieval-augmented generation
  • Information retrieval
  • Memory and representation systems for LLMs
  • Limitation and capabilities of Small Language Models, especially in the context of on-device/resource constrained environments.
  • Applications of NLP in scientific and health domains

Papers

[1] Beyond Fact Retrieval: Episodic Memory for RAG with Generative Semantic Workspaces

AAAI 2026 Oral
NeurIPS 2025 — Language, Agents and World Models Spotlight

Shreyas Rajesh, Pavan Holur, Chenda Duan, David Chong, Vwani Roychowdhury

We introduce GSW, which equips LLMs with human-like episodic memory through structured semantic representations that track actors, roles, and states across space and time, outperforming standard RAG frameworks on episodic memory tasks.

[2] Embed-Search-Align: DNA sequence alignment using Transformer models

Bioinformatics 2025

Pavan Holur*, Kenneth C Enevoldsen*, Shreyas Rajesh, Lajoyce Mboning, Thalia Georgiou, Louis-S Bouchard, Matteo Pellegrini, Vwani Roychowdhury

We introduce DNARDE, a DNA language model trained with a contrastive objective for sequence alignment, pairing it with vector-store retrieval over nucleotide sequences. DNARDE outperforms prior transformer baselines on alignment and transfers across chromosomes and species.

[3] Customizing Open Source LLMs for Quantitative Medication Attribute Extraction across Heterogeneous EHR Systems

NeurIPS 2025 — GenAI for Health Workshop

Zhe Fei*, Mehmet Yigit Turali*, Shreyas Rajesh*, Xinyang Dai, Huyen Pham, Pavan Holur, Yuhui Zhu, Larissa Mooney, Yih-Ing Hser, Vwani Roychowdhury

We develop a framework using open-source LLMs to standardize opioid use disorder prescriptions from diverse EHRs, enabling consistent analysis across different EHR sources and formats.

[4] Creating an AI Observer: Generative Semantic Workspaces

arXiv

Pavan Holur, Shreyas Rajesh, David Chong, Vwani Roychowdhury

Initial explorations on what an evolving memory system could look like from the perspective of how an ideal AI would observe and build a semantic map of an ongoing situation.

* denotes equal contribution.