Mahalakshmi Venkateswarlu 🤖

Mahalakshmi Venkateswarlu

(she/her)

ML Researcher

Georgia Institute of Technology (OMSCS)

Professional Summary

Mahalakshmi Venkateswarlu is an ML Researcher and Engineer with ~6 years of industry experience, currently pursuing an MS in Computer Science (Machine Learning specialization) at Georgia Tech’s OMSCS program.

Her research focuses on LLM inference efficiency, GPU optimization, and LLM-based agents. She is passionate about understanding and improving the systems that make large-scale AI practical from CUDA kernel optimization and GPU memory management to LLM serving infrastructure such as vLLM, PagedAttention, and Mixture-of-Experts routing. Her current work sits at the intersection of recommender systems and LLM agents, exploring context efficiency and cost-quality tradeoffs at scale.

Her paper “Less is More: Benchmarking LLM Based Recommendation Agents” was accepted for oral presentation at the LARS Workshop, ACM The Web Conference 2026 (WWW 2026)

Previously, she worked as an ML Engineer at Zoho Corporation and as a Software Engineer at Cognizant Technology Solutions, building distributed systems, cloud infrastructure on AWS, and production ML pipelines. She is also building AIwithMaha, an educational brand for making ML and AI more accessible.

Education

MS Computer Science (Machine Learning Specialization)

Georgia Institute of Technology (OMSCS)

BE Electrical and Electronics Engineering

Anna University

Interests

LLM Inference & Serving Systems GPU Optimization & CUDA Programming Recommender Systems & LLM Agents ML Systems Research
My Research
My research focuses on LLM inference efficiency, GPU optimization, and LLM-based agents. I am interested in the systems that make large-scale AI practical, from CUDA kernel optimization and GPU memory management to LLM serving infrastructure such as vLLM and PagedAttention. My current work explores context efficiency and cost-quality tradeoffs in LLM-based recommendation agents. Please reach out to collaborate!
Featured Publications
Less is More: Benchmarking LLM Based Recommendation Agents featured image

Less is More: Benchmarking LLM Based Recommendation Agents

We benchmark LLM-based recommendation agents and show that minimal context achieves similar recommendation quality to large context windows, with significant cost savings across …

avatar
Mahalakshmi Venkateswarlu
An example preprint / working paper featured image

An example preprint / working paper

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Duis posuere tellus ac convallis placerat. Proin tincidunt magna sed ex sollicitudin condimentum.

avatar
Mahalakshmi Venkateswarlu
Recent Publications
(2026). Less is More: Benchmarking LLM Based Recommendation Agents. In WWW 2026 LARS Workshop.
(2015). An example journal article. Journal of Source Themes, 1(1).
Recent News
Trending
🎉 Easily create your own simple yet highly customizable blog featured image

🎉 Easily create your own simple yet highly customizable blog

Take full control of your personal brand and privacy by migrating away from the big tech platforms!

avatar
Mahalakshmi Venkateswarlu
🧠 Sharpen your thinking with a second brain featured image

🧠 Sharpen your thinking with a second brain

Create a personal knowledge base and share your knowledge with your peers.

avatar
Mahalakshmi Venkateswarlu
📈 Communicate your results effectively with the best data visualizations featured image

📈 Communicate your results effectively with the best data visualizations

Use popular tools such as HuggingFace, Plotly, Mermaid, and data frames.

avatar
Mahalakshmi Venkateswarlu
👩🏼‍🏫 Teach academic courses featured image

👩🏼‍🏫 Teach academic courses

Embed videos, podcasts, code, LaTeX math, and even test students!

avatar
Mahalakshmi Venkateswarlu
✅ Manage your projects featured image

✅ Manage your projects

Easily manage your projects - create ideation mind maps, Gantt charts, todo lists, and more!

avatar
Mahalakshmi Venkateswarlu