Vishvak Murahari
Princeton, New Jersey, United States
2K followers
500+ connections
View mutual connections with Vishvak
Vishvak can introduce you to 3 people at Stealth
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
View mutual connections with Vishvak
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
About
Building the future of the attention economy.
Activity
2K followers
-
Vishvak Murahari shared this🚀 𝗜𝗻𝘁𝗿𝗼𝗱𝘂𝗰𝗶𝗻𝗴 vibe-analyst — the future of data science. No dashboards. No SQL. Just vibe with your data. Built on our breakthrough: Agent Context Protocols (ACP) — the first protocol for building multi-agent reasoning systems. 🌍 In a world drowning in data, making sense of it shouldn’t feel like rocket science. vibe-analyst talks to your data for you — no more manual querying. 🧠 How it works: ACP enables multiple specialized agents to share context, reason together, and tackle complex problems. ⚙️ Point vibe-analyst at your data through MCP servers and autonomously explore your silos, synthesize actionable insights and deliver smart recommendations. 🔥 vibe-analyst is just the beginning — we're unlocking a new era of collaborative AI. ✨ Try it. Fork it. Join the protocol revolution. 📁 Code: vibe-analyst.com 📄 Paper: https://lnkd.in/gx2E-gEc Be the Analyst. #AI #MultiAgentSystems #AgentProtocols #DataScience #IntelligentSystems #vibeanalyst #LLMs #ACPs #MCPs
-
Vishvak Murahari shared this One year ago, our seminal research on "Toxicity in ChatGPT" revealed severe AI safety issues in foundation models. Since then, our work has sparked critical public discourse in both research and journalist circles (WSJ, VentureBeat). This has consequently led to significant investments and commitments from major foundation model players to faithfully abide by the AI safety governance. Alarmingly, our upcoming academic research reports ominous news -- state-of-the-art models like GPT-4 continue to be unsafe and harmful at an unprecedented level. Given the magnitude of our results and their ramifications, our team will be sharing them with the community in a responsible way in the upcoming weeks. Stay tuned for more updates. https://lnkd.in/em87B3_SToxicity in ChatGPT: Analyzing Persona-assigned Language ModelsToxicity in ChatGPT: Analyzing Persona-assigned Language Models
-
Vishvak Murahari shared thisIt was a pleasure hosting Tanmay Rajpurohit and Rama Vedantam, Ph.D. today afternoon at the computer science department at Princeton. Enjoyed discussing #GenAI and #AISafety and research advancement towards the same at Princeton Language Institute (PLI).
-
Vishvak Murahari shared thisI had a wonderful time at ACM SIGGRAPH 2024 in Denver. It was a pleasure discussing the next generation of computing platforms with founders, research leaders, executives, and my close collaborators -- Tanmay Rajpurohit and Ameet Deshpande. Given the rapid advancements in AI applications over the past year, driven by our team's inventions, the computing stack is evolving at an accelerated pace to meet this growing demand. I am glad our team could share our research around LLM agents, efficiency, and AI safety with the broader computing community to shape their views on the next generation of digital infrastructure. It was great to kick off the summer conference season in Denver, and I look forward to the remaining conferences this summer.
-
Vishvak Murahari shared thisAttending ACM SIGGRAPH 2024, the premier graphics and computing conference, in Denver. Looking forward to an engaging week interacting with leaders like Jensen Huang and Mark Zuckerberg.
-
Vishvak Murahari reposted thisVishvak Murahari reposted thisGlad to share our work – "RLHF Deciphered" – which analyzes reinforcement learning for human feedback (RLHF) for LLMs. We examine RLHF from the lens of its fundamental building blocks, succinctly formulating the role of the RL and reward models in aligning LLMs with human preferences. RL has become the de facto paradigm for teaching LLMs and AI agents to serve as helpful and useful assistants. However, current implementations of RLHF permit safety failures, as highlighted by recent studies. While RLHF is useful, our expertise in RL led us to recognize its ad-hoc nature. Our formulation and ideas advocate carefully examining the vast body of RL algorithms that have been carefully researched for over three decades. Our paper not only provides a clear and effective path to understanding RLHF but also serves as a call for RL researchers to devote their efforts towards improving RLHF in a principled fashion. We will be doing our part, this work is just the beginning, exciting work to follow! Work done with an amazing team—Pranjal Aggarwal, Vishvak Murahari, Tanmay Rajpurohit, Ashwin Kalyan, Karthik Narasimhan, Ameet Deshpande, Bruno Castro Da Silva—who bring in their expertise from the fields of NLP, RL, AI Safety, and law. 🔗: https://lnkd.in/e5Ub3AHP
-
Vishvak Murahari shared thisHad a fantastic time hosting a forum on AI safety and personalization in Malta! It was a privilege to interact with researchers, academics, and industry leaders to discuss the major opportunities and risks in generative AI. Thanks to the overwhelming positive response, we will continue to bring together diverse stakeholders and perspectives. Stay tuned for more exciting forums to come! #AI #safety #personalization #Malta #generativeAI #eacl2024
-
Vishvak Murahari reposted thisVishvak Murahari reposted thisInteracted with leading AI researchers in an invited talk at #EACL2024 in Malta, delving into the subject of #AISafety and #Law. Addressed the concerns about slow moving AI legislation and the hasty adoption of #GenAI within the industry for building consumer facing applications. The #AISafety concerns continue to persist due to glaring systemic lack of understanding regarding the fundamental mathematical models at play. The band-aid solutions will only cause more problems and bound to fail in addressing issues like hallucination, privacy protection, copyrights, and bias, while striving to maintain the high accuracy and low latency of #llm models. The crux of the AISafety problems lies in the foundation of ‘foundation models’ which legislators worldwide should take note of in drafting the future AI legislation.
-
Vishvak Murahari shared thisLooking forward to organizing our workshop on AI safety and personalization at EACL 2024, Malta. Tune in tomorrow at 9 AM (France time) for engaging panels, talks, and our fantastic set of accepted papers. Program: https://lnkd.in/eKz5sBXA #eacl2024 #aipersonalization #aisafety
-
Vishvak Murahari reacted on thisVishvak Murahari reacted on thisAfter an incredible journey at Kearney and Deloitte, it’s time for the next chapter! I’m excited to share that I’ve started a new role as Senior Product Manager – Data and AI at Hyatt – a unique opportunity to work at the intersection of AI innovation and hospitality. I’m beyond grateful for the experiences, challenges, and amazing people I've had the privilege to work with along the way. A huge thank you to my colleagues, mentors, and friends at Kearney – Balaji Guntur, Stefan Simonetti, Sean Maharaj, Jeff Ward, Andres Mendoza Pena – for an invaluable journey of building, launching, and scaling products. I also want to extend my sincere appreciation to my leaders at Deloitte – Anjan Roy, Karmendra Jain (KJ), Amit Singh, Karen Ramsey, David Grube, PMP – the opportunities, mentorship, and exposure to cutting-edge technologies, global teams and large-scale implementations have been pivotal in setting me up for success at Hyatt. I’d also like to express heartfelt thanks to my professors and coaches at Northwestern University - Kellogg School of Management – Prof. Mohanbir Sawhney, Prof. Birju S., Prof. Joel Shapiro, Prof. Suzanne Muchin, Prof. Loran Nordgren, Mary Burns, ACC, MBA and Loraine Hasebe – whose guidance and encouragement played a vital role in helping me secure this opportunity. As I step into this new role at Hyatt, I carry forward everything I’ve learned from each chapter of my career. I look forward to working with – Raymond Boyle, Joanna Yang, Joern Mumme and team – and contributing to Hyatt’s commitment to advancing care through data-driven decisions and innovation, and driving meaningful impact for Hyatt's colleagues, guests, and partners. If you're passionate about AI, data, or the future of hospitality, let’s connect. Please feel free to reach out or share your thoughts in the comments below. #NewBeginnings #ProductManagement #Data #AI #Hyatt #TravelAndHospitality #Innovation #Gratitude
-
Vishvak Murahari reacted on thisVishvak Murahari reacted on thisExciting News 📢 I successfully defended my PhD thesis on "Genome Mining and Machine Learning Algorithms for Natural Product Drug Discovery" at Carnegie Mellon University! 🎓 Imagine if you could train an AI to discover drugs to combat antibiotic-resistant pathogens. My work focused on leveraging transformer models like ESM to discover new drugs - naturally produced by microbes like fungi - directly from their DNA. Some highlights I'm grateful and proud of: 1. We discovered a new antifungal drug, Edaphochelin A, which kills the deadly pathogen Candida auris, recently classified by the CDC as an urgent threat. Part of this line of work was recently published in Nature Communications: https://lnkd.in/gWRxkr_u. Excited to share the rest soon, which includes promising results in mice studies. 2. We developed and open-sourced MASPR, a model that leverages large transformer neural networks to predict which building blocks microbes use to create these natural product drugs: MASPR improves over previous approaches by almost 16% in out-of-distribution settings and our manuscript will be available soon. 3. Getting invited to share our deep learning model, SPRINT, at NeurIPS MLSB: SPRINT can screen billions of molecules against whole proteomes, helping elucidate molecular mechanisms of action: https://lnkd.in/g2v5AAXb As I move onto my next adventure (next post coming), I want to thank a few key folks for making the last 5 years the most memorable: my advisor Hosein Mohimani, committee members, collaborators like Caleb Ellington, Andrew McNutt, Monica D., Mustafa Guler, Donghui Yan, and many, many other friends. I'd also like to thank Felicis for accepting me as an AI Fellow, several mentors who helped me along the way: David Koes, Sundeep Peechu, Karthik Jagadeesh, Raphael Townshend, and CMU faculty who inspired me to strive for excellence: Ziv Bar-Joseph, Albert Gu, Louis Felix Nothias, Eric Xing, Russell Schwartz, and many others. Most of all, I'd like to thank my family for supporting me and coming to Pittsburgh to watch my defense! Pranava Adduri, Nancy Wang, Wayne Duso, my parents, and my grandparents - you guys are the best. 😇
-
Vishvak Murahari liked thisVishvak Murahari liked thisEnjoyed delicious food and warm weather while attending #EMNLP2024 at #Miami. Had fun meeting friends, ex-colleagues and research collaborators from all over the world; discussions around recent developments and research ideas were intellectually stimulating!
-
Vishvak Murahari liked thisVishvak Murahari liked thisHad an excellent visit to HBS and MIT Sloan to explore their #AI initiatives & advancements. It was a pleasure to take a tour of the campuses and engage with emerging business management professionals, gaining insights into their distinctive perspectives on leveraging AI technologies across various industry sectors and enterprises, and discussing the challenges involved. Also, reconnected with old colleagues and school friends in #Boston/ #Cambridge, making this trip even more memorable. Prineeta Kulkarni Lipika Kapoor #AIFirstEnterprise #ThoughtLeadership #BusinessManagement
Experience
Education
Honors & Awards
-
Bell Labs Prize 2022 Runner-up
Nokia Bell Labs
-
Qualcomm Innovation Fellowship Finalist
Qualcomm
-
MS Research Award
Georgia Institute of Technology
Awarded MS Research Award by College of Computing at Georgia Tech
View Vishvak’s full profile
-
See who you know in common
-
Get introduced
-
Contact Vishvak directly
Other similar profiles
Explore more posts
-
Shantanu Dey, PhD
BITS Pilani Work Integrated… • 1K followers
Tried a small experiment on vibe coding for assessing the productivity of GPT-5. Tried similar experiments with Gemini Pro before, and also with perplexity pro. Together, as human-AI collaboration, we built up a system that can automate complex decision-making on faculty timetable preparation for a mid-size institute given all the constraints. This is a non trivial problem for a planner, given the variabilities involved. With a set of carefully written prompts, we ended up with about 500 lines of working code in under ten minutes. It read the data, optimized the schedule, resolved conflicts, and exported a clean result. I spent most of my time stating the rules and checking use cases, not typing syntax or correcting errors. As a trained academician we should not indulge in sweeping generalizations such as "gone are the days of the coders," based on one experiment. There has to be extensive and intensive research with industry-academia collaboration to understand fully the impact of such a tremendous productivity boost. When tools can draft usable scaffolds in minutes, the real leverage moves upstream and downstream: scoping the problem, modeling constraints, validating outputs, and stitching systems together. The value is in understanding the process, the data, the risks, and the goals—then making the whole thing work end to end. This is where academia and industry have to look beyond, starting from how courses can be restructured to become more cross discipline and digital focused to usher in "transformation". For decades, educational institutions—particularly local colleges—have measured success by the number of “job‑ready” developers they produce. This made sense in an era when software development was a scarce and highly manual skill. But in a world where AI can generate production‑grade code in minutes, this approach risks creating a talent surplus in areas where the demand curve is flattening. We should not be preparing our students merely to compete with algorithms—they will lose that race. Instead, the conversation must shift toward digital transformation—not just in the IT sector, but across industries and societies. The opportunities shaped by Industry 4.0 are far broader than writing code. For instance, rather than IT, operational technology OT has a great strategic role to play, and IT OT integration is a vast unexplored realm by behavioral and management science, going beyond the obvious technology. Similarly, although supply chain disruption is a vast research area, it is rarely taught as a curriculum except for being a minor part of OM. Other areas of interest can be interoperability and integration with business partners, Megaproject execution — deploying digital platforms to manage complexity, and risk. Platformization and digital ecosystem creation across technology landscapes are going to of critical essence for businesses, large and small. It is time that we work on these concepts in broader, cross disciplinary ways.
54
7 Comments -
Chris Farish
techire ai • 14K followers
Last year's NeurIPS dropped something quietly brilliant. Most RLHF techniques treat each LLM response in isolation. Works fine for single queries. But what about when an AI needs to guide a conversation towards a goal over multiple turns? Google's MTPO paper tackled this head-on. Instead of rating individual responses, they evaluate entire conversations. A response that seems poor in isolation might be strategically brilliant when you see how the conversation unfolds. Their results were compelling, MTPO significantly outperformed single-turn baselines by learning to plan ahead rather than just optimise locally. With NeurIPS 2025 just weeks away, I'm buzzing to see what breakthroughs are coming next. If last year gave us multi-turn reasoning improvements, what's on the horizon this year? What and who are you hoping to see at this year's conference? Any top tips when heading to your first NeurIPS? 🤗 🛬 #NeurIPS2025 #ReinforcementLearning #LLMReasoning
13
2 Comments -
Shwetank Kumar
5K followers
Yesterday at GTC, NVIDIA announced a CPU optimized for single-thread performance. Surprised? If you've been paying attention to the inference bottleneck, you wouldn’t be. The primary reason - agentic AI broke the fundamental assumption that GPUs do all the heavy work. When you have agent swarms spawning sub-agents, each needing its own thread of orchestration, the CPU can easily become the chokepoint. NVIDIA's own infrastructure lead said so. With 88 Olympus cores, 1.2 TB/s memory bandwidth, Vera CPU is the fastest single-threaded performance CPU on the market. Why single-thread? Because every nanosecond that CPU stalls, your very expensive GPU sits idle waiting for data. Paired with Rubin GPUs over NVLink-C2C, Vera hits 1.8 TB/s of coherent CPU-GPU bandwidth. For context, PCIe Gen 6 doesn't come close. They also unveiled a CPU-only rack — 256 liquid-cooled Vera CPUs, purpose-built for agentic workloads. No GPUs. From the company that made its fortune on GPUs. This is a telling architectural decision in which the company that sells the world's most expensive GPUs just told you that for the next wave of AI, the CPU matters again. Full analysis of every Vera Rubin architectural choice — and why they all point to the same bottleneck — coming to AI Afterhours soon.
13
1 Comment -
Anjali Batra
3K followers
Context Engineering vs. Prompt Engineering Prompt engineering was the starting point of working with LLMs. Context engineering is where serious AI systems begin. Prompt engineering focuses on how we write instructions, especially system prompts to get better outputs from a model. This made sense in the early days, when most LLM use cases were single-shot tasks like classification or text generation. But as we move toward agents that operate over multiple turns, tools, and longer time horizons, prompts alone are no longer sufficient. This is where context engineering comes in. Context engineering is the discipline of curating, maintaining, and refining everything that enters the model’s context window during inference—system instructions, tools, external data, message history, and more. It’s not a one-time activity, it’s an iterative process that happens at every step of an agent’s execution. Why does this matter? Because context is not free. As context windows grow, models experience what’s often called context rot—their ability to retrieve and reason over relevant information degrades as more tokens are added. Like humans, LLMs have a limited attention budget. Every additional token competes for that attention. This limitation is rooted in transformer architectures, where attention scales quadratically with context length. Longer context doesn’t mean unlimited focus—it means harder prioritization. Effective context engineering, then, is about finding the smallest set of high-signal tokens that maximizes the likelihood of the desired outcome. That applies across: System prompts: clear, well-scoped, neither brittle nor vague Tools: minimal, unambiguous, and designed for efficient interaction Examples: a few strong, representative cases instead of exhaustive rule lists Message history: selectively retained and continuously pruned The shift from prompt engineering to context engineering mirrors the shift from chatbots to agents. Writing good prompts is necessary—but managing context intelligently is what enables reliable, scalable, agentic behavior. In short: Prompt engineering tells the model what to do. Context engineering decides what the model gets to know. And that distinction increasingly defines the difference between demos and production-grade AI systems.
30
5 Comments -
Jacob Warren
Rig AI • 7K followers
You don't need a separate reference model for RL. We built an RL pipeline built on a multi-turn-focused modification of Self-Distillation Policy Optimization (SDPO) that uses the same model in two roles: student and teacher. The difference? Context window. → The student generates actions with no knowledge of the outcome. The teacher re-evaluates the same tokens with hindsight: the observation from each action, terminal test output, and a successful rollout from the same prompt. The per-token advantage is just the logit gap between these two views. → This gives you dense, per-token credit assignment instead of a single scalar reward for the whole trajectory. Think tokens get 0.25× weight (loosely guided, not tightly constrained). Action tokens get full gradient. System/user/tool result tokens get nothing. → The "teacher" is not a frozen checkpoint or a separate reward model. It's your current policy, conditioned on richer information. This is well-defined under MoE because same parameters with different routing is normal operating behavior. → We use a tiered reward system with hard gates only for truly unrecoverable failures: sandbox escapes, test corruption, harness crashes, timeouts, infinite loops. Everything else like wrong directory, failed commands, incorrect approaches are learnable signal, not wasted compute. The key insight: most of the information you need for credit assignment is already inside the model. You just need to ask it the right question with the right context. On LiveCodeBench v6 SDPO reached 48.8% vs GRPO's 41.2% beating Claude Sonnet 4 with an 8B model! That's bonkers.
5
-
Shivashish Jaishy
Shristyverse • 2K followers
Stanford just dropped a paradigm-shifting paper : Agentic Context Engineering (ACE), proving that LLMs can get smarter without fine-tuning a single weight. Instead of retraining, ACE lets models iteratively rewrite their own prompts, reflect, and evolve; turning context itself into a living, self-improving knowledge system. The results? +10.6% over GPT-4 agents, +8.6% on financial reasoning, and 86.9% lower cost/latency, all without labeled data. This flips the fine-tuning narrative: the future isn’t about smaller prompts, it’s about denser context. Think of it as the shift from static intelligence to autonomous context evolution, a foundation for real-time, memory-driven AI agents. Paper: https://lnkd.in/gtv8fAZ9
4
-
bByte sized leadership
516 followers
Soumith Chintala, 𝗰𝗿𝗲𝗮𝘁𝗼𝗿 of PyTorch and one of the most respected builders in 𝗺𝗼𝗱𝗲𝗿𝗻 𝗔𝗜, has joined Mira Murati ’s fast-rising startup, Thinking Machines Lab. After 11 years at Meta (Facebook), where PyTorch grew from a 𝗻𝗶𝗰𝗵𝗲 𝗽𝗿𝗼𝗷𝗲𝗰𝘁 𝘁𝗼 𝗽𝗼𝘄𝗲𝗿𝗶𝗻𝗴 𝗴𝗹𝗼𝗯𝗮𝗹 𝗔𝗜 𝗶𝗻𝗻𝗼𝘃𝗮𝘁𝗶𝗼𝗻, Chintala’s move signals a new wave in human–𝗔𝗜 𝗰𝗼𝗹𝗹𝗮𝗯𝗼𝗿𝗮𝘁𝗶𝗼𝗻. Thinking Machines Lab isn’t just gathering top talent from Meta, OpenAI, Anthropic, and leading universities—they’re offering 𝗴𝗮𝗺𝗲-𝗰𝗵𝗮𝗻𝗴𝗶𝗻𝗴 𝘀𝗮𝗹𝗮𝗿𝗶𝗲𝘀 (up to $500K), 𝗮 𝗺𝗮𝘀𝘀𝗶𝘃𝗲 $𝟮 𝗯𝗶𝗹𝗹𝗶𝗼𝗻 𝘀𝗲𝗲𝗱 𝗿𝗼𝘂𝗻𝗱, and are aiming for a $50 billion market valuation. Their new product, Tinker, is already in use at Princeton, Stanford, and by major enterprises. Meta, meanwhile, is regrouping under the 𝗻𝗲𝘄 𝗦𝘂𝗽𝗲𝗿𝗶𝗻𝘁𝗲𝗹𝗹𝗶𝗴𝗲𝗻𝗰𝗲 𝗟𝗮𝗯𝘀 division led by Alexandr Wang, with other AI leaders reportedly on the move. Chintala’s reasoning: time to pursue “𝙨𝙤𝙢𝙚𝙩𝙝𝙞𝙣𝙜 𝙨𝙢𝙖𝙡𝙡, 𝙨𝙤𝙢𝙚𝙩𝙝𝙞𝙣𝙜 𝙣𝙚𝙬, 𝙨𝙤𝙢𝙚𝙩𝙝𝙞𝙣𝙜 𝙪𝙣𝙘𝙤𝙢𝙛𝙤𝙧𝙩𝙖𝙗𝙡𝙚.” His legacy—the PyTorch tool now thriving in classrooms from 𝗠𝗜𝗧 𝘁𝗼 𝗿𝘂𝗿𝗮𝗹 𝗜𝗻𝗱𝗶𝗮—𝗶𝘀 𝘀𝗲𝗰𝘂𝗿𝗲. #AI #TechNews #PyTorch #Meta #ThinkingMachines #Startup #AIJobs #TalentWar #Innovation #BigFunding
8
Explore top content on LinkedIn
Find curated posts and insights for relevant topics all in one place.
View top content