People

Kaushik Veeraraghavan

Software Engineer

I work on large scale distributed systems, especially on improving reliability, efficiency and scalability. My current focus is on building out Facebook’s shared compute and storage platform as part of our Infrastructure-as-a-Service effort. My previous projects include:

  • Fault tolerance and disaster readiness
  • Data center capacity management
  • Traffic management
  • End-to-end performance tracing and analysis
  • Data consistency
  • Cache consistency

I received my BS in Computer Engineering from the University of Maryland, College Park in 2003. After a short stint in industry, I obtained my PhD in Computer Science at the University of Michigan, Ann Arbor in September 2011.

Interests

Distributed systems, fault tolerance, data center engineering, reliability, efficiency, scalability

Related Links

Google Scholar

Latest Publications

Taiji: Managing Global User Traffic for Large-Scale Internet Services at the Edge

David Chou, Tianyin Xu, Kaushik Veeraraghavan, Andrew Newell, Sonia Margulis, Lin Xiao, Pol Mauri Ruiz, Justin Meza, Kiryong Ha, Shruti Padmanabha, Kevin Cole, Dmitri Perelman

SOSP - October 29, 2019

A Large Scale Study of Data Center Network Reliability

Justin Meza, Tianyin Xu, Kaushik Veeraraghavan, Onur Mutlu

IMC - October 31, 2018

Maelstrom: Mitigating Datacenter-level Disasters by Draining Interdependent Traffic Safely and Efficiently

Kaushik Veeraraghavan, Justin Meza, Scott Michelson, Sankaralingam Panneerselvam, Alex Gyori, David Chou, Sonia Margulis, Daniel Obenshain, Ashish Shah, Yee Jiun Song, Tianyin Xu

OSDI - October 9, 2018

Canopy: An End-to-End Performance Tracing and Analysis System

Jonathan Kaldor, Jonathan Mace, Michał Bejda, Edison Gao, Wiktor Kuropatwa, Joe O’Neill, Kian Win Ong, Bill Schaller, Pingjia Shan, Brendan Viscomi, Vinod Venkataraman, Kaushik Veeraraghavan, Yee Jiun Song

SOSP 2017 - October 28, 2017

DQBarge: Improving Data-Quality Tradeoffs in Large-Scale Internet Services

Michael Chow, Kaushik Veeraraghavan, Michael Cafarella, Jason Flinn

OSDI 2016 - November 2, 2016

Kraken: Leveraging Live Traffic Tests to Identify and Resolve Resource Utilization Bottlenecks in Large Scale Web Services

Kaushik Veeraraghavan, Justin Meza, David Chou, Wonho Kim, Sonia Margulis, Scott Michelson, Rajesh Nishtala, Daniel Obenshain, Dmitri Perelman, Yee Jiun Song

OSDI - November 2, 2016

Existential Consistency: Measuring and Understanding Consistency at Facebook

Haonan Lu, Kaushik Veeraraghavan, Philippe Ajoux, Jim Hunt, Yee Jiun Song, Wendy Tobagus, Sanjeev Kumar, Wyatt Lloyd

SOSP'15 - October 4, 2015

Challenges to Adopting Stronger Consistency at Scale

Philippe Ajoux, Nathan Bronson, Sanjeev Kumar, Wyatt Lloyd, Kaushik Veeraraghavan

HotOS - May 19, 2015

Wormhole: Reliable Pub-Sub to Support Geo-replicated Internet Services

Yogeshwer Sharma, Philippe Ajoux, Petchean Ang, David Callies, Abhishek Choudhary, Laurent Demailly, Thomas Fersch, Liat Atsmon, Andrzej Kotulski, Sachin Kulkarni, Sanjeev Kumar, Harry Li, Jun Li, Evgeniy Makeev, Kowshik Prakasam, Robbert van Renesse, Sabyasachi Roy, Pratyush Seth, Yee Jiun Song, Benjamin Wester, Kaushik Veeraraghavan, Peter Xie

NSDI ’15 - May 6, 2015