April 25, 2012

PACMan: Coordinated Memory Caching for Parallel Jobs

USENIX Symposium on Networked Systems Design and Implementation (NSDI)

Data-intensive analytics on large clusters is important for modern Internet services. As machines in these clusters have large memories, in-memory caching of inputs is an effective way to speed up the…

By: Ganesh Ananthanarayanan, Ali Ghodsi, Andrew Wang, Dhruba Borthakur, Srikanth Kandula, Scott Shenker, Ion Stoica

April 17, 2012

Structural Diversity in Social Contagion

Proceedings of the National Academy of Sciences (PNAS)

The concept of contagion has steadily expanded from its original grounding in epidemic disease to describe a vast array of processes that spread across networks, notably social phenomena such as fads,…

By: Johan Ugander, Lars Backstrom, Cameron Marlow, Jon Kleinberg

April 16, 2012

The Role of Social Networks in Information Diffusion

International World Wide Web Conference (WWW)

Online social networking technologies enable individuals to simultaneously share information with any number of peers. Quantifying the causal effect of these mediums on the dissemination of informatio…

By: Eytan Bakshy, Itamar Rosenn, Cameron Marlow, Lada Adamic

March 1, 2012

Bootstrapping Data Arrays of Arbitrary Order

The Annals of Applied Statistics (AOAS)

In this paper we study a bootstrap strategy for estimating the variance of a mean taken over large multifactor crossed random effects data sets. We apply bootstrap reweighting independently to the lev…

By: Art B. Owen, Dean Eckles

March 1, 2012

Predicting Memcache Throughput using Simulation and Modeling

IEEE Symposium on Theory of Modeling and Simulation (TMS)

The current work introduces a method for predicting Memcached throughput on single-core and multi-core processors. The method is based on traces collected from a full system simulator running Memcached.

By: Steven Hart, Eitan Frachtenberg, Mateusz Berezecki

December 23, 2011

High-efficiency server design

ACM Conference on Supercomputing (ICS)

Large-scale datacenters consume megawatts in power and cost hundreds of millions of dollars to equip. Reducing the energy and cost footprint of servers can therefore have substantial impact.

By: Eitan Frachtenberg, Ali Heydari, Hu Li, Amir Michael, Jacob Na, Avery Nisbet, Pierluigi Sarti

December 1, 2011

Performance of an online translation tool when applied to patient educational material

Journal of Hospital Medicine

We evaluate the accuracy of state-of-the-art online machine translation systems for translating patient educational material.

By: Raman R. Khanna, Leah S. Karliner, Matthias Eck, Eric Vittinghoff, Christopher J. Koenig, Margaret C. Fang

August 15, 2011

Phonetic Classification Using Controlled Random Walks

Conference of the International Speech Communication Association (Interspeech)

Recently, semi-supervised learning algorithms for phonetic classifiers have been proposed that have obtained promising results. Often, these algorithms attempt to satisfy learning criteria that are not inherent in the standard generative or discriminative training procedures for phonetic classifiers.

By: Katrin Kirchhoff, Andrei Alexandrescu

July 24, 2011

Learning Relevance from a Heterogeneous Social Network and Its Application in Online Targeting

ACM Special Interest Group on Information Retrieval (SIGIR)

The rise of social networking services in recent years presents new research challenges for matching users with interesting content. While the content-rich nature of these social networks offers many…

By: Chi Wang, Rajat Raina, David Fong, Ding Zhou, Jiawei Han, Greg Badros

July 17, 2011

Dimensions of Self-Expression in Facebook Status Updates

AAAI International Conference on Weblogs and Social Media (ICWSM)

We describe the dimensions along which Facebook users tend to express themselves via status updates using the semi-automated text analysis approach, the Meaning Extraction Method (MEM).

By: Adam D. I. Kramer, Cindy K. Chung