MapReduce Program Synthesis (PLDI 2016 - Research Papers)

Mon 13 - Fri 17 June 2016 Santa Barbara, California, United States

Who

Calvin Smith, Aws Albarghouthi

Track

PLDI 2016 Research Papers

Time Zone

The program is currently displayed in (GMT-07:00) Tijuana, Baja California.

Use conference time zone: (GMT-07:00) Tijuana, Baja CaliforniaSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Thu 16 Jun 2016 13:30 - 14:00 at Grand Ballroom Santa Ynez - Synthesis I Chair(s): Eran Yahav

Abstract

By abstracting away the complexity of distributed systems, large-scale data processing platforms—MapReduce, Hadoop, Spark, Dryad, etc.—have provided developers with simple means for harnessing the power of the cloud. In this paper, we ask whether we can automatically synthesize MapReduce-style distributed programs from input–output examples. Our ultimate goal is to enable end users to specify large-scale data analyses through the simple interface of examples. We thus present a new algorithm and tool for synthesizing programs composed of efficient data-parallel operations that can execute on cloud computing infrastructure. We evaluate our tool on a range of real-world big-data analysis tasks and general computations. Our results demonstrate the efficiency of our approach and the small number of examples it requires to synthesize correct, scalable programs.

Calvin Smith

University of Wisconsin - Madison

Aws Albarghouthi

University of Wisconsin–Madison

MapReduce Program Synthesis - Calvin Smith