Data-Centric Programming Workshop

January 25, 2014, San Diego, CA, United States
(co-located with POPL 2014)

We're very pleased to announce the Data-Centric Programming Workshop 2014 (DCP2014), an exciting workshop that builds on the success of the Data-Driven Functional Programming (DDFP) workshop at POPL 2013. This workshop is for anyone who loves the application of functional programming (and indeed other programming paradigms as well) to data-rich domains. Please consider submitting to the workshop—whatever your type of data and/or data-centric programming. We want this to be a great event that opens up opportunities at the intersection of data and programming.

Functional programming techniques are increasingly important in data-centric programming. Languages like Haskell, Scala, and C# draw heavily on a range of functional techniques and find application in numerous data-driven domains; paradigms like map/reduce and its extensions lie at the core of modern scalable data processing; and "information-rich" languages like Ur, F#, and Gosu use meta-programming to integrate type-safe queries, web-based APIs, and scalable data sources—along with associated semantically-rich metadata—into the programming language. In principle, the expressiveness, strong typing, and core functional paradigm of these languages make them an ideal choice for expressing robust and scalable data-centric programming.

On the other end, the web of data is growing at an enormous pace, with few dedicated software applications capable of dealing efficiently in information-rich spaces. Reasons for that include one (or more) of the following research issues: lack of integrated development environments (IDEs, such as Microsoft Visual Studio and Eclipse), poor programming language support, lack of standard testbeds and/or benchmarks, inadequate training, and perhaps the need for curriculum revision. Properly addressing these issues requires interdisciplinary skills, and the collaboration between academia and industry.

Many challenges remain.

Workshop goals

This workshop invites submissions that explore the gap between today's data management challenges, particularly the ones related to dealing with large amounts of semantically rich data, and the lack of adequate tools. We are looking for contributions that discuss, promote, and further advance the programming of semantically-rich data, including the development of new languages, extension of existing ones, and the inclusion of semantic-enabled capabilities into existing IDEs.

By devising methods for handling data from the programming level, we can promote the research and development of better data-centric programming technologies as a whole, as well as facilitate the shift towards both principled and effective data-centric computing.


Program committee

Important dates

Talk proposals

We want DCP2014 to be as informal and interactive as possible. The program will thus involve a combination of invited talks, contributed talks about work in progress, and open-ended discussion sessions. There will be no published proceedings, but participants will be invited to submit working documents, talk slides, and so forth to be posted on the workshop website.

We invite proposals for talks in any area related to the connection between programming and data, including, but not limited to:

Talks about work in progress are particularly encouraged. If you have any questions about the relevance of a particular topic, please send email to the program committee (PC) chairs at

We solicit proposals for contributed talks. Proposals should be no longer than two pages, in either plain text or PDF format. We plan to allocate 30-minute talk slots, but proposals for shorter or longer talks will also be considered. Speakers may also submit supplementary material (for example, a full paper or presentation slides) if they desire, which PC members are free (but not expected) to read.

Submission process

Note: The talk proposals will not be published; however, we hope to arrange a special issue of a relevant journal for publication of final papers.


07:30–08:30 Breakfast
09:00–09:30 Introductions
09:30–10:00 LITEQ: Language Integrated Types, Extensions and Queries for RDF Graphs ,
Stefan Scheglmann, Steffen Staab, and Martin Leinberger, University of Koblenz-Landau, Germany
10:00–10:30 Break
10:30–11:00 Language-integrated query using comprehension syntax: state of the art, open problems, and work in progress ,
James Cheney, Sam Lindley, and Philip Wadler, University of Edinburgh, United Kingdom
11:00–11:30 Towards a Core Calculus for XQuery 3.0 ,
Giuseppe Castagna, Hyeonseung Im, Kim Nguyễn, and Véronique Benzaken, Universite Paris-Diderot, France
11:30–12:00 A Functorial Query Language ,
Ryan Wisnesky, Massachussetts Institute of Technology, United States; David Spivak, Harvard University, United States
12:00–13:30 Lunch
13:30–14:30 Invited Talk: R: A quasi-functional, quasi-lazy language for data-centric programming ,
Jan Vitek, Purdue University, United States
14:30–15:00 Data centric programming with HLearn ,
Mike Izbicki, University of California Riverside, United States
15:00–15:30 Break
15:30–16:00 F# Data: Making structured data first class citizens ,
Tomas Petricek, University of Cambridge, United Kingdom
16:00–16:30 Abstraction Without Regret for Efficient Data Processing ,
Tiark Rompf, Oracle Labs and EPFL, Switzerland; Nada Amin, Thierry Coppey, Mohammad Dashti, Manohar Jonnalagedda, Yannis Klonatos, Martin Odersky, and Christoph Koch, EPFL, Switzerland
16:30-17:00 Wrap-up and next steps
Jeremy Gibbons, Oxford University, United Kingdom; Evelyne Viegas, Microsoft Research, United States

Imprint / Data Protection