Search the Dagstuhl Website
Looking for information on the websites of the individual seminars? - Then please:
Not found what you are looking for? - Some of our services have separate websites, each with its own search option. Please check the following list:
Schloss Dagstuhl - LZI - Logo
Schloss Dagstuhl Services
Within this website:
External resources:
  • DOOR (for registering your stay at Dagstuhl)
  • DOSA (for proposing future Dagstuhl Seminars or Dagstuhl Perspectives Workshops)
Within this website:
External resources:
Within this website:
External resources:
  • the dblp Computer Science Bibliography

Dagstuhl Seminar 07491

Mining Programs and Processes

( Dec 02 – Dec 07, 2007 )

(Click in the middle of the image to enlarge)

Please use the following short url to reference this page:




The main goal of the seminar "Mining Programs and Processes" was to create a synergy between researchers of three communities, namely mining software repositories, data mining and machine learning, and empirical software engineering. This goal was only partially met; while we had good response rates from the mining software archives community, we only had little response from the machine learning community. This was due to an unfortunate timing: The Dagstuhl seminar ran at the same time as NIPS 2007, the major machine learning event, and thus had great trouble attracting machine learning researchers. At the time the seminar date was finalized, the NIPS date was not set yet, and when the conflict was discovered, it was too late to change the date of the seminar. We thus lost a number of opportunities for interchange with machine learning ressearchers. Furthermore, one of our organizers (Tao Xie) was unable to come to due Visa issues. Despite these drawbacks, we are convinced that the seminar did generate a deeper understanding of the three communities' research challenges and state-of-the-art works. We thus focused on the interaction between mining software archives and empirical software engineering, with a small influx from the machine learning side.

We had invited a small number of participants to give keynotes that would reach out to different communities; in particular, Katharina Morik showed off the machine learning perspective, and Lionel Briand bridged the gap to empirical software engineering. During the seminar, the participants broke into five working groups, each dedicated to a specific topic. The results of these groups reflect the state of the art, as well as challenges for the future:

  • Issues of Mining Software Repositories
  • What do Developers (really) Need?
  • Empirical Studies and Long-Term Objectives
  • An Infrastructure for Mining Software Repositories
  • Understanding Code Changes

While there clearly is a need for further interaction between machine learning and mining software repositories, the seminar made clear progress into the interaction between mining software archives and empirical software engineering; this interaction is also reflected in the respective venues, which more and more adopt mining and empiricism as standard techniques. Also, the seminar showed inspiring directions in mining itself, as outlined above.

  • Abraham Bernstein (Universität Zürich, CH) [dblp]
  • Lionel C. Briand (Carleton University - Ottawa, CA) [dblp]
  • Irinia Iona Brudaru (Universität des Saarlandes, DE)
  • Jonathan Cook (New Mexico State University, US) [dblp]
  • Krzysztof Czarnecki (University of Waterloo, CA) [dblp]
  • Marco d'Ambros (University of Lugano, CH)
  • Valentin Dallmeier (Universität des Saarlandes, DE) [dblp]
  • Rob DeLine (Microsoft Research - Redmond, US) [dblp]
  • Serge Demeyer (University of Antwerp, BE) [dblp]
  • Premkumar T. Devanbu (University of California - Davis, US) [dblp]
  • Stephan Diehl (Universität Trier, DE) [dblp]
  • Thomas Fritz (University of British Columbia - Vancouver, CA) [dblp]
  • Harald Gall (Universität Zürich, CH) [dblp]
  • Emanuel Giger (Universität Zürich, CH)
  • Michael W. Godfrey (University of Waterloo, CA) [dblp]
  • Miha Grcar (Jozef Stefan Institute - Ljubljana, SI)
  • Ahmed E. Hassan (Queen's University - Kingston, CA) [dblp]
  • Kim Herzig (Universität des Saarlandes, DE) [dblp]
  • Abram Hindle (University of Waterloo, CA) [dblp]
  • Zhen Ming (Jack) Jiang (University of Victoria, CA) [dblp]
  • Miryung Kim (University of Washington - Seattle, US) [dblp]
  • Sunghun Kim (MIT - Cambridge, US) [dblp]
  • Volker Klingspor (Hochschule Bochum, DE)
  • Michele Lanza (University of Lugano, CH) [dblp]
  • Ben Liblit (University of Wisconsin - Madison, US) [dblp]
  • Yana Mileva (Universität des Saarlandes, DE)
  • Katharina Morik (TU Dortmund, DE) [dblp]
  • Martin Pinzger (Universität Zürich, CH) [dblp]
  • Rahul Premraj (Universität des Saarlandes, DE)
  • Martin Robillard (McGill University - Montreal, CA) [dblp]
  • Jelber Sayyad Shirabad (University of Ottawa, CA)
  • David Schuler (Universität des Saarlandes, DE)
  • Gina Venolia (Microsoft Research - Redmond, US)
  • Cristina Videira Lopes (University of California - Irvine, US) [dblp]
  • Cathrin Weiss (Universität Zürich, CH)
  • Michael Wuersch (Universität Zürich, CH)
  • Andreas Zeller (Universität des Saarlandes, DE) [dblp]
  • Thomas Zimmermann (University of Calgary, CA) [dblp]

  • Event Processing (main classification)
  • Artificial Intelligence
  • Databases
  • Programming Languages
  • Semantics/specification/formal methods
  • Software Engineering.

  • Event Processing
  • Real-time Information Systems
  • Reactive systems
  • Proactive systems
  • Active Technologies.