The Community for Technology Leaders
2013 10th Working Conference on Mining Software Repositories (MSR) (2013)
San Francisco, CA, USA USA
May 18, 2013 to May 19, 2013
ISSN: 2160-1852
ISBN: 978-1-4799-0345-0
TABLE OF CONTENTS

Cover page (PDF)

pp. 1

Title page (PDF)

pp. 1-2

Contents (PDF)

pp. 1-4

Welcome from the chairs (PDF)

pp. iii-viii

What is software development productivity, anyway? (Keynote) (PDF)

Gail C. Murphy , Department of Computer Science, University of British Columbia Vancouver, Canada
pp. 1

Why so complicated? Simple term filtering and weighting for location-based bug report assignment recommendation (Abstract)

Ramin Shokripour , Faculty of Computer Science & Information Technology, University of Malaya, Kuala Lumpur, Malaysia
John Anvik , Department of Computer Science Central Washington University, Ellensburg, Washington, USA
Zarinah M. Kasirun , Faculty of Computer Science & Information Technology, University of Malaya, Kuala Lumpur, Malaysia
Sima Zamani , Faculty of Computer Science & Information Technology, University of Malaya, Kuala Lumpur, Malaysia
pp. 2-11

Which work-item updates need your response? (Abstract)

Debdoot Mukherjee , IBM-Research-India, New Delhi, India
Malika Garg , Indian Institute of Technology, Delhi, New Delhi, India
pp. 12-21

Bug report assignee recommendation using activity profiles (Abstract)

Hoda Naguib , Institut für Informatik, Technische Universität München, Boltzmannstrasse 3, 87548 Garching, Germany
Nitesh Narayan , Institut für Informatik, Technische Universität München, Boltzmannstrasse 3, 87548 Garching, Germany
Bernd Brugge , Institut für Informatik, Technische Universität München, Boltzmannstrasse 3, 87548 Garching, Germany
Dina Helal , Department of Computer Science, German University in Cairo, Cairo, Egypt
pp. 22-30

Asking for (and about) permissions used by Android apps (Abstract)

Ryan Stevens , University of California, Davis Davis, CA, USA
Jonathan Ganz , University of California, Davis Davis, CA, USA
Vladimir Filkov , University of California, Davis Davis, CA, USA
Premkumar Devanbu , University of California, Davis Davis, CA, USA
Hao Chen , University of California, Davis Davis, CA, USA
pp. 31-40

Retrieving and analyzing mobile apps feature requests from online reviews (Abstract)

Claudia Iacob , Department of Computing and Communication Technologies, Oxford Brookes University, Oxford, United Kingdom
Rachel Harrison , Department of Computing and Communication Technologies, Oxford Brookes University, Oxford, United Kingdom
pp. 41-44

Gerrit software code review data from Android (Abstract)

Murtuza Mukadam , Concordia University Montreal, QC, Canada
Christian Bird , Microsoft Research, Redmond, WA, USA
Peter C. Rigby , Concordia University Montreal, QC, Canada
pp. 45-48

Who does what during a code review? Datasets of OSS peer review repositories (Abstract)

Kazuki Hamasaki , NAIST, Japan
Raula Gaikovina Kula , NAIST, Japan
Norihiro Yoshida , NAIST, Japan
A. E. Camargo Cruz , NAIST, Japan
Kenji Fujiwara , NAIST, Japan
Hajimu Iida , NAIST, Japan
pp. 49-52

Why, when, and what: Analyzing Stack Overflow questions by topic, type, and code (Abstract)

Miltiadis Allamanis , School of Informatics, University of Edinburgh, Edinburgh EH8 9AB, UK
Charles Sutton , School of Informatics, University of Edinburgh, Edinburgh EH8 9AB, UK
pp. 53-56

Deficient documentation detection a methodology to locate deficient project documentation using topic analysis (Abstract)

Joshua Charles Campbell , Department of Computing Science
Chenlei Zhang , Department of Computing Science
Zhen Xu , Department of Electrical and Computer Engineering, University of Alberta, Edmonton, Canada
Abram Hindle , Department of Computing Science
James Miller , Department of Electrical and Computer Engineering, University of Alberta, Edmonton, Canada
pp. 57-60

Detecting API usage obstacles: A study of iOS and Android developer questions (Abstract)

Wei Wang , David R. Cheriton School of Computer Science University of Waterloo, Waterloo, ON, Canada
Michael W. Godfrey , David R. Cheriton School of Computer Science University of Waterloo, Waterloo, ON, Canada
pp. 61-64

Encouraging user behaviour with achievements: An empirical study (Abstract)

Scott Grant , School of Computing, Queen's University, Kingston, Ontario, Canada
Buddy Betts , OUYA, Inc., Santa Monica, California, USA
pp. 65-68

Is programming knowledge related to age? An exploration of stack overflow (Abstract)

Patrick Morrison , Department of Computer Science, North Carolina State University, Raleigh, NC, USA
Emerson Murphy-Hill , Department of Computer Science, North Carolina State University, Raleigh, NC, USA
pp. 69-72

A discriminative model approach for suggesting tags automatically for Stack Overflow questions (Abstract)

Avigit K. Saha , University of Saskatchewan, Canada
Ripon K. Saha , The University of Texas at Austin, USA
Kevin A. Schneider , University of Saskatchewan, Canada
pp. 73-76

Exploring activeness of users in QA forums (Abstract)

Vibha Singhal Sinha , IBM Research - New Delhi, India
Senthil Mani , IBM Research - New Delhi, India
Monika Gupta , IBM Research - New Delhi, India
pp. 77-80

A study of innovation diffusion through link sharing on stack overflow (Abstract)

Carlos Gomez , University of Victoria Victoria, BC, Canada
Brendan Cleary , University of Victoria Victoria, BC, Canada
Leif Singer , University of Victoria Victoria, BC, Canada
pp. 81-84

Making sense of online code snippets (Abstract)

Siddharth Subramanian , School of Computer Science, University of Waterloo, Waterloo, ON, Canada
Reid Holmes , School of Computer Science, University of Waterloo, Waterloo, ON, Canada
pp. 85-88

Building reputation in StackOverflow: An empirical investigation (Abstract)

Amiangshu Bosu , Department of Computer Science, The University of Alabama Tuscaloosa, AL 35487-0290, USA
Christopher S. Corley , Department of Computer Science, The University of Alabama Tuscaloosa, AL 35487-0290, USA
Dustin Heaton , Department of Computer Science, The University of Alabama Tuscaloosa, AL 35487-0290, USA
Debarshi Chatterji , Department of Computer Science, The University of Alabama Tuscaloosa, AL 35487-0290, USA
Jeffrey C. Carver , Department of Computer Science, The University of Alabama Tuscaloosa, AL 35487-0290, USA
Nicholas A. Kraft , Department of Computer Science, The University of Alabama Tuscaloosa, AL 35487-0290, USA
pp. 89-92

An exploratory analysis of mobile development issues using stack overflow (Abstract)

Mario Linares-Vasquez , Computer Science Department, The College of William and Mary, Williamsburg, VA, USA
Bogdan Dit , Computer Science Department, The College of William and Mary, Williamsburg, VA, USA
Denys Poshyvanyk , Computer Science Department, The College of William and Mary, Williamsburg, VA, USA
pp. 93-96

Answering questions about unanswered questions of Stack Overflow (Abstract)

Muhammad Asaduzzaman , Department of Computer Science, University of Saskatchewan, Canada
Ahmed Shah Mashiyat , Department of Computer Science, University of Toronto, Canada
Chanchal K. Roy , Department of Computer Science, University of Saskatchewan, Canada
Kevin A. Schneider , Department of Computer Science, University of Saskatchewan, Canada
pp. 97-100

Will my patch make it? And how fast? Case study on the Linux kernel (Abstract)

Yujuan Jiang , MCIS, Polytechnique Montréal, Canada
Bram Adams , MCIS, Polytechnique Montréal, Canada
Daniel M. German , University of Victoria, Canada
pp. 101-110

Linux variability anomalies: What causes them and how do they get fixed? (Abstract)

Sarah Nadi , David R. Cheriton School of Computer Science, University of Waterloo, Canada
Christian Dietrich , Friedrich-Alexander University Erlangen-Nuremberg, Germany
Reinhard Tartler , Friedrich-Alexander University Erlangen-Nuremberg, Germany
Richard C. Holt , David R. Cheriton School of Computer Science, University of Waterloo, Canada
Daniel Lohmann , Friedrich-Alexander University Erlangen-Nuremberg, Germany
pp. 111-120

The impact of tangled code changes (Abstract)

Kim Herzig , Microsoft Research, Cambridge, UK
Andreas Zeller , Saarland University, Saarbrücken, Germany
pp. 121-130

A dataset from change history to support evaluation of software maintenance tasks (Abstract)

Bogdan Dit , Computer Science Department, The College of William and Mary, Williamsburg, VA, USA
Andrew Holtzhauer , Computer Science Department, The College of William and Mary, Williamsburg, VA, USA
Denys Poshyvanyk , Computer Science Department, The College of William and Mary, Williamsburg, VA, USA
Huzefa Kagdi , Department of Electrical Engineering and Computer Science, Wichita State University, Wichita, KS 67260-0083
pp. 131-134

Apache commits: Social network dataset (Abstract)

Alexander C. MacLean , Dept. of Computer Science, Brigham Young University, USA, 2204 TMCB, Provo, UT 84602
Charles D. Knutson , Dept. of Computer Science, Brigham Young University, USA, 2204 TMCB, Provo, UT 84602
pp. 135-138

Understanding the evolution of Type-3 clones: An exploratory study (Abstract)

Ripon K. Saha , The University of Texas at Austin, USA
Chanchal K. Roy , University of Saskatchewan, Canada
Kevin A. Schneider , University of Saskatchewan, Canada
Dewayne E. Perry , The University of Texas at Austin, USA
pp. 139-148

An empirical study of the fault-proneness of clone mutation and clone migration (Abstract)

Shuai Xie , Department of Electrical and Computer Engineering, Queen's University, Canada
Foutse Khomh , SWAT, Polytechnique Montréal, QC, Canada
Ying Zou , Department of Electrical and Computer Engineering, Queen's University, Canada
pp. 149-158

Intensive metrics for the study of the evolution of open source projects: Case studies from Apache Software Foundation projects (Abstract)

Santiago Gala-Perez , Apache Software Foundation, Spain
Gregorio Robles , GSyC/LibreSoft, Universidad Rey Juan Carlos, Spain
Jesus M. Gonzalez-Barahona , GSyC/LibreSoft, Universidad Rey Juan Carlos, Spain
Israel Herraiz , Universidad Politécnica de Madrid, Spain
pp. 159-168

A preliminary investigation of using age and distance measures in the detection of evolutionary couplings (Abstract)

Abdulkareem Alali , Department of Computer Science, Kent State University Kent, Ohio 44242, USA
Brian Bartman , Department of Computer Science, Kent State University Kent, Ohio 44242, USA
Christian D. Newman , Department of Computer Science, Kent State University Kent, Ohio 44242, USA
Jonathan I. Maletic , Department of Computer Science, Kent State University Kent, Ohio 44242, USA
pp. 169-172

Search-based duplicate defect detection: An industrial experience (Abstract)

Mehdi Amoui , University of Waterloo Waterloo, Canada
Nilam Kaushik , University of Waterloo Waterloo, Canada
Abraham Al-Dabbagh , University of Waterloo Waterloo, Canada
Ladan Tahvildari , University of Waterloo Waterloo, Canada
Shimin Li , BlackBerry Inc., Waterloo, Canada
Weining Liu , BlackBerry Inc., Waterloo, Canada
pp. 173-182

A contextual approach towards more accurate duplicate bug report detection (Abstract)

Anahita Alipour , Department of Computing Science, University of Alberta, Edmonton, Canada
Abram Hindle , Department of Computing Science, University of Alberta, Edmonton, Canada
Eleni Stroulia , Department of Computing Science, University of Alberta, Edmonton, Canada
pp. 183-192

Bug resolution catalysts: Identifying essential non-committers from bug repositories (Abstract)

Senthil Mani , IBM Research - New Delhi, India
Seema Nagar , IBM Research - New Delhi, India
Debdoot Mukherjee , IBM Research - New Delhi, India
Ramasuri Narayanam , IBM Research - New Delhi, India
Vibha Singhal Sinha , IBM Research - New Delhi, India
Amit A. Nanavati , IBM Research - New Delhi, India
pp. 193-202

The Eclipse and Mozilla defect tracking dataset: A genuine dataset for mining bug information (Abstract)

Ahmed Lamkanfi , University of Antwerp, Belgium
Javier Perez , University of Antwerp, Belgium
Serge Demeyer , University of Antwerp, Belgium
pp. 203-206

Mining source code repositories at massive scale using language modeling (Abstract)

Miltiadis Allamanis , School of Informatics, University of Edinburgh, Edinburgh EH8 9AB, UK
Charles Sutton , School of Informatics, University of Edinburgh, Edinburgh EH8 9AB, UK
pp. 207-216

Do software categories impact coupling metrics? (Abstract)

Lucas Batista Leite de Souza , Computer Science Department, Federal University of Uberlândia, Uberlândia, Brazil
Marcelo de Almeida Maia , Computer Science Department, Federal University of Uberlândia, Uberlândia, Brazil
pp. 217-220

The Maven repository dataset of metrics, changes, and dependencies (Abstract)

Steven Raemaekers , Software Improvement Group, Amsterdam, The Netherlands
Arie van Deursen , Delft University of Technology, Delft, The Netherlands
Joost Visser , Software Improvement Group, Amsterdam, The Netherlands
pp. 221-224

A historical dataset for the Gnome ecosystem (Abstract)

Mathieu Goeminne , Software Engineering Lab, COMPLEXYS research institute, UMONS, Belgium
Maelick Claes , Software Engineering Lab, COMPLEXYS research institute, UMONS, Belgium
Tom Mens , Software Engineering Lab, COMPLEXYS research institute, UMONS, Belgium
pp. 225-228

A network of Rails a graph dataset of Ruby on Rails and associated projects (Abstract)

Patrick Wagstrom , IBM TJ Watson Research Center, Yorktown Heights, NY, USA
Corey Jergensen , Computer Science and Engineering University of Nebraska - Lincoln, Lincoln, NE, USA
Anita Sarma , Computer Science and Engineering University of Nebraska - Lincoln, Lincoln, NE, USA
pp. 229-232

The GHTorent dataset and tool suite (Abstract)

Georgios Gousios , Software Engineering Research Group, Delft University of Technology, Delft, The Netherlands
pp. 233-236

Discovering, reporting, and fixing performance bugs (Abstract)

Adrian Nistor , University of Illinois at Urbana-Champaign
Tian Jiang , University of Waterloo
Lin Tan , University of Waterloo
pp. 237-246

Improving bug localization using correlations in crash reports (Abstract)

Shaohua Wang , School of Computing, Queen's University Kingston, ON, Canada
Foutse Khomh , SWAT Lab, DGIGL, Polytechnique Montréal, Montréal, QC, Canada
Ying Zou , Electronical and Computer Engineering, Queen's University, Kingston, ON, Canada
pp. 247-256

Testing principles, current practices, and effects of change localization (Abstract)

Steven Raemaekers , Software Improvement Group, Amsterdam, The Netherlands
Gabriela F. Nane , Delft Institute of Applied Mathematics, Delft, The Netherlands
Arie van Deursen , Delft University of Technology, Delft, The Netherlands
Joost Visser , Software Improvement Group, Amsterdam, The Netherlands
pp. 257-266

Fixing the ‘Out of sight out of mind’ problem one year of mood-based microblogging in a distributed software team (Abstract)

Kevin Dullemond , Delft University of Technology, IHomer, The Netherlands
Ben van Gameren , Delft University of Technology, IHomer, The Netherlands
Margaret-Anne Storey , Univerity of Victoria, Canada
Arie van Deursen , Delft University of Technology, The Netherlands
pp. 267-276

Communication in open source software development mailing lists (Abstract)

Anja Guzzi , Department of Software and Computer Technology - Delft University of Technology, The Netherlands
Alberto Bacchelli , REVEAL @ Faculty of Informatics - University of Lugano, Switzerland
Michele Lanza , REVEAL @ Faculty of Informatics - University of Lugano, Switzerland
Martin Pinzger , Institute for Informatics Systems - University of Klagenfurt, Austria
Arie van Deursen , Department of Software and Computer Technology - Delft University of Technology, The Netherlands
pp. 277-286

Tag recommendation in software information sites (Abstract)

Xin Xia , College of Computer Science and Technology, Zhejiang University
David Lo , School of Information Systems, Singapore Management University
Xinyu Wang , College of Computer Science and Technology, Zhejiang University
Bo Zhou , College of Computer Science and Technology, Zhejiang University
pp. 287-296

Using developer interaction data to compare expertise metrics (Abstract)

Romain Robbes , PLEIAD Laboratory Computer Science Department University of Chile
David Rothlisberger , Computer Science Department Universidad Técnica Federico, Santa Maria, Chile
pp. 297-300

Project roles in the Apache Software Foundation: A dataset (Abstract)

Megan Squire , Dept. of Computing Sciences, Elon University, Elon, NC, USA
pp. 301-304

Apache-affiliated Twitter screen names: A dataset (Abstract)

Megan Squire , Dept. of Computing Sciences, Elon University, Elon, NC, USA
pp. 305-308

Assisting code search with automatic Query Reformulation for bug localization (Abstract)

Bunyamin Sisman , Purdue University, West Lafayette, IN
Avinash C. Kak , Purdue University, West Lafayette, IN
pp. 309-318

Mining succinct and high-coverage API usage patterns from source code (Abstract)

Jue Wang , Tsinghua Univ., China
Yingnong Dang , Microsoft Research Asia, China
Hongyu Zhang , Tsinghua Univ., China
Kai Chen , Peking Univ. China
Tao Xie , NC State University, NC, USA
Dongmei Zhang , Microsoft Research Asia, China
pp. 319-328

Rendezvous: A search engine for binary code (Abstract)

Wei Ming Khoo , University of Cambridge, UK
Alan Mycroft , University of Cambridge, UK
Ross Anderson , University of Cambridge, UK
pp. 329-338

An unabridged source code dataset for research in software reuse (Abstract)

Werner Janjic , Software-Engineering Group, University of Mannheim, Germany
Oliver Hummel , Institute for Program Structures and Data Organization, Karlsruhe Institute of Technology, Germany
Marcus Schumacher , Software-Engineering Group, University of Mannheim, Germany
Colin Atkinson , Software-Engineering Group, University of Mannheim, Germany
pp. 339-342

The MSR Cookbook: Mining a decade of research (Abstract)

Hadi Hemmati , Software Architecture Group, David R. Cheriton School of Computer Science, University of Waterloo, Canada
Sarah Nadi , Software Architecture Group, David R. Cheriton School of Computer Science, University of Waterloo, Canada
Olga Baysal , Software Architecture Group, David R. Cheriton School of Computer Science, University of Waterloo, Canada
Oleksii Kononenko , Software Architecture Group, David R. Cheriton School of Computer Science, University of Waterloo, Canada
Wei Wang , Software Architecture Group, David R. Cheriton School of Computer Science, University of Waterloo, Canada
Reid Holmes , Software Architecture Group, David R. Cheriton School of Computer Science, University of Waterloo, Canada
Michael W. Godfrey , Software Architecture Group, David R. Cheriton School of Computer Science, University of Waterloo, Canada
pp. 343-352

Happy Birthday! A trend analysis on past MSR papers (Abstract)

Serge Demeyer , University of Antwerp Antwerp, Belgium
Alessandro Murgia , University of Antwerp Antwerp, Belgium
Kevin Wyckmans , University of Antwerp Antwerp, Belgium
Ahmed Lamkanfi , University of Antwerp Antwerp, Belgium
pp. 353-362

Replicating mining studies with SOFAS (Abstract)

Giacomo Ghezzi , s.e.a.l. - software evolution and architecture lab, Department of Informatics, University of Zurich, Switzerland
Harald C. Gall , s.e.a.l. - software evolution and architecture lab, Department of Informatics, University of Zurich, Switzerland
pp. 363-372

A historical dataset of software engineering conferences (Abstract)

Bogdan Vasilescu , Model-Driven Software Engineering, Department of Mathematics and Computer Science, Eindhoven University of Technology, The Netherlands
Alexander Serebrenik , Model-Driven Software Engineering, Department of Mathematics and Computer Science, Eindhoven University of Technology, The Netherlands
Tom Mens , Software Engineering Lab, COMPLEXYS Research Institute, University of Mons, Belgium
pp. 373-376

Automatically mining software-based, semantically-similar words from comment-code mappings (Abstract)

Matthew J. Howard , Department of Computer and Information Sciences, University of Delaware, Newark, DE 19716 USA
Samir Gupta , Department of Computer and Information Sciences, University of Delaware, Newark, DE 19716 USA
Lori Pollock , Department of Computer and Information Sciences, University of Delaware, Newark, DE 19716 USA
K. Vijay-Shanker , Department of Computer and Information Sciences, University of Delaware, Newark, DE 19716 USA
pp. 377-386

Strategies for avoiding text fixture smells during software evolution (Abstract)

Michaela Greiler , Delft University of Technology, The Netherlands
Andy Zaidman , Delft University of Technology, The Netherlands
Arie van Deursen , Delft University of Technology, The Netherlands
Margaret-Anne Storey , University of Victoria, Canada
pp. 387-396

Contextual analysis of program logs for understanding system behaviors (Abstract)

Qiang Fu , Microsoft Research Asia, Beijing, China
Jian-Guang Lou , Microsoft Research Asia, Beijing, China
Qingwei Lin , Microsoft Research Asia, Beijing, China
Rui Ding , Microsoft Research Asia, Beijing, China
Dongmei Zhang , Microsoft Research Asia, Beijing, China
Tao Xie , Department of Computer Science North Carolina State University, NC, USA
pp. 397-400

A dataset for evaluating identifier splitters (Abstract)

Dave Binkley , Loyola University Maryland, Baltimore, MD 21210
Dawn Lawrie , Loyola University Maryland, Baltimore, MD 21210
Lori Pollock , University of Delaware, Newark, DE 19716
Emily Hill , Montclair State University, Montclair, NJ, 07043
K. Vijay-Shanker , University of Delaware, Newark, DE 19716
pp. 401-404

INVocD: Identifier name vocabulary dataset (Abstract)

Simon Butler , Centre for Research in Computing, Department of Computing, The Open University, Milton Keynes, United Kingdom
Michel Wermelinger , Centre for Research in Computing, Department of Computing, The Open University, Milton Keynes, United Kingdom
Yijun Yu , Centre for Research in Computing, Department of Computing, The Open University, Milton Keynes, United Kingdom
Helen Sharp , Centre for Research in Computing, Department of Computing, The Open University, Milton Keynes, United Kingdom
pp. 405-408

Better cross company defect prediction (Abstract)

Fayola Peters , Lane Department of CS & EE, West Virginia University, USA
Tim Menzies , Lane Department of CS & EE, West Virginia University, USA
Andrian Marcus , Computer Science, Wayne State University, USA
pp. 409-418

Using citation influence to predict software defects (Abstract)

Wei Hu , Department of Computing Science, University of Alberta, Edmonton, Canada
Kenny Wong , Department of Computing Science, University of Alberta, Edmonton, Canada
pp. 419-428

Revisiting software development effort estimation based on early phase development activities (Abstract)

Masateru Tsunoda , Toyo University Saitama, Japan
Yasutaka Kamei , Kyushu University Fukuoka, Japan
Koji Toda , Fukuoka Institute of Technology Fukuoka, Japan
Meiyappan Nagappan , Queen's University Ontario, Canada
Kyohei Fushida , NTT DATA Corporation Tokyo, Japan
Naoyasu Ubayashi , Kyushu University Fukuoka, Japan
pp. 429-438
99 ms
(Ver )