The art and science of analyzing software data

Tim Menzies, Christian Bird, Thomas Zimmermann, Leandro L. Minku, Burak Turhan

Research output: Chapter in Book/Report/Conference proceeding, Conference Paper, Other, peer-review

Abstract

In the age of big data, data science is an essential skill for software engineers. It can be used to predict useful information about new projects from completed projects. This tutorial reflects on the state of the art in this important field.
Before data mining, this tutorial discusses the tasks needed to deploy data mining algorithms in organizations, including how to determine the information needs of particular managers.
During data mining, this tutorial discusses the following: (a) when studying particular organizations, how to use surveys and interviews to guide data analysis; (b) when local data is scarce, how to adapt data from other organizations to local problems; (c) when working with data of dubious quality, how to prune spurious information; (d) when data or models seem too complex, how to simplify data mining results; (e) when the world changes and old models need to be updated, how to handle those updates; (f) when the effect is too complex for one model, how to reason over ensembles.
Target audience: Software practitioners and researchers wanting to understand the state of the art in using data mining for software engineering (SE) data.
Pre-requisites: This tutorial makes minimal use of maths or advanced algorithms and should be understandable to developers and technical managers.
Original language: English
Title of host publication: ICSE’14 Tutorial Briefing (proposed length: half day) TUT: The Art and Science of Analyzing Software Data
Subtitle of host publication: May 31 – June 7, 2014, Hyderabad, India
Editors: Pankaj Jalote, Lionel Briand, André van der Hoek
Place of Publication: New York NY USA
Publisher: Association for Computing Machinery (ACM)
Number of pages: 4
ISBN (Electronic): 9781450327565
Publication status: Published - 2014
Externally published: Yes
Event: International Conference on Software Engineering 2014 - Hyderabad, India
Duration: 31 May 2014 to 7 Jun 2014
Conference number: 36th
http://2014.icse-conferences.org/

Conference

Conference: International Conference on Software Engineering 2014
Abbreviated title: ICSE 2014
Country: India
City: Hyderabad
Period: 31/05/14 to 7/06/14
Internet address

Cite this

Menzies, T., Bird, C., Zimmermann, T., Minku, L. L., & Turhan, B. (2014). The art and science of analyzing software data. In P. Jalote, L. Briand, & A. van der Hoek (Eds.), ICSE’14 Tutorial Briefing (proposed length: half day) TUT: The Art and Science of Analyzing Software Data: May 31 – June 7, 2014, Hyderabad, India. Association for Computing Machinery (ACM). https://dl.acm.org/citation.cfm?id=2886235