Mining Software Repositories (MSR)

Mining software repositories (MSR) is a software engineering field where software practitioners and researchers use data mining techniques to analyze the data in software repositories to extract useful and actionable information produced by developers during the development process.

Using the Extracted Data

When mining software repositories, the extracted data can be used to discover hidden patterns and trends, support development activities, maintain existing systems or to improve decision-making around future software development and evolution. Typically the data is used to better manage software and to produce higher-quality software systems by analyzing past software development projects.

Types of software repositories include source control repositories, bug repositories, code repositories and even archived developer communications including mailing lists and wiki pages.

MSR Tools

MSR requires tools that extract data and structured code facts from projects. One example of a mining software repositories tool is SeCold, a platform that supports data extraction and on-the-fly inter-dataset integration from major version control, issue tracking, and quality evaluation systems. The dataset contains about two billion facts, such as source code statements, software licenses, and code clones from 18,000 software projects.

Previous articleiPhone X
Next articleCitizen Developer
Vangie Beal
Vangie Beal
Vangie Beal is a freelance business and technology writer covering Internet technologies and online business since the late '90s.

Top Articles

Huge List Of Texting and Online Chat Abbreviations

From A3 to ZZZ we list 1,559 text message and online chat abbreviations to help you translate and understand today's texting lingo. Includes Top...

How To Create A Desktop Shortcut To A Website

This Webopedia guide will show you how to create a desktop shortcut to a website using Firefox, Chrome or Internet Explorer (IE). Creating a desktop...

The History Of Windows Operating Systems

Microsoft Windows is a family of operating systems. We look at the history of Microsoft's Windows operating systems (Windows OS) from 1985 to present...

Hotmail [Outlook] Email Accounts

  By Vangie Beal Hotmail is one of the first public webmail services that can be accessed from any web browser. Prior to Hotmail and its...

Supply Chain Definition &...

A supply chain is a network between an organization and its suppliers to...

Relational Database Definition &...

A relational database stores and connects data in tables and columns, emphasizing the...

Common Business-Oriented Language (COBOL)...

What is COBOL? COBOL stands for Common Business-Oriented Language. It is a 60-year-old programming...