Mining Software Repositories (MSR)

Mining software repositories (MSR) is a software engineering field in which practitioners and researchers apply data mining techniques to the data that developers produce during the development process and store in software repositories, with the goal of extracting useful and actionable information.

Using the Extracted Data

When mining software repositories, the extracted data can be used to discover hidden patterns and trends, support development activities, maintain existing systems, or improve decision-making around future software development and evolution. Typically, the data is used to manage software better and to produce higher-quality software systems by analyzing past software development projects.
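As a rough illustration of discovering a pattern in repository data, the sketch below counts how often each file appears in a version control history; files that change very often are sometimes treated as candidates for closer review. It assumes the input text has the shape produced by `git log --name-only --pretty=format:--%H` (a `--<hash>` line per commit followed by the paths it touched). The function name `count_file_changes` and the sample log are hypothetical and exist only for this example.

```python
from collections import Counter


def count_file_changes(log_text: str) -> Counter:
    """Count how many commits touched each file path.

    Assumes each commit starts with a marker line beginning with "--"
    and is followed by one changed file path per line.
    """
    changes = Counter()
    for line in log_text.splitlines():
        line = line.strip()
        # Skip blank separator lines and the "--<hash>" commit markers.
        if not line or line.startswith("--"):
            continue
        changes[line] += 1
    return changes


# Hypothetical sample history, made up for illustration.
SAMPLE_LOG = """--a1b2c3
src/parser.py
src/lexer.py

--d4e5f6
src/parser.py
README.md

--0789ab
src/parser.py
"""

if __name__ == "__main__":
    # Print the most frequently changed files first.
    for path, count in count_file_changes(SAMPLE_LOG).most_common(3):
        print(f"{count:3d}  {path}")
```

Change frequency alone is a coarse signal; in practice it would usually be combined with other repository data, such as bug reports, before drawing conclusions.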

Types of software repositories include source control repositories, bug repositories, code repositories, and even archived developer communications such as mailing lists and wiki pages.

MSR Tools

MSR requires tools that extract data and structured code facts from projects. One example of an MSR tool is SeCold, a platform that supports data extraction and on-the-fly inter-dataset integration from major version control, issue tracking, and quality evaluation systems. Its dataset contains about two billion facts, such as source code statements, software licenses, and code clones, drawn from 18,000 software projects.
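To give a feel for what integrating version control data with issue tracking data can involve, here is a minimal sketch that links commits to issues by looking for `#<number>` references in commit messages. This is a common heuristic and is not a description of how SeCold works; the function `link_commits_to_issues`, the reference pattern, and the sample commits are all assumptions made for illustration.

```python
import re
from collections import defaultdict

# Assumed convention: issues are referenced in commit messages as "#<number>".
ISSUE_REF = re.compile(r"#(\d+)")


def link_commits_to_issues(commits):
    """Map each issue number to the commit ids whose message mentions it.

    `commits` is an iterable of (commit_id, message) pairs.
    """
    links = defaultdict(list)
    for commit_id, message in commits:
        # De-duplicate so a commit is linked to a given issue only once.
        issue_numbers = {int(n) for n in ISSUE_REF.findall(message)}
        for issue in sorted(issue_numbers):
            links[issue].append(commit_id)
    return dict(links)


# Hypothetical commits, made up for illustration.
SAMPLE_COMMITS = [
    ("a1b2c3", "Fix null check in parser (#42)"),
    ("d4e5f6", "Refactor lexer, closes #17 and #42"),
    ("0789ab", "Update README"),
]

if __name__ == "__main__":
    for issue, commit_ids in sorted(link_commits_to_issues(SAMPLE_COMMITS).items()):
        print(f"issue #{issue}: {', '.join(commit_ids)}")
```

Text matching of this kind misses commits that never mention an issue and can pick up unrelated numbers, so the resulting links are best treated as approximate.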

Vangie Beal
Vangie Beal is a freelance business and technology writer covering Internet technologies and online business since the late '90s.
