Data Definition & Meaning

Data refers to distinct pieces of information, usually formatted and stored in a way that is concordant with a specific purpose. Data can exist in various forms: as numbers or text recorded on paper, as bits or bytes stored in electronic memory, or as facts living in a person’s mind. Since the advent of computer science in the mid-1900s, however, data most commonly refers to information that is transmitted or stored electronically.

Grammatically, data is the plural form of the singular datum, but in practice data is widely used as a mass noun, like sand or water. For example, one might say the data proves something to be true in this case, “data” is referring to many pieces of information that are being used collectively to validate a claim. Not all writers accept the popular mass noun usage, however. Some academic and technical editors are adamant about the Latin plural and singular distinction (“the set of data prove” and “one datum proves”).

Machine-readable vs. human-readable data

All data can be categorized as machine-readable, human-readable, or both. Human-readable data utilizes natural language formats (such as a text file containing ASCII codes or PDF document), whereas machine-readable data uses formally structured computer languages (Parquet, Avro, etc.) to be read by computer systems or software. Some data is readable by both machines and humans, as in the case of CSV, HTML, or JSON.

The line between machine- and human-readable data is becoming increasingly blurred because so many formats that are prevalent today are accessible enough to be navigated by a human yet structured enough to be processed by a machine. This is largely the result of artificial intelligence, machine learning, and automation, which streamlines tasks and workflows so manual data entry and analysis is done by a machine rather than a human. However, these processes need to maintain their human readability in case the programming needs to be adjusted. Most data in these cases also exists in a vacuum and does not have much meaning without context from a human perspective.

Data phrases in technology

Data has become the forefront of many mainstream conversations about technology. New innovations constantly draw commentary on data, how we use and analyze it, and broader implications for those effects. As a result, the popular IT vernacular has come to include a number of phrases new and old:

  • Big data: A massive volume of structured and unstructured data that is too large to process using traditional database and software technologies.

  • Big data analytics: The process of collecting, organizing, and synthesizing large sets of data to discover patterns or other useful information.

  • Data center: Physical or virtual infrastructure used by enterprises to house computer, storage, and networking systems and components for the company’s IT needs.

  • Data integrity: The validity of data, which can be compromised in a number of ways including human error or transfer errors.

  • Data miner: A software application that monitors and/or analyzes the activities of a computer, and subsequently its user, to collect information.

  • Data mining: A class of database applications that look for hidden patterns in a group of data that can be used to predict/anticipate future behavior.

  • Data warehouse: A data management system that uses data from multiple sources to promote business intelligence.

  • Database: A collection of data points organized in a way that is easily maneuvered by a computer system.

  • Metadata: Summary information about a data set.

  • Raw data: Information that has been collected but not formatted or analyzed.

  • Structured data: Any data that resides in a fixed field within a record or file, including data contained in relational databases and spreadsheets.

  • Unstructured data: Information that does not reside in a traditional column-row database like structured data.

Kaiti Norton
Kaiti Norton is a Nashville-based Content Writer for TechnologyAdvice, a full-service B2B media company. She is passionate about helping brands build genuine connections with their customers through relatable, research-based content. When she's not writing about technology, she's sharing her musings about fashion, cats, books, and skincare on her blog.

Top Articles

Huge List Of Texting and Online Chat Abbreviations

From A3 to ZZZ we list 1,559 text message and online chat abbreviations to help you translate and understand today's texting lingo. Includes Top...

How To Create A Desktop Shortcut To A Website

This Webopedia guide will show you how to create a desktop shortcut to a website using Firefox, Chrome or Internet Explorer (IE). Creating a desktop...

The History Of Windows Operating Systems

Microsoft Windows is a family of operating systems. We look at the history of Microsoft's Windows operating systems (Windows OS) from 1985 to present...

Hotmail [Outlook] Email Accounts

  By Vangie Beal Hotmail is one of the first public webmail services that can be accessed from any web browser. Prior to Hotmail and its...

Supply Chain Definition &...

A supply chain is a network between an organization and its suppliers to...

Relational Database Definition &...

A relational database stores and connects data in tables and columns, emphasizing the...

Common Business-Oriented Language (COBOL)...

What is COBOL? COBOL stands for Common Business-Oriented Language. It is a 60-year-old programming...