Big Data is a term used to identify data that has the following characteristics (properties, attributes, features): [1] High Volume [2] High Velocity [3] High Variety . Why are these characteristics significant enough to define a genre of database management challenges and associated technology solutions? Database Management Applications have historically been designed to serve the needs of what Geoffrey Moore refers to as "Systems of Record" where the combined effects of Data Volume, Velocity, and Variety are limited. Emergence of the Web, REST Architecture, and Web Applications has created a new "Systems of Engagement" application genre (also coined by Geoffrey Moore). This application genre is characterized by Database Documents taking any combination of the following forms: [1] CSV files [2] Optimized Record Columnar (ORC) files [3] Parquet Files [4] HTTP Logs [5] OS System Logs [6] JSON (e.g., Avro Files) [7] Other structured document forms populated with engagement activity . As you can see, the database document types outlined above are different from the database documents associated with your typical RDBMS, for instance. Solution frameworks like Hadoop and Spark provide new infrastructure (Distributed Storage, Relational Abstraction Mapping & Transformation) for handling these new Data Management needs.
Attributes | Values |
---|---|
type | |
label |
|
comment |
|
skos:broader | |
skos:narrower | |
sameAs | |
skos:inScheme | |
Hashtag |
|
is skos:broader of | |
is skos:narrower of | |
is topic of | |
is References of | |
is Is Referenced By of | |
is xhv:related of |