ArangoDB Tutorial on Basic Concepts and Terminologies

In this chapter, we will discuss the basic concepts and terminology of ArangoDB. It is important to understand the underlying terminology of any technical topic before working with it.

The key terms for ArangoDB are listed below:

  • Document
  • Collection
  • Collection Identifier
  • Collection Name
  • Database
  • Database Name
  • Database Organization

From the perspective of the data model, ArangoDB may be considered a document-oriented database, as the notion of a document is the central idea of the latter. Document-oriented databases are one of the main categories of NoSQL databases.

The hierarchy goes like this: documents are grouped into collections, and collections exist inside databases.

Identifier and name are two attributes that both a collection and a database have.
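The hierarchy described above can be sketched in plain Python. This is only a toy in-memory model, not the ArangoDB API: the Database and Collection classes, the counter used to hand out identifiers, and the "shop" and "users" names are all assumptions made for illustration. The _key attribute is the key ArangoDB uses to identify a document within its collection.

```python
from dataclasses import dataclass, field
from itertools import count

# Toy identifier generator (assumption): a real server assigns identifiers itself.
_ids = count(1)


@dataclass
class Collection:
    name: str
    identifier: int = field(default_factory=lambda: next(_ids))
    documents: dict = field(default_factory=dict)  # maps _key -> document

    def insert(self, doc: dict) -> dict:
        # Documents are stored under their _key inside the collection.
        self.documents[doc["_key"]] = doc
        return doc


@dataclass
class Database:
    name: str
    identifier: int = field(default_factory=lambda: next(_ids))
    collections: dict = field(default_factory=dict)  # maps name -> Collection

    def create_collection(self, name: str) -> Collection:
        if name in self.collections:
            raise ValueError(f"collection {name!r} already exists")
        coll = Collection(name)
        self.collections[name] = coll
        return coll


# Documents live in collections, and collections live in a database.
db = Database("shop")
users = db.create_collection("users")
users.insert({"_key": "alice", "age": 30})
```

Looking up db.collections["users"].documents["alice"] walks the hierarchy from database to collection to document, and both the database and the collection carry a name and an identifier.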

Usually, two documents (vertices) stored in document collections are linked by a document (edge) stored in an edge collection. This is ArangoDB's graph data model. It follows the mathematical concept of a directed, labeled graph, except that edges don't just have labels, but are full-blown documents.

Having become familiar with the core terms for this database, we can look more closely at ArangoDB's graph data model. In this model, there are two types of collections: document collections and edge collections. Edge collections also store documents, but each of these documents includes two special attributes: the _from attribute and the _to attribute. These attributes are used to create edges (relations) between documents, which is essential for a graph database. In the context of graphs, document collections are also called vertex collections, a term taken from graph theory.
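As a rough illustration of how _from and _to link documents, the sketch below models one vertex collection and one edge collection with plain Python dicts. It is not ArangoDB code: the persons and knows collections, the sample data, and the two helper functions are hypothetical. It does follow ArangoDB's convention that _from and _to hold document handles of the form collection/key.

```python
# Vertex documents, addressed by their "<collection>/<key>" handle.
persons = {
    "persons/alice": {"name": "Alice"},
    "persons/bob": {"name": "Bob"},
    "persons/carol": {"name": "Carol"},
}

# Edge documents: full documents that also carry _from and _to.
knows = [
    {"_from": "persons/alice", "_to": "persons/bob", "since": 2019},
    {"_from": "persons/bob", "_to": "persons/carol", "since": 2021},
]


def outbound_neighbors(vertex_id, edges):
    """Follow edges in their direction: vertices reachable from vertex_id."""
    return [e["_to"] for e in edges if e["_from"] == vertex_id]


def inbound_neighbors(vertex_id, edges):
    """Follow edges against their direction: vertices pointing at vertex_id."""
    return [e["_from"] for e in edges if e["_to"] == vertex_id]


for handle in outbound_neighbors("persons/alice", knows):
    print(persons[handle]["name"])
```

Because an edge is a document, it can carry arbitrary extra attributes (here, since) in addition to the two that define its direction.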

Let us now see why databases matter. They matter because collections exist inside databases. A single instance of ArangoDB can hold one or many databases. Different databases are usually used for multi-tenant setups, as the different sets of data inside them (collections, documents, etc.) are isolated from one another. The default database _system is special, because it cannot be removed. Users are managed in this database, and their credentials are valid for all the databases of a server instance.
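The isolation between databases and the special role of _system can be pictured with another small Python sketch. Again, this is a toy model under stated assumptions rather than ArangoDB's interface: the Server class, its methods, and the tenant_a and tenant_b names are hypothetical.

```python
class Server:
    """Toy stand-in for one ArangoDB instance holding several databases."""

    def __init__(self):
        # database name -> {collection name -> list of documents}
        self.databases = {"_system": {}}

    def create_database(self, name):
        if name in self.databases:
            raise ValueError(f"database {name!r} already exists")
        self.databases[name] = {}

    def drop_database(self, name):
        # The default database is special and cannot be removed.
        if name == "_system":
            raise PermissionError("the _system database cannot be removed")
        del self.databases[name]


server = Server()
server.create_database("tenant_a")
server.create_database("tenant_b")

# Each tenant has its own collections; same name, separate data.
server.databases["tenant_a"]["orders"] = [{"_key": "1", "total": 40}]
server.databases["tenant_b"]["orders"] = []
```

Writing to the orders collection of tenant_a leaves the orders collection of tenant_b untouched, which is the property that makes separate databases a natural fit for multi-tenant setups.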