Category: Databases

Apache Hadoop

Apache Hadoop is an open-source software library that supports the distributed processing of large data sets across computer clusters. The framework is designed to scale up from single servers to multiple machines, each offering local computation and storage. At the same time, all modules of Hadoop are designed with the basic assumption that hardware failures are common and should be automatically handled by the framework.

The core of the Hadoop framework consists of a storage path known as HDFS (Hadoop Distributed File System), and a processing part called MapReduce. Hadoop splits files into large blocks and distributes them across nodes in a cluster. In order to process the data, the packaged code is transferred by Hadoop to nodes, so that the packages are processed in parallel according to the data that needs to be processed.

MongoDB

MongoDB is an open-source NoSQL database that can store, retrieve and manage document-oriented information. In order to adapt JSON documents, MongoDB is evading the relational database table-based structures.

MongoDB is created on an architecture of documents and collections, instead of using rows and tables as in a relational database. Also, MongoDB was written in C++, C and JavaScript by MongoDB Inc., published under dual license – GNU Affero General Public License and the Apache License.

Apache Cassandra

Apache Cassandra is a free and open-source distributed database used for managing large amounts of structured data across many commodity servers. Also, it provides services that are highly available and have no single point of failure.

Cassandra has a high-value performance and offers robust support (with asynchronous masterless replication) for clusters from multiple datacenters.

NoSQL

NoSQL is a non-relational, open-source, distributed and horizontally scalable database, which stores and accesses data using key values. Originally referred to as “non SQL” or “non-relational”, such databases have existed since the late 1960s, but have become more popular in the late twenty-first century when companies such as Facebook, Google and Amazon faced the need for Web 2.0.

Today, NoSQL is known as “Not only SQL” to point out the fact that it can support SQL-like queries. With the purpose of solving the scalability and big data performance issues, NoSQL supports a wide range of technologies and architectures.

MySQL

MySQL is the most popular open-source relational database management system. MySQL is an important part of LAMP (Linux, Apache, MySQL, PHP / Perl / Python), a widely used open-source technology stack for web development.

Microsoft SQL Server

Microsoft SQL Server is a relational database server that comes with a variety of tools and a front-end management application. It supports the Transact-SQL language for queries, but the .NET languages can be used as well.

Microsoft Access

Microsoft Access is a database management system (part of the Microsoft Office suite of applications) that uses a relational database engine (Microsoft Jet Database Engine). It provides a graphical user interface and software development tools. Also, it has the capability to import or link directly to the data stored in other applications and databases.

Microsoft Excel

Microsoft Excel is part of the Microsoft Office suite of applications, and it’s available for both the Microsoft Windows and Apple Mac OS platforms. Microsoft Excel manages spreadsheet files, and includes the following features: calculations, pivot tables, graphing tools and a macro programming language (Visual Basic for Applications).

Scroll to Top