CS 557 Writing Assignment #2
Naveen Chitturi
Introduction to the topic of NoSQL DBs
Relational database systems came into existence in the 1970’s, and revolutionized the way data is maintained in computers. Like every technology that is subjected to test of time, relational database systems are under criticism for not being scalable enough to meet growing users and exponentially increasing data management needs. Today, a new technology called NoSQL is being pioneered by leading companies like Google, Amazon, and Facebook etc. to meet the shortcomings of the 40-year-old relational database technology. Modern web applications are making a transition from traditional relational databases to NoSQL databases to meet the demand of steadily growing concurrent users and big data.
NoSQL today stands for ‘Not only SQL’ and encompasses a myriad of different database technologies that can cope up with scale and agility challenges faced by modern applications. It provides the capability to store unstructured data, which is not possible using RDBS. The major advantage of using a NoSQL DB is horizontally scalability, that is, cheap infrastructure can be newly added to improve performance instead of upgrading an already expensive system. There are four types of NoSQL databases, as follows
• Document store: These store complex data structures known as documents, by associating them with a unique key. Documents can contain many key-value pairs and can even nest other documents. E.g. MongoDB,
There are a lot of system requirements and assumptions made in this paper. The query model is assumed to have simple read and write operations to data nodes that are identified uniquely by a key. This assumption is made based on the fact that most of the amazon applications does not require a relational schema and can work with simple queries.
In order to overcome these limitations, a new database model known as Not Only SQL (NoSQL) database emerged with a set of new features. The main objective of NoSQL is not to discard SQL, but to be used as an alternative database data model for new features [1] [2] [3]. NoSQL database increases the performance of relational databases by a set of new characteristics and advantages. In contrast to relational databases, NoSQL databases introduced an additional feature that provides flexible and horizontal scalability and taking advantage of new clusters. The rise of NoSQL provides cost-effective management of data in modern web applications. With its new features, NoSQL can be used with applications that have a large transaction, and require low-latency access to huge datasets, service availability while
Provide reasoning to support the use of the NoSQL database as the database of choice to solve the problem faced by TWC. Identify one strength and one weakness for each of the other three kinds of databases to solve the problem for TWC.
STRUCTURE OF DATA: The data structure of a relational database comprises of table structure. Every table is identified by a unique name or label. The data tables are described as the collection of rows and columns. Each row of the table is known as the record and each column is known as the field of the specific data table. All the data sets are well organized and logical linked to each other through definite and unique relationships. A table, therefore can also be defined as the “structured collection of relationships”. The fundamental aim of developing No SQL database systems is to easily and effectively handle vast quantity of data or information in advanced web-scale applications. In order to achieve this purpose, the No SQL systems are designed as the schema-free database systems. There are different modes to define the No SQL databases that typically depend on the requirements of the data that has to be managed. The main No SQL data structures include column database, key-value store database, document store database, graph database and
NoSQL databases are a significant departure from the relational model that has dominated the business world for the past few decades. Standing for “Not Only SQL,” these products are all some variation of a non-relational, key-value pair database, and they are becoming very popular with companies that use Big Data and prioritize speed or availability over consistency of data.
NoSQL databases are databases designed to run on clusters of computers/servers, built for the ever-increasing data storage needs for websites. Devised as a way of scaling databases horizontally which is a challenge with traditional relational databases. Scaling horizontally is the ability to add more computers/servers as nodes to a database. These “clusters” work well with write-heavy systems and allow increase storage and processing power limited only by the number of connections you can have on the network. Defined as No-Schema, No-SQL data structures mean they are not limited to the original data structure. Objects and fields etc can be implemented at
NoSQL databases had made for unraveling the Big Data issue by utilizing a distributed system to bring out excellent performance in data storage and retrieval at very large-scale. At this scale, pieces of the system often fail and NoSQL is created to handle these failures (Chow, 2013) (Ron, Shulman-Peleg, & Bronshtein, 2015). Various companies have espouse different sorts of non-relational databases, ordinarily alluded to as
The paper provides background and related literature on the Big Data, studies the concept from Relational Database to current NoSQL database which have been fueled by the growth Big Data and importance of managing it. And surveys the Big Data challenges from the perspective of its characteristics Volume, Variety and Velocity and attempts to study how those challenges can be addressed.
The demands on database technology have been ever expanding since its introduction in the 1960’s. Today traffic on the internet requires that millions upon millions of records be stored and queried each second. Data must be highly available and quickly retrievable. These requirements put together have given rise to new forms of database technologies collectively called “NoSQL” or “Not Only SQL”. NoSQL eschews the strict guidelines that govern the creation and function of traditional relational databases. These guidelines are put aside in order to rise to the new demands of an increasingly interconnected world. The rigorous standards and data definitions of relational databases give way in order to provide the ability to rapidly
The modern RDBMS advancements are not capable of supporting unstructured information with ideal space necessity. The plan winds up plainly mind-boggling and is henceforth troublesome for designers. The requirement for unstructured information administration is so annoying with conventional RDBMS arrangements (Big data in financial services industry: Market trends, challenges, and prospects 2013 - 2018). Moreover, RDBMS turns out to be an exorbitant answer for creating light-footed web applications with direct information investigation necessities. NoSQL is developing as a proficient possibility in this situation, which connects the issues related with RDBMS innovation. The market development can credit to creative dispatches of NoSQL arrangements, and collective endeavors by NoSQL sellers and clients. The endeavors of organizations, to enhance their market offerings, are creating the request of NoSQL, as a back-end bolster (Big data in financial services industry: Market trends, challenges, and prospects 2013 - 2018). The emergence of agile software development is creating the demand for NoSQL (Big data in financial services industry: Market trends, challenges, and prospects 2013 - 2018). They offer users much more avenues to accept data in many different forms. NoSQL is adaptable as SQL but offers many more uses that can apply to many organizations.
With its dispatch in 2015 the AWB uses devices like Kinesis, which is for continuous spouting, S3 which is for basic accumulating stage and DynamoDB – it is immense limit with respect to NoSQL database.
NoSQL Databases are being used in the social media applications and big data processing based portals in which huge, heterogeneous and unstructured data formats are handled. NoSQL Databases are used for faster access of records from the big dataset at back-end. The AADHAAR Card implementation in India was done using NoSQL Databases as huge amount of information is associated including Text Data, Images, Thumb Impressions and Iris Detection. Any classical database system cannot handle the dataset of different types (Image, Text, Video, Audio, Video, Thumb Impressions for Pattern Recognition, Iris Sample) simultaneously.
As there is a rise in data volumes, the manageability of data and storing these huge volumes of data became a cause of concern to most of the organizations. It was during this period when Number of SQL or more popularly NoSQL was introduced, to process these large amounts of data efficiently and effectively. For this purpose, various Data Store categories were developed, based on the different data models. Some of the categories are:
NoSQL Databases also referred as Not only SQL databases. These NoSQL database have these days gained much attention and reputation because of their performance and high scalability. The advantage of NoSQL database is to store efficiently unstructured data. These days use of e-commerce websites, social networking sites etc. has been increased. These usage made to create the need to store the large data. Some companies have adopted NoSQL databases, as their data is growing. Dynamo, Big table, Voldemort, Cassandra are the NoSQL databases that are used by Amazon, Google, and LinkedIn and Facebook respectively. Facing these huge data has become challenging for Relational Database Management Systems. Hence NoSQL database came into existence. Mostly Relational Database Management System satisfies ACID properties, through NoSQL database we can achieve high level of Scalability and performance. As a lot of sensitive data is stored in NoSQL databases security issues becomes growing concerns.
Abstract- The multidimensional growth in computing systems and technologies have resulted in advanced scalable, portable and large scale integrated systems and technologies. Datacenters, virtualization, cloud and WEB2 technologies are the frontiers of such growth. [1] Cloud computing represents an important step towards realizing McCarthy’s dream that all aspects of computation may someday be organized as a public utility service. Both public and private cloud platforms are looking to deliver the benefits of cloud computing to their customers. The database is a critical part of this platform. Therefore cloud database need to be compatible with cloud computing. [5] Though cloud computing offers huge opportunities to the IT industry, there are many issues still to be addressed in the current scenario.[6]