SURVEY ON KEYWORD SEARCH IN RELATIONAL DATABASES
Chavan Aparna R1, Bangar S2
1 M. Tech Scholar, Department of Computer Science & Engineering, Maharashtra Institute of Technology (MIT), Aurangabad, Maharashtra, India
2 Assistant Professor, Department of Computer Science & Engineering, Maharashtra Institute of Technology (MIT), Aurangabad, Maharashtra, India
Abstract
Keyword search is the most effective information discovery method in documents. The large volume of data is stored in databases. Plain text coexists with structured data, unstructured data for this type of data efficient processing of top-k queries is a crucial requirement. This paper describes fundamental characteristics including relational database, top-k queries, steiner trees. Recently, Tuple units are used to improve the keyword search by joining the multiple related tuple units and indexes are used for structural relationships. In this paper various existing techniques for developing search system are compared. This survey also describes the Ranking. Ranking queries are dominant in many emerging applications for finding top-k answers. The research strategy used to resolve is top-k query processing.
Key Words: Keyword Search, Top-k Query Processing, Relational Databases, Tuple Units.
--------------------------------------------------------------------***----------------------------------------------------------------------
1. Introduction
Data mining is the process that attempts to discover patterns
Chapter 1 of James MacCormick’s book, Nine Algorithms That Changed The Future, provides an overview of what an algorithm is and introduces the reader of the algorithms that will be discussed throughout the book. An algorithm is a precise recipe that specifies the exact sequence of steps required to solve a problem (MacCormick, 3). Algorithm requires a sequence of steps that contain the instructions on what to do. One of the key elements of an algorithm is to have a set of rules in order to perform the mathematical calculations. Another feature of algorithms is that it always works. Computer science describes how to solve a problem using an algorithm. The main purpose of the book is to explain the algorithms one’s computer uses everyday
The excessive use of computers has drastically changed the lives of many users. As a multifaceted tool, the computer is used for tasks to include research, homework, business related
With low learning curve and massive collection of resources, our solutions are designed is such a flexible manner that a non-technical person will be able to use and master them with ease and perfection.
1. HD Introduction 2. Summary of Article 3. Issues 4. Recommendation & Implementation 5. Conclusion
There are many different areas in information systems to study. Data management, data mining, data warehousing, information management, information security, information assurance, healthcare informatics and bioinformatics are just a small sample of some of the different areas of study that will be examined in this paper. Also included in this paper are answers to questions posed by the rubric for this assignment.
Throughout extraction, the desired data is identified and extracted from the source system and is made available for additional processing. The data can be extracted from numerous different sources. In most cases, the data sources are internal however sometimes they are external. The ultimate aim is to retrieve all the essential data from the source system with as little resources as possible. The size of the data extracted can range from kilobytes to gigabytes.
In every moment of our lives, we handle a tremendous amount of large data, this large data will become a store of values if we could be turned it into searchable information involving analysis steps. The big challenge is that 90% of the large data are unstructured data which is growing faster than structured data, unstructured data as a data warehouse come without any predefined data structure and not appropriate with any relational database schema.
Ranking the query result is a key requirement for keyword search in order to rank and make appear the most relevant results first. XML keyword search queries are different from HTML keyword search queries in the way query results are ranked. Normally, documents are ranked by HTML search engines (such as Google) based (partly) on their hyperlinked structure (Brin and Page, 1998; Kleinberg, 1999). XML keyword search queries can return nested elements. Hence, ranking has to be computed at the granularity of XML elements, as opposed to entire XML documents. Since the semantics of containment links (relating parent and child elements) is very different from that of hyperlinks, computation of rankings at the granularity of elements is complicated. As a result, ranking techniques which are used for computation solely based on hyperlinks (Brin and Page, 1998; Kleinberg, 1999) cannot directly be applied for nested XML elements. Some of the works on result rankings for XML keyword query results include XRANK (Guo et al., 2003), XSEarch (Cohen et al., 2003), EASE (Li et al., 2008) and XReal (Bao et al., 2010a).
As an obvious fact, we have lots of data in various fields. Actually, it is estimated that the amount of useful data produced will be over 15 zettabytes by 2020, compared with 0.9 zettabytes in 2013. [IDC 's Study 1] This has led to an unavoidable challenge, however, data users have to figure out a way to properly store and effectively analyze the large-scale data.\
This proposal is submitted to the Computer and Information Science faculty in partial fulfillment for the degree
In additional, different libraries are utilized in this research. The online search mechanism is extensively used. Online search employs a number of key phrases and keywords in the search engines. Keywords and phrases used in the search process include; Pick-rounds in player selections, American football player recruitment, Draft position, NBA and NFL success, American football, American basketball, first round draft picks and late round draft picks.
Dept. of Computer Engineering, Bhivarabai Sawant Institute Of Technology and Reasearch, Wagholi, Pune, India 1
4 Assistant Professor , Bharti Vidyapeeth deemed university , College of engineering Pune , Maharashtra-411043 , India
3. Presents the software description. It explains the implementation of the project using PIC C Compiler software.
A PROJECT PROPOSAL SUBMITTED TO THE SCHOOL OF COMPUTER STUDIES IN PARTIAL FULLFILMENT OF THE REQUIREMENT FOR THE AWARD OF