Study Report on
Information retrieval and evaluating its usefulness
Adarsh Murali Kashyap
800828747
Table of Contents
1. Introduction III
2. ETL process III
3. Creation of a warehouse using SQL statements VII
4. OLAP operations VII
5. Data Mining IX
5.1. Cluster Analysis IX
5.2. Association Rule Mining XII
5.3. Outcome of ETL, OLAP, Mining operations XII
6. Data Analytics and its usefulness for business XII
7. Usage of production logs to test and engineer an application’s performance: XIII
8. References: XIV
9. VB Code XV
1. Introduction
Information is very important to a business organization. Information helps in identifying opportunities, understanding the customers in
…show more content…
There are many ways of cleaning data using many tools that help in formatting, removal of unwanted parts of data. Here I will make an effort to demonstrate a method of extracting, cleaning of data files using Visual Basic and evaluating its usefulness. This report comprises certain topics that I have studied during the course of my masters program which includes few concepts of data warehousing, data management and data analytics. These topics cover different ways of data manipulation such as extraction, transformation techniques, loading of data using SQL queries (creation of tables, insertion of values and checking their normal forms), creation of a data warehouse, evaluating its usefulness by measuring several factors, applying data mining techniques to analyze data in a better way that will lead to improved understanding of business and importance of analytics on business data.
2. ETL process
ETL is a process of managing databases by performing the below mentioned steps:
Step 1: Extraction - Extract data from data sources.
Step 2: Transformation
Data cleaning: remove errors, inconsistencies and redundancies.
Data transformation: transform data into warehouse format.
Data reduction: remove useless data, shrink data without loss of information.
Step 3: Loading - Load transformed data into database/warehouse.
I will be considering “Movies.list” file from IMDB
Information Management has to do with capturing information, efficient planning, organizing and evaluating the information to interpret for an organization to make well informed decisions. (Hinton, 2006) The main reason organizations depend on information is to improve its overall management in
Information management is a conscious process that needs to be planned. Having regular updates of the business is information gathered and this assists in any decision making. This is to be used by the business and is most useful at the starting point of the decision making process. The information gathered should be used at all levels of the business not just at senior management positions.
Information can be relative to anything with regards to an organisation. When it comes to customers, it can be their address, telephone number or outstanding payments, when it comes to employees, it can be their appraisals, salaries, again their address and telephone numbers, and for the business, it can be the business’s finances, profits, employee and customer details, and various other information.
1. Given a business situation in which managers require information from a database, determine, analyze and classify that information so that reports can be designed to meet the requirements.
Organizations today use information in more than ways than one can count. Information is so important today because it is used by organizations and businesses to keep everyday business processes and day-to-day operations running smooth and without glitches. As I have looked at many business and organizational sources both in person and on the internet, I have found that a large majority of these have the information flow start at the top in the executive area. This is what I like to call the trickle-down effect. Information is set to flow among the business and organization and starts at the top and goes all the way to the bottom. During that time, information can be changed to allow only that
This paper will discuss how an information system is critical to the business process of an organization and how the information has impacted the organization 's structure.
Reliable and valid information is essential to all businesses and organisations because they do not know where they are going and if they get to where they want to be it is more by luck than by good planning. Organisations use information for a variety of purposes and these are as
Information is the key to any organization in the world today; it is what makes an organization successful, accurate and proficient in an increasingly competitive market. Without information a company is powerless, it does not know its customer or understand them,
The purpose of information retrieval is to provide quality service for the right person at the right time, with all the required information in hand. Only if data is stored in a procedural manner it can be easily retrieved. Information might be retrieved for marketing purposes, for
Information is data that has been processed so that it has meaning and value to a recipient,
The ability to manage information plays a critical role in developing a firm’s capabilities in customer relationship management, process management and performance management (Mithas, 2011).
Information is an asset that, like other important personal assets, is essential to an in-dividual and should be protected. Information can exist in many forms. It can be printed or written on paper, stored electronically, transmitted by post or by using electronic means, shown, or spoken in conversation. In whatever form the information takes, or means by which it is shared or stored, it should always be appropriately secured.
Information is data that has been processed in such a way as to be meaningful to the person who receives it. Businesses need information that is relevant to them.
Today, just about everyone depends on information and communication to keep their lives moving through daily activities like work, education, health care, leisure activities, entertainment, travelling, personal relationships, and the other stuff with which we are
Information management (IM) is the collection and management of information from one or more sources and the distribution of that information to one or more audiences; is also particularly critical to businesses that work in conjunction with other businesses, so the two must share information with, or transfer information to, each other. In addition, businesses with more than one department or unit can use the MIS to compile information in one central location, thereby preventing information loss.