Docsity
Docsity

Prepare for your exams
Prepare for your exams

Study with the several resources on Docsity


Earn points to download
Earn points to download

Earn points by helping other students or get them with a premium plan


Guidelines and tips
Guidelines and tips

data mining concepts , Exams of Management Information Systems

this document contains information regarding data mining, data warehousing and OLAP

Typology: Exams

2017/2018

Uploaded on 01/28/2018

rajya-lakshmi-jasti
rajya-lakshmi-jasti 🇮🇳

1 document

1 / 3

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
Data mining is the process of sorting through large data sets to identify patterns and
establish relationships to solve problems through data analysis. Data mining tools
allow enterprises to predict future trends.
A data set is a collection of related, discrete items of related data that may be
accessed individually or in combination or managed as a whole entity.
Data mining parameters
In data mining, association rules are created by analyzing data for frequent if/then patterns,
then using the support and confidence criteria to locate the most important relationships
within the data. Support is how frequently the items appear in the database, while confidence
is the number of times if/then statements are accurate.
Association rules are if/then statements that help uncover relationships between
seemingly unrelated data in a relational database or other information repository. An
example of an association rule would be "If a customer buys a dozen eggs, he is
80% likely to also purchase milk."
An association rule has two parts, an antecedent (if) and a consequent (then). An
antecedent is an item found in the data. A consequent is an item that is found in
combination with the antecedent.
Association rules are created by analyzing data for frequent if/then patterns and
using the criteria support and confidence to identify the most important
relationships. Support is an indication of how frequently the items appear in the
database. Confidence indicates the number of times the if/then statements have
been found to be true.
In data mining, association rules are useful for analyzing and predicting customer
behavior. They play an important part in shopping basket data analysis, product
clustering, catalog design and store layout.
Programmers use association rules to build programs capable of machine learning.
Machine learning is a type of artificial intelligence (AI) that seeks to build programs
with the ability to become more efficient without being explicitly programmed.
Other data mining parameters include Sequence or Path Analysis, Classification, Clustering
and Forecasting. Sequence or Path Analysis parameters look for patterns where one event
leads to another later event. A Sequence is an ordered list of sets of items, and it is a common
type of data structure found in many databases. A Classification parameter looks for new
pf3

Partial preview of the text

Download data mining concepts and more Exams Management Information Systems in PDF only on Docsity!

Data mining is the process of sorting through large data sets to identify patterns and establish relationships to solve problems through data analysis. Data mining tools

allow enterprises to predict future trends.

A data set is a collection of related, discrete items of related data that may be

accessed individually or in combination or managed as a whole entity.

Data mining parameters

In data mining, association rules are created by analyzing data for frequent if/then patterns, then using the support and confidence criteria to locate the most important relationships within the data. Support is how frequently the items appear in the database, while confidence is the number of times if/then statements are accurate.

Association rules are if/then statements that help uncover relationships between seemingly unrelated data in a relational database or other information repository. An

example of an association rule would be "If a customer buys a dozen eggs, he is 80% likely to also purchase milk."

An association rule has two parts, an antecedent (if) and a consequent (then). An

antecedent is an item found in the data. A consequent is an item that is found in combination with the antecedent.

Association rules are created by analyzing data for frequent if/then patterns and using the criteria support and confidence to identify the most important

relationships. Support is an indication of how frequently the items appear in the

database. Confidence indicates the number of times the if/then statements have been found to be true.

In data mining, association rules are useful for analyzing and predicting customer

behavior. They play an important part in shopping basket data analysis, product clustering, catalog design and store layout.

Programmers use association rules to build programs capable of machine learning. Machine learning is a type of artificial intelligence (AI) that seeks to build programs with the ability to become more efficient without being explicitly programmed.

Other data mining parameters include Sequence or Path Analysis, Classification, Clustering and Forecasting. Sequence or Path Analysis parameters look for patterns where one event leads to another later event. A Sequence is an ordered list of sets of items, and it is a common type of data structure found in many databases. A Classification parameter looks for new

patterns, and might result in a change in the way the data is organized. Classification algorithms predict variables based on other factors within the database.

Clustering parameters find and visually document groups of facts that were previously unknown. Clustering groups a set of objects and aggregates them based on how similar they are to each other.

There are different ways a user can implement the cluster, which differentiate between each clustering model. Fostering parameters within data mining can discover patterns in data that can lead to reasonable predictions about the future, also known as predictive analysis.

Data mining tools and techniques

Data mining techniques are used in many research areas, including mathematics, cybernetics , genetics and marketing. While data mining techniques are a means to drive efficiencies and predict customer behavior, if used correctly, a business can set itself apart from its competition through the use of predictive analysis.

Data Mining 101

Web mining, a type of data mining used in customer relationship management, integrates information gathered by traditional data mining methods and techniques over the web. Web mining aims to understand customer behavior and to evaluate how effective a particular website is.

Other data mining techniques include network approaches based on multitask learning for classifying patterns, ensuring parallel and scalable execution of data mining algorithms, the mining of large databases, the handling of relational and complex data types, and machine learning. Machine learning is a type of data mining tool that designs specific algorithms from which to learn and predict.

Benefits of data mining

In general, the benefits of data mining come from the ability to uncover hidden patterns and relationships in data that can be used to make predictions that impact businesses.

Specific data mining benefits vary depending on the goal and the industry. Sales and marketing departments can mine customer data to improve lead conversion rates or to create one-to-one marketing campaigns. Data mining information on historical sales patterns and customer behaviors can be used to build prediction models for future sales, new products and services.