



Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Community
Ask the community for help and clear up your study doubts
Discover the best universities in your country according to Docsity users
Free resources
Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors
computer science and programming
Typology: Essays (university)
1 / 6
This page cannot be seen from the preview
Don't miss anything!
Q1. Explain the kind of knowledge to be mined? (10 Marks) Answer: Data mining is not specific to one type of media or data. Data mining should be applicable to any kind of information repository. Algorithms and approaches may differ when applied to different types of data. The challenges presented by different types of data vary significantly. Data mining is being put into use and studied for databases, including relational databases, object-relational databases and object-oriented databases, data warehouses, transactional databases, unstructured and semi-structured repositories such as the World Wide Web, advanced databases such as spatial databases, multimedia databases, time-series databases and textual databases, and even flat files. Here are some examples.
For example, a user may define big spenders as customers who purchase items that cost $100 or more on an average; and budget spenders as customers who purchase items at less than $100 on an average. The mining of discriminant descriptions for customers from each of these categories can be specified in the DMQL as : mine comparison as purchaseGroups for bigSpenders where avg(I.price) ≥$ versus budgetSpenders where avg(I.price)< $ analyze count
For example, To mine patterns, classifying customer credit rating where the classes are determined by the attribute credit_rating, and mine classification is determined as classify Customer Credit Rating analyze credit_rating