Jaypee Institute of Information Technology (JIIT) 2008 B.E Computer Science Test 1 : Information Retrieval and Data Mining for CSE - Question Paper
Jaypee University of Information Technology Waknaghat B.Tech. VIII Sem (C.S.E. & I.T.)
Test 1, February 2008
Course code : 07B7ICI4U Maximum Time: 1 Hour
Course Title: Information Retrieval and Data Mining Maximum Marks: 20
Course credit: 3
(a) Briefly outline the major steps of decision tree classification.
(b) Describe how a box plot can give information about whether the value of an attribute is symmetrically distributed.
Q2 [2+2]
Suppose that the university course database for Big-University includes the following attributes describing students: name, address, status (e.g., undergraduate or graduate), major, and GPA (cumulative grade point average).
(a) Describe the architecture you would choose.
(b) Write a DMQL query to find associations involving course instructors, student grades. Use a metarule to specify the format of associations you would like to find. Specify minimum thresholds for the confidence and support of the association rules reported.
Consider the following dataset for a binary class problem.
[Dataset table not recoverable from this copy.]
Calculate the information gain when splitting on A and B. Which attribute would the decision tree induction algorithm choose?
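For reference, information gain is the entropy of the class labels before a split minus the weighted average entropy of the partitions after it. The following is a minimal Python sketch of that calculation. The function names, the `cls` key, and the four toy records are hypothetical and are NOT the dataset from this question, whose table is missing from this copy.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(rows, attribute, target):
    """Entropy of the target before the split minus the weighted entropy after it."""
    before = entropy([row[target] for row in rows])
    partitions = {}
    for row in rows:
        # Group the class labels by the value of the splitting attribute.
        partitions.setdefault(row[attribute], []).append(row[target])
    after = sum(len(part) / len(rows) * entropy(part) for part in partitions.values())
    return before - after


# Hypothetical toy records for illustration only (not the exam dataset).
toy_rows = [
    {"A": "T", "B": "T", "cls": "+"},
    {"A": "T", "B": "F", "cls": "+"},
    {"A": "F", "B": "T", "cls": "-"},
    {"A": "F", "B": "F", "cls": "-"},
]

gains = {attr: information_gain(toy_rows, attr, "cls") for attr in ("A", "B")}
# An induction algorithm that uses information gain picks the attribute with the largest gain.
best = max(gains, key=gains.get)
```

On these toy records, splitting on A separates the two classes completely while splitting on B leaves them evenly mixed, so `best` is `"A"`.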
Q4 [3+3]
Suppose that a data warehouse consists of the four dimensions date, spectator, location, and game, and two measures count and charge, where charge is the fare that a spectator pays when watching a game on a given date. Spectators may be students, adults, or seniors, with each category having its own charge rate.
(a) Draw a star schema diagram for the above data warehouse.
(b) Starting with the base cuboid [date, spectator, location, game], what specific OLAP operations should one perform in order to list the total charge paid by student spectators at GM Place in 2007?
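As an illustration of the kind of operations part (b) refers to, the sketch below rolls a small fact table up along date (to year) and game (to all), then selects a single cell by year, spectator category, and location. It is one possible reading under stated assumptions, not a model answer: the fact rows, the tuple layout, and the function names `roll_up` and `dice` are all hypothetical.

```python
from collections import defaultdict

# Hypothetical fact rows: (date, spectator category, location, game, count, charge).
facts = [
    ("2007-03-01", "student", "GM Place", "game-1", 1, 10.0),
    ("2007-03-01", "adult", "GM Place", "game-1", 1, 25.0),
    ("2007-11-15", "student", "GM Place", "game-2", 2, 20.0),
    ("2006-11-15", "student", "GM Place", "game-3", 1, 10.0),
    ("2007-05-20", "student", "Other Arena", "game-4", 1, 12.0),
]


def roll_up(rows):
    """Roll up date to year and game to 'all', summing the charge measure."""
    cube = defaultdict(float)
    for date, category, location, _game, _count, charge in rows:
        year = int(date[:4])  # climb the date hierarchy from day to year
        cube[(year, category, location)] += charge  # game is aggregated away
    return dict(cube)


def dice(cube, year, category, location):
    """Select one cell of the rolled-up cuboid; 0.0 if the cell is empty."""
    return cube.get((year, category, location), 0.0)


cube = roll_up(facts)
total = dice(cube, 2007, "student", "GM Place")
```

With these made-up rows, `total` is the sum of the two 2007 student charges at GM Place.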