- Premium Academic Help From Professionals
- +1 323 471 4575
- support@crucialessay.com

## Generalization Error Rate Of The Tree Using The Optimistic Approach


Instructions

Homework 3

Answer the following questions (10 points each):

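- The following table summarizes a data set with three attributes A, B, C and two class labels +, –. Build a two-level decision tree.

| A | B | C | Number of – instances | Number of + instances |
|---|---|---|---|---|
| T | T | T | 5 | 0 |
| F | T | T | 0 | 10 |
| T | F | T | 10 | 0 |
| F | F | T | 0 | 5 |
| T | T | F | 0 | 10 |
| F | T | F | 25 | 0 |
| T | F | F | 10 | 0 |
| F | F | F | 0 | 25 |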

  - (a) According to the classification error rate, which attribute would be chosen as the first splitting attribute? For each attribute, show the contingency table and the gain in classification error rate.
  - (b) Repeat for the two children of the root node.
  - (c) How many instances are misclassified by the resulting decision tree?
  - (d) Repeat parts (a), (b), and (c) using C as the splitting attribute.
  - (e) Use the results in parts (c) and (d) to draw a conclusion about the greedy nature of the decision tree induction algorithm.
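The gain in classification error rate asked for in part (a) can be sketched as follows. This is a minimal illustration, not a model answer: the function names (`contingency`, `error_gain`, `best_split`) are made up for this example, and the class-count columns are read in the order given in the table header (– first, then +).

```python
# Rows of the summary table: (A, B, C, count of "-" instances, count of "+" instances).
ROWS = [
    ("T", "T", "T", 5, 0),
    ("F", "T", "T", 0, 10),
    ("T", "F", "T", 10, 0),
    ("F", "F", "T", 0, 5),
    ("T", "T", "F", 0, 10),
    ("F", "T", "F", 25, 0),
    ("T", "F", "F", 10, 0),
    ("F", "F", "F", 0, 25),
]
ATTRS = {"A": 0, "B": 1, "C": 2}  # column index of each attribute


def class_counts(rows):
    """Return (number of "-" instances, number of "+" instances)."""
    return (sum(r[3] for r in rows), sum(r[4] for r in rows))


def error_rate(counts):
    """Classification error of a node that predicts its majority class."""
    total = sum(counts)
    return 0.0 if total == 0 else 1 - max(counts) / total


def contingency(rows, attr):
    """Class counts for each value (T/F) of the given attribute."""
    i = ATTRS[attr]
    return {v: class_counts([r for r in rows if r[i] == v]) for v in ("T", "F")}


def error_gain(rows, attr):
    """Parent error rate minus the weighted error rate of the children."""
    parent = class_counts(rows)
    n = sum(parent)
    weighted = sum(
        sum(c) / n * error_rate(c) for c in contingency(rows, attr).values()
    )
    return error_rate(parent) - weighted


def best_split(rows):
    """Attribute with the largest gain in classification error rate."""
    return max(ATTRS, key=lambda a: error_gain(rows, a))


if __name__ == "__main__":
    for name in ATTRS:
        print(name, contingency(ROWS, name), round(error_gain(ROWS, name), 4))
```

For part (b), the same functions can be applied to each child by filtering first, for example `[r for r in ROWS if r[0] == "T"]` for the A = T branch.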
- Classify the following attributes as binary, discrete, or continuous. Also classify them as qualitative (nominal or ordinal) or quantitative (interval or ratio). Some cases may have more than one interpretation, so briefly indicate your reasoning if you think there may be some ambiguity. Example: Age in years. Answer: Discrete, quantitative, ratio.
  - (a) Gender in terms of M or F.
  - (b) Temperature as measured by people's judgments.
  - (c) Height as measured by people's height.
  - (d) Body Mass Index (BMI) as an index of weight-for-height that is commonly used to classify underweight, overweight, and obesity in adults.
  - (e) States of matter: solid, liquid, and gas.
- For the following vectors, x and y, calculate the indicated similarity or distance measures.
  - (a) x = (1, 0, 0, 1), y = (2, 1, 1, 2): cosine, correlation, Euclidean
  - (b) x = (1, 1, 0, 0), y = (1, 1, 1, 0): cosine, correlation, Euclidean, Jaccard
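The four measures named above can be sketched in plain Python as shown here. The function names are chosen for this example only. Correlation is computed as the cosine of the mean-centered vectors (Pearson's correlation), and Jaccard assumes binary 0/1 vectors.

```python
import math


def cosine(x, y):
    """Cosine similarity: dot product divided by the product of the norms."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    return dot / (norm_x * norm_y)


def correlation(x, y):
    """Pearson correlation, computed as the cosine of the mean-centered vectors."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    return cosine([a - mean_x for a in x], [b - mean_y for b in y])


def euclidean(x, y):
    """Euclidean distance between x and y."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))


def jaccard(x, y):
    """Jaccard coefficient for binary vectors: matching 1s over positions with any 1."""
    both = sum(1 for a, b in zip(x, y) if a == 1 and b == 1)
    either = sum(1 for a, b in zip(x, y) if a == 1 or b == 1)
    return both / either


if __name__ == "__main__":
    xa, ya = (1, 0, 0, 1), (2, 1, 1, 2)
    xb, yb = (1, 1, 0, 0), (1, 1, 1, 0)
    print(cosine(xa, ya), correlation(xa, ya), euclidean(xa, ya))
    print(cosine(xb, yb), correlation(xb, yb), euclidean(xb, yb), jaccard(xb, yb))
```

Note that `correlation` is undefined for a constant vector (the centered norm is zero), which does not occur for the vectors in this question.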
- Construct a data cube from Fact Table. Is this a dense or sparse data cube? If it is sparse, identify the cells that are empty. The data cube is shown in Data Cube Table.

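Fact Table

| Product ID | Location ID | Number Sold |
|---|---|---|
| 1 | 1 | 10 |
| 1 | 3 | 6 |
| 2 | 1 | 5 |
| 2 | 2 | 22 |
| 3 | 2 | 2 |

Data Cube Table (Number Sold by Product ID and Location ID)

| Product ID | Location 1 | Location 2 | Location 3 | Total |
|---|---|---|---|---|
| 1 | 10 | 0 | 6 | 16 |
| 2 | 5 | 22 | 0 | 27 |
| 3 | 0 | 2 | 0 | 2 |
| Total | 15 | 24 | 6 | 45 |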
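Building the cube from the fact table amounts to summing Number Sold over every (product, location) pair and treating missing pairs as empty cells. The sketch below illustrates this with the five fact-table rows; the names `build_cube`, `empty_cells`, and `marginals` are illustrative only.

```python
# Fact table rows: (product_id, location_id, number_sold).
FACTS = [(1, 1, 10), (1, 3, 6), (2, 1, 5), (2, 2, 22), (3, 2, 2)]


def build_cube(facts):
    """Return the product IDs, location IDs, and a dict of cell totals."""
    products = sorted({p for p, _, _ in facts})
    locations = sorted({loc for _, loc, _ in facts})
    # Start every cell at 0 so combinations absent from the fact table show up.
    cube = {(p, loc): 0 for p in products for loc in locations}
    for p, loc, sold in facts:
        cube[(p, loc)] += sold
    return products, locations, cube


def empty_cells(facts):
    """Cells (product, location) that have no row in the fact table."""
    products, locations, _ = build_cube(facts)
    present = {(p, loc) for p, loc, _ in facts}
    return [(p, loc) for p in products for loc in locations if (p, loc) not in present]


def marginals(facts):
    """Totals per product (row totals) and per location (column totals)."""
    products, locations, cube = build_cube(facts)
    by_product = {p: sum(cube[(p, loc)] for loc in locations) for p in products}
    by_location = {loc: sum(cube[(p, loc)] for p in products) for loc in locations}
    return by_product, by_location


if __name__ == "__main__":
    print(empty_cells(FACTS))
    print(marginals(FACTS))
```

A cube is usually called sparse when a noticeable share of its cells is empty, so comparing `len(empty_cells(FACTS))` with the total number of cells is one way to support the answer.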

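- Consider the decision tree shown below, together with the training and validation sets that follow it:
  - (a) Compute the generalization error rate of the tree using the optimistic approach.
  - (b) Compute the generalization error rate of the tree using the pessimistic approach. (For simplicity, use the strategy of adding a factor of 0.5 to each leaf node.)
  - (c) Compute the generalization error rate of the tree using the validation set shown below. This approach is known as reduced error pruning.

[Figure: decision tree over attributes A, B, and C, with branches labeled 0 and 1 and leaves labeled + and –.]

Training:

| Instance | A | B | C | Class |
|---|---|---|---|---|
| 1 | 0 | 0 | 0 | + |
| 2 | 0 | 0 | 1 | + |
| 3 | 0 | 1 | 0 | + |
| 4 | 0 | 1 | 1 | – |
| 5 | 1 | 0 | 0 | + |
| 6 | 1 | 0 | 0 | + |
| 7 | 1 | 1 | 0 | – |
| 8 | 1 | 0 | 1 | + |
| 9 | 1 | 1 | 0 | + |
| 10 | 1 | 1 | 0 | + |

Validation:

| Instance | A | B | C | Class |
|---|---|---|---|---|
| 11 | 0 | 0 | 0 | + |
| 12 | 0 | 1 | 1 | + |
| 13 | 1 | 1 | 0 | + |
| 14 | 1 | 0 | 1 | – |
| 15 | 1 | 0 | 0 | + |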
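The three estimates can be sketched as below. The tree figure itself did not survive on this page, so the `predict` function encodes an assumed tree purely for illustration (root A; A = 0 tests B, A = 1 tests C; value 0 leads to a + leaf and value 1 to a – leaf). Replace it with the actual tree before relying on any number it produces. The function names and the ASCII `"-"` label are also choices made for this example.

```python
# Training and validation instances: (A, B, C, class). "-" stands for the negative class.
TRAIN = [
    (0, 0, 0, "+"), (0, 0, 1, "+"), (0, 1, 0, "+"), (0, 1, 1, "-"), (1, 0, 0, "+"),
    (1, 0, 0, "+"), (1, 1, 0, "-"), (1, 0, 1, "+"), (1, 1, 0, "+"), (1, 1, 0, "+"),
]
VALID = [
    (0, 0, 0, "+"), (0, 1, 1, "+"), (1, 1, 0, "+"), (1, 0, 1, "-"), (1, 0, 0, "+"),
]
NUM_LEAVES = 4  # leaves of the ASSUMED tree below


def predict(a, b, c):
    """ASSUMED tree (hypothetical, the original figure is missing):
    A = 0 -> test B, A = 1 -> test C; value 0 -> "+", value 1 -> "-"."""
    if a == 0:
        return "+" if b == 0 else "-"
    return "+" if c == 0 else "-"


def count_errors(data):
    """Number of instances whose predicted class differs from the true class."""
    return sum(1 for a, b, c, label in data if predict(a, b, c) != label)


def optimistic_error(train):
    """Optimistic approach: the training error is taken as the generalization error."""
    return count_errors(train) / len(train)


def pessimistic_error(train, num_leaves, penalty=0.5):
    """Pessimistic approach: add a penalty of 0.5 per leaf to the training errors."""
    return (count_errors(train) + num_leaves * penalty) / len(train)


def validation_error(valid):
    """Error rate measured on the held-out validation set."""
    return count_errors(valid) / len(valid)


if __name__ == "__main__":
    print(optimistic_error(TRAIN))
    print(pessimistic_error(TRAIN, NUM_LEAVES))
    print(validation_error(VALID))
```

Only `predict` and `NUM_LEAVES` depend on the tree; the three estimators stay the same for any tree over A, B, and C.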

RUBRIC

Excellent Quality: 95-100%

Introduction: 45-41 points

The background and significance of the problem and a clear statement of the research purpose is provided. The search history is mentioned.

Literature Support: 91-84 points

The background and significance of the problem and a clear statement of the research purpose is provided. The search history is mentioned.

Methodology: 58-53 points

Content is well-organized with headings for each slide and bulleted lists to group related material as needed. Use of font, color, graphics, effects, etc. to enhance readability and presentation content is excellent. Length requirements of 10 slides/pages or less is met.

Average Score: 50-85%

40-38 points

More depth/detail for the background and significance is needed, or the research detail is not clear. No search history information is provided.

83-76 points

Review of relevant theoretical literature is evident, but there is little integration of studies into concepts related to problem. Review is partially focused and organized. Supporting and opposing research are included. Summary of information presented is included. Conclusion may not contain a biblical integration.

52-49 points

Content is somewhat organized, but no structure is apparent. The use of font, color, graphics, effects, etc. is occasionally detracting to the presentation content. Length requirements may not be met.

Poor Quality: 0-45%

37-1 points

The background and/or significance are missing. No search history information is provided.

75-1 points

Review of relevant theoretical literature is evident, but there is no integration of studies into concepts related to problem. Review is partially focused and organized. Supporting and opposing research are not included in the summary of information presented. Conclusion does not contain a biblical integration.

48-1 points

There is no clear or logical organizational structure. No logical sequence is apparent. The use of font, color, graphics, effects, etc. often detracts from the presentation content. Length requirements may not be met.
