Chapter 3 Gini Calculations Problems Paper You must make your own calculations of the Gini Index and you must show your calculations in the answer document.Insufficient calculation steps will result in reduced points earned.
Instructions
This document was purposely created in Microsoft Word so you can enter your answers into the document. The problems focus on the use of the Gini Index.
YOUR ANSWERS MUST APPEAR WITHIN THE PROBLEM DOCUMENT.
10% WILL BE DEDUCTED IF YOU CREATE A NEW OR SEPARATE DOCUMENT.
10% WILL BE DEDUCTED IF YOU CREATE A TITLE PAGE TYPE OF DOCUMENT.
20% WILL BE IF YOU DO NOT SHOW YOUR CALCULATIONS FOR EACH ANSWER.
Please review the attached document and answer all 3 questions in the same document. Chapter 3 Problems
Instructions
This document was purposely created in Microsoft Word so you can enter your answers into
the document. The problems focus on the use of the Gini Index.
YOUR ANSWERS MUST APPEAR WITHIN THE PROBLEM DOCUMENT.
10% WILL BE DEDUCTED IF YOU CREATE A NEW OR SEPARATE DOCUMENT.
10% WILL BE DEDUCTED IF YOU CREATE A TITLE PAGE TYPE OF DOCUMENT.
20% WILL BE IF YOU DO NOT SHOW YOUR CALCULATIONS FOR EACH ANSWER.
You must make your own calculations of the Gini Index and you must show your calculations in
the answer document. Insufficient calculation steps will result in reduced points earned.
1. Use the following table to calculate your answers to the following questions.
a. What is the Gini index for the Customer ID?
b. Explain your answer in part (a).
c. What is the Gini index for the Male Gender?
d. What is the Gini index for the Female Gender?
e. Compute the weighted average for the Gender type?
f. Compute the Gini index for the Car Type attribute using multiway split.
Family car
Sports car
Luxury car
g. Compute the weighted average for the Car type.
h. Compute the Gini index for the Shirt Size attribute using multiway split.
Small shirt
Medium shirt
Large shirt
Extra Large shirt
i. Compute the weighted average for the Shirt Size.
j. Which attribute is better, Gender, Car Type, or Shirt Size? Why?
k. Explain why Customer ID should not be used as the attribute test condition even though it has the
lowest Gini.
2. Use the following table to calculate your answers to the following questions.
a. Compute the Gini index for the Outlook attribute using multiway split.
1) Rainy
2) Overcast
3) Sunny
4) Compute the weighted average for the Outlook?
b. Compute the Gini index for the Temp attribute using multiway split.
1) Hot
2) Mild
3) Cool
4) Compute the weighted average for the Temp?
c. Compute the Gini Index for Humidity.
1) High
2) Normal
d. Compute the Gini Index for Windy.
1) True
2) False
e. Which attribute is better, Outlook, Temp, Humidity, or Windy? Why?
3. Decision Trees
a. Considering the blue circles are one data set and the red stars are the second data set, calculate the
Gini Index for the data set.
b. It has been decided that the first split should be at X = 0.4
1) What is the impurity on the left?
2) What is the impurity on the right?
c. Next, split again at the point to the right of the rightmost blue circle, approximately 0.73
1) What is the impurity of the next left split, between X > 0.4 and X 0.73?
d. Finally, split the middle group at Y > 0.4
1) What is the impurity of each group/class?
Purchase answer to see full
attachment
Economic Debate- Progressive Income Tax For this Economic Debate, we are going to discuss the…
TOPIC: Going Global Discussion Thread 1 (initial post due Wednesday for full credit) Please note:…
Assignment Topic This week will culminate in the creation of a narrated PowerPoint to create…
The Assignment must be submitted on Blackboard (WORD format only) via allocated folder. Assignments submitted…
you need to post your 2-page information flier to share with your Final Project Group.…
discussion: Discuss the methods used at your company to measure and ensure quality products and…